Improving a neural network model for semantic segmentation of images of monitored objects in aerial photographs

Slyusar, V.; Protsenko, M.; Chernukha, A.; Melkin, V.; Petrova, O.; Kravtsov, M.; Velma, S.; Kosenko, N.; Sydorenko, O.; Sobol, Maksym

doi:https://doi.org/10.15587/1729-4061.2021.248390

Improving a neural network model for semantic segmentation of images of monitored objects in aerial photographs

dc.contributor.author	Slyusar, V.	en
dc.contributor.author	Protsenko, M.	en
dc.contributor.author	Chernukha, A.	en
dc.contributor.author	Melkin, V.	en
dc.contributor.author	Petrova, O.	en
dc.contributor.author	Kravtsov, M.	en
dc.contributor.author	Velma, S.	en
dc.contributor.author	Kosenko, N.	en
dc.contributor.author	Sydorenko, O.	en
dc.contributor.author	Sobol, Maksym	en
dc.date.accessioned	2022-01-14T12:46:02Z
dc.date.available	2022-01-14T12:46:02Z
dc.date.issued	2021
dc.description.abstract	This paper considers a model of the neural network for semantically segmenting the images of monitored objects on aerial photographs. Unmanned aerial vehicles monitor objects by analyzing (processing) aerial photographs and video streams. The results of aerial photography are processed by the operator in a manual mode; however, there are objective difficulties associated with the operator's handling a large number of aerial photographs, which is why it is advisable to automate this process. Analysis of the models showed that to perform the task of semantic segmentation of images of monitored objects on aerial photographs, the U-Net model (Germany), which is a convolutional neural network, is most suitable as a basic model. This model has been improved by using a wavelet layer and the optimal values of the model training parameters: speed (step) ‒ 0.001, the number of epochs ‒ 60, the optimization algorithm ‒ Adam. The training was conducted by a set of segmented images acquired from aerial photographs (with a resolution of 6,000×4,000 pixels) by the Image Labeler software in the mathematical programming environment MATLAB R2020b (USA). As a result, a new model for semantically segmenting the images of monitored objects on aerial photographs with the proposed name U-NetWavelet was built. The effectiveness of the improved model was investigated using an example of processing 80 aerial photographs. The accuracy, sensitivity, and segmentation error were selected as the main indicators of the model's efficiency. The use of a modified wavelet layer has made it possible to adapt the size of an aerial photograph to the parameters of the input layer of the neural network, to improve the efficiency of image segmentation in aerial photographs; the application of a convolutional neural network has allowed this process to be automatic.	en
dc.identifier.citation	Improving a neural network model for semantic segmentation of images of monitored objects in aerial photographs / V. Slyusar [et al.] // Eastern-European Journal of Enterprise Technologies. – 2021. – Vol. 6, No. 2 (114). – P. 86-95.	en
dc.identifier.doi	https://doi.org/10.15587/1729-4061.2021.248390
dc.identifier.uri	https://repository.kpi.kharkov.ua/handle/KhPI-Press/55626
dc.language.iso	en
dc.publisher	PC Technology Center	en
dc.publisher	Ukrainian State University of Railway Transport
dc.subject	semantic segmentation of images	en
dc.subject	convolutional neural network	en
dc.subject	aerial photograph	en
dc.subject	unmanned aerial vehicle	en
dc.title	Improving a neural network model for semantic segmentation of images of monitored objects in aerial photographs	en
dc.type	Article	en

Файли

Контейнер файлів

Зараз показуємо 1 - 1 з 1

Назва:: EEJET_2021_6_2_Slyusar_Improving.pdf
Розмір:: 1.17 MB
Формат:: Adobe Portable Document Format
Опис:

Завантажити

Ліцензійна угода

Зараз показуємо 1 - 1 з 1

Назва:: license.txt
Розмір:: 11.25 KB
Формат:: Item-specific license agreed upon to submission
Опис:

Завантажити

Колекції

Кафедра "Інформатика та інтелектуальна власність"