Road segmentation in SAR/PolSAR images using Convolutional Neural Networks

Thomas P. Grandin; Anderson A. de Borba; Alejandro C. Frery; Maurício Marengoni

Authors

Thomas P. Grandin Mackenzie Presbyterian University
Anderson A. de Borba Mackenzie Presbyterian University
Alejandro C. Frery Victoria University of Wellington
Maurício Marengoni Albion College

Abstract

Synthetic aperture radar (SAR) and polarimetric aperture radar (PolSAR) images are remote sensing images captured from an aircraft or artificial satellite to produce topological scans of the earth’s surface. The advantage of using these images is that they can be acquired day or night, regardless of the weather. They are used primarily to study deforestation, glacier defrosting, urban growth, and the prevention of natural hazards [3]. Neural networks have successfully solved problems in artificial intelligence (AI) in the last two decades. Among all possible architectures, the Convolution Neural Networks (CNNs) architecture dominates the field of computer vision [2]. CNN is very adaptable and can precisely segment objects in images. This article’s primary approach was to build a synthetic dataset that could be used for training, validation, and testing. Then, the Speckle noise, inherent noise in SAR images, was introduced into the Massachusetts Road Dataset, an aerial optical image database, to create a database that simulates SAR images. We use this database to study how different CNN architectures perform when trained to detect roads. Our research methodology consists of building a simulated dataset based on the optical Massachusetts Roads Dataset [4] by adding the Speckle noise with a Gamma law represented by a univariate probability density function f_Z(z; μ, L) = L^L/(Γ(L)μ^L) z^L-1 exp {- (L/μ) z} 1_R+(z), where L > 0, μ > 0 is the mean, 1_A is the indicator function of the set A, and Γ(L) is the Gamma function. After this, we trained two CNN architectures: the U-Net architecture[5], which uses skip connections between the encoder and decoder blocks to transfer information, and the DeepLabV3[1] architecture, which uses Atrous Convolution, a technique that changes the kernel size to gather context from multiple scales. Fig. 1, and Fig. 2 show the results achieved until this research phase on an image chosen arbitrarily in the test dataset. By visual inspection of Fig. 1 and Fig. 2, and with results shown in Tab. 1, we can note that the U-net shows a slight advantage in road detection over DeepLabV3. We have ongoing work such as measuring road detection accuracy, adding real SAR images to the dataset, or using Generative Adversarial Networks (GANs) to transform optical images into SAR images. [...]

Downloads

Download data is not yet available.

References

S. Chen, X. Wei, and W. Zheng. “ASA-DRNet: An Improved Deeplabv3+ Framework for SAR Image Segmentation”. In: Electronics (2023).

S. Cong and Y. Zhou. “A review of convolutional neural network architectures and their optimizations”. In: Artificial Intelligence Review 56 (June 2022), pp. 1–65. doi: 10.1007/s10462-022-10213-5.

J. S. Lee and E. Pottier. Polarimetric Radar Imaging: From Basics to Applications. 1st ed. Boca Raton: CRC Press, 2009. isbn: 9781315219332.

V. Mnih. “Machine Learning for Aerial Image Labeling”. PhD thesis. University of Toronto, 2013.

O. Ronneberger, P. Fischer, and T. Brox. U-Net: Convolutional Networks for Biomedical Image Segmentation. 2015. arXiv: 1505.04597 [cs.CV]. url: https://arxiv.org/abs/1505.04597.

Road segmentation in SAR/PolSAR images using Convolutional Neural Networks

Authors

Abstract

Downloads

References

Downloads

Published

Issue

Section

issn

Developed By