A Lightweight CNN-Transformer Network with Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation
aut.relation.endpage | 1 | |
aut.relation.issue | 99 | |
aut.relation.journal | IEEE Transactions on Geoscience and Remote Sensing | |
aut.relation.startpage | 1 | |
aut.relation.volume | PP | |
dc.contributor.author | Lu, Wen | |
dc.contributor.author | Zhang, Zhiqi | |
dc.contributor.author | Nguyen, Minh | |
dc.date.accessioned | 2024-05-21T03:52:10Z | |
dc.date.available | 2024-05-21T03:52:10Z | |
dc.date.issued | 2024-04-04 | |
dc.description.abstract | Semantic segmentation is crucial for enabling autonomous flight and landing of low-altitude unmanned aerial vehicles (UAVs) and is indispensable for various intelligent applications. However, real-time semantic segmentation is a computationally intensive task because it involves pixel-wise classification, which renders conventional semantic segmentation networks impractical for deployment on embedded systems of limited hardware resources. Moreover, variations in flight height and object appearance increase the likelihood of misjudgment in segmentation results. To address these challenges, we propose an efficient approach consisting of a convolutional neural network (CNN)–Transformer network and an auxiliary loss. The encoder of the network integrates a newly designed module, which equally handles objects with varying scales. The decoder is composed of the innovative query–value squeeze axial transformer attention (QVSATA), which reduces computational complexity from quadratic in terms of image size to O(2C(H² + W²)), linear in terms of image size. By incorporating Laplacian operator convolution, the novel network-agnostic loss effectively captures intricate patterns, boundaries, and small objects. This enables extra penalization of misjudgments in these areas and compels the network to focus on objects that are challenging to distinguish. Our approach attains impressive accuracy when processing 4K resolution images in real time (15 FPS) on a mobile GPU. It demonstrates over 2× faster speed compared to representative lightweight networks, underscoring its suitability for onboard deployment. | |
dc.identifier.citation | IEEE Transactions on Geoscience and Remote Sensing, ISSN: 0196-2892 (Print); 1558-0644 (Online), Institute of Electrical and Electronics Engineers (IEEE), PP(99), 1-1. doi: 10.1109/tgrs.2024.3385318 | |
dc.identifier.doi | 10.1109/tgrs.2024.3385318 | |
dc.identifier.issn | 0196-2892 | |
dc.identifier.issn | 1558-0644 | |
dc.identifier.uri | http://hdl.handle.net/10292/17573 | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.relation.uri | https://ieeexplore.ieee.org/document/10491345 | |
dc.rights | © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
dc.rights.accessrights | OpenAccess | |
dc.subject | 40 Engineering | |
dc.subject | 0404 Geophysics | |
dc.subject | 0906 Electrical and Electronic Engineering | |
dc.subject | 0909 Geomatic Engineering | |
dc.subject | Geological & Geomatics Engineering | |
dc.subject | 37 Earth sciences | |
dc.title | A Lightweight CNN-Transformer Network with Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation | |
dc.type | Journal Article | |
pubs.elements-id | 544428 |
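
The abstract describes an auxiliary loss that uses Laplacian operator convolution to penalize mistakes near boundaries and small objects more heavily. The record does not give the kernel, the weighting scheme, or the loss formula, so the following is only a minimal NumPy sketch of the general idea, not the authors' implementation. The 4-neighbour kernel, the function names `laplacian_edge_mask` and `edge_weighted_cross_entropy`, and the `edge_weight` parameter are all assumptions made for illustration.

```python
import numpy as np

# 4-neighbour discrete Laplacian kernel (an assumption; the abstract
# does not state which kernel the paper uses).
LAPLACIAN = np.array([[0, 1, 0],
                      [1, -4, 1],
                      [0, 1, 0]], dtype=np.float64)


def laplacian_edge_mask(labels: np.ndarray, num_classes: int) -> np.ndarray:
    """Mark pixels whose 4-neighbourhood contains a different class.

    labels: integer array of shape (H, W) with values in [0, num_classes).
    Returns a boolean array of shape (H, W).
    """
    one_hot = np.eye(num_classes)[labels]                      # (H, W, K)
    padded = np.pad(one_hot, ((1, 1), (1, 1), (0, 0)), mode="edge")
    h, w = labels.shape
    response = np.zeros_like(one_hot)
    # Explicit 3x3 convolution applied to every one-hot class channel.
    for dy in range(3):
        for dx in range(3):
            response += LAPLACIAN[dy, dx] * padded[dy:dy + h, dx:dx + w, :]
    # A non-zero response in any channel means a class change nearby.
    return np.abs(response).sum(axis=-1) > 0


def edge_weighted_cross_entropy(probs: np.ndarray,
                                labels: np.ndarray,
                                edge_weight: float = 2.0) -> float:
    """Cross-entropy where boundary pixels count `edge_weight` times.

    probs: predicted class probabilities of shape (H, W, K).
    labels: integer ground-truth array of shape (H, W).
    """
    num_classes = probs.shape[-1]
    mask = laplacian_edge_mask(labels, num_classes)
    picked = np.take_along_axis(probs, labels[..., None], axis=-1)[..., 0]
    ce = -np.log(np.clip(picked, 1e-12, 1.0))
    weights = np.where(mask, edge_weight, 1.0)
    return float((weights * ce).sum() / weights.sum())
```

With `edge_weight` set to 1.0 this reduces to the ordinary mean cross-entropy. Larger values shift the loss toward pixels on class boundaries, which is the kind of extra penalization the abstract describes.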
Files
Original bundle
- Name: A_Lightweight_CNN-Transformer_Network_with_Laplacian_Loss_for_Low-altitude_UAV_Imagery_Semantic_Segmentation.pdf
- Size: 24.52 MB
- Format: Adobe Portable Document Format
- Description: Journal article