TY - GEN
T1 - AA-RPN
T2 - 31st International Conference on Neural Information Processing, ICONIP 2024
AU - Cheng, Shuishui
AU - Shi, Qingxuan
AU - Lim, Nick Jin Sean
AU - Bifet, Albert
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - The performance of two-stage remote sensing object detection methods largely depends on the quality of oriented region proposals generated in the first stage. However, most existing methods’ region proposal networks face significant challenges: 1) Manually designed anchor boxes struggle to balance recall and the number of anchors due to variations in scale, aspect ratio, and orientation of targets in remote sensing images. This also necessitates separate anchor box designs for different datasets, causing inconvenience. 2) Horizontal bounding boxes fail to capture the orientation of objects, leading to misclassification of some negative samples as positive in label assignment, thereby impairing training. Additionally, static label assignment strategies can cause misalignment between classification and localization tasks. To address these issues, we propose an Adaptive Anchor-based Region Proposal Network (AA-RPN). Without the need for predefined anchor boxes, our approach dynamically generates Adaptive Anchor Boxes (AAB) based on the target’s scale, aspect ratio, and orientation during training. The proposed Dynamic Label Assignment (DLA) strategy assigns labels dynamically based on center priors and network outputs. Additionally, we introduce an Adaptive Feature Pyramid Network (AFPN) to provide specific scale contexts for different targets. Using Oriented R-CNN as a baseline, extensive experiments on public benchmarks demonstrate significant improvements in accuracy and computational efficiency.
AB - The performance of two-stage remote sensing object detection methods largely depends on the quality of oriented region proposals generated in the first stage. However, most existing methods’ region proposal networks face significant challenges: 1) Manually designed anchor boxes struggle to balance recall and the number of anchors due to variations in scale, aspect ratio, and orientation of targets in remote sensing images. This also necessitates separate anchor box designs for different datasets, causing inconvenience. 2) Horizontal bounding boxes fail to capture the orientation of objects, leading to misclassification of some negative samples as positive in label assignment, thereby impairing training. Additionally, static label assignment strategies can cause misalignment between classification and localization tasks. To address these issues, we propose an Adaptive Anchor-based Region Proposal Network (AA-RPN). Without the need for predefined anchor boxes, our approach dynamically generates Adaptive Anchor Boxes (AAB) based on the target’s scale, aspect ratio, and orientation during training. The proposed Dynamic Label Assignment (DLA) strategy assigns labels dynamically based on center priors and network outputs. Additionally, we introduce an Adaptive Feature Pyramid Network (AFPN) to provide specific scale contexts for different targets. Using Oriented R-CNN as a baseline, extensive experiments on public benchmarks demonstrate significant improvements in accuracy and computational efficiency.
KW - Adaptive anchor boxes
KW - Adaptive feature pyramid network
KW - Dynamic label assignment
KW - Remote sensing object detection
UR - https://www.scopus.com/pages/publications/105008663911
U2 - 10.1007/978-981-96-6596-9_10
DO - 10.1007/978-981-96-6596-9_10
M3 - Conference contribution
AN - SCOPUS:105008663911
SN - 9789819665983
T3 - Lecture Notes in Computer Science
SP - 138
EP - 152
BT - Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
A2 - Mahmud, Mufti
A2 - Doborjeh, Maryam
A2 - Wong, Kevin
A2 - Leung, Andrew Chi Sing
A2 - Doborjeh, Zohreh
A2 - Tanveer, M.
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 2 December 2024 through 6 December 2024
ER -