• 中国科技核心期刊
  • Scopus收录期刊
  • DOAJ收录期刊
  • JST收录期刊
  • Euro Pub收录期刊
Turn off MathJax
Article Contents
ZHAO Yan, LI Jinxin, JIA Rujian. MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies[J]. Journal of Unmanned Undersea Systems. doi: 10.11993/j.issn.2096-3920.2026-0007
Citation: ZHAO Yan, LI Jinxin, JIA Rujian. MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies[J]. Journal of Unmanned Undersea Systems. doi: 10.11993/j.issn.2096-3920.2026-0007

MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies

doi: 10.11993/j.issn.2096-3920.2026-0007
  • Received Date: 2026-01-08
  • Accepted Date: 2026-03-04
  • Rev Recd Date: 2026-02-08
  • Available Online: 2026-05-19
  • To address the problem of scarce feature information for underwater targets, this paper proposes a feature pyramid mapping mechanism combined with global attention. This mechanism maps each proposal box to different feature layers, resulting in four feature layers of consistent size and complementary information after region-of-interest pooling. Global attention is then used to achieve feature fusion, fully utilizing the feature information from each layer and effectively alleviating the problem of feature scarcity for underwater targets. To address the class imbalance problem in underwater datasets, a copy-paste class balancing strategy is designed to enhance the neural network's attention to scarce categories such as sea cucumbers, starfish, and scallops. To address the issue of insufficient penalty in the loss function leading to decreased detection accuracy, the normalized distance between the predicted and target boxes is introduced as a penalty term in the smoothed L1 loss function, significantly improving the localization accuracy of underwater multi-scale targets. Experimental results show that on the National Underwater Robotics Competition dataset, the proposed method achieves a recognition accuracy of 81.93%, a 5.71% improvement over the baseline model Faster R-CNN, effectively reducing false negatives and false positives in complex underwater environments.

     

  • loading
  • [1]
    魏楠, 杨万扣, 周伟杰, 等. 基于小波变换特征增强的水下目标检测方法[J]. 水下无人系统学报, 2025, 33(2): 204-211.

    Wei N, Yang W K, Zhou W J, et al. Underwater object detection method with enhanced wavelet transform features[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 204-211.
    [2]
    焦文沛, 李杰, 张春燕, 等. 声呐图像智能感知算法综述[J]. 水下无人系统学报, 2025, 33(3): 559-572.

    Jiao W P, Li J, Zhang C Y, et al. Intelligent perception algorithms for sonar images: A review[J]. Journal of Unmanned Undersea Systems, 2025, 33(3): 559-572.
    [3]
    Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2014: 580-587.
    [4]
    Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015: 1440-1448.
    [5]
    Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. Advances in neural information processing systems, 2015, 28: 1440-1448. doi: 10.1109/tpami.2016.2577031
    [6]
    Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector[C]//European Conference on Computer Vision, 2016: 21-37.
    [7]
    Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 779-788.
    [8]
    张贺民, 王欣宇, 温显斌, 等. REL-YOLO: 融合边缘增强与多尺度注意力的水下目标检测方法[J/OL]. 光电子·激光. [2026-01-31]. https://link.cnki.net/urlid/12.1182.o4.20260130.1241.004.
    [9]
    梁秀满, 张腾, 于海峰, 等. 基于改进YOLOv8的水下目标检测算法[J]. 计算机工程与设计, 2025, 46(9): 2599-2607.

    Liang X M, Zhang T, YU H F, et al. Underwater object detection algorithm based on improved YOLOv8[J]. Computer Engineering and Design, 2025, 46(9): 2599-2607.
    [10]
    王若男, 冯春, 赵政钦, 等. 水下低分辨率小目标检测算法分析[J]. 船舶工程, 2026, 48(2): 98-108. doi: 10.13788/j.cnki.cbgc.2026.02.12

    Wang R N, Feng C, Zhao Z Q, et al. Analysis of detection algorithm for underwater low-resolution small targets[J]. Ship Engineering, 2026, 48(2): 98-108. doi: 10.13788/j.cnki.cbgc.2026.02.12
    [11]
    李海龙, 黄孙港, 饶兴昌. 跨尺度特征融合的自适应水下目标检测算法[J]. 电子测量技术, 2025, 48(13): 129-138.

    Li J L, Huang S G, Rao X C. Adaptive cross-scale feature fusion for underwater object detection algorithm[J]. Electronic Measurement Technology, 2025, 48(13): 129-138.
    [12]
    沈学利, 李东峰. 频域重标定与自适应稀疏金字塔水下实时目标检测[J/OL]. 激光与光电子学进展, [2026-01-31]. https://link.cnki.net/urlid/31.1690.TN.20260121.1736.048.
    [13]
    张红瑞, 冯威铭, 杨潞霞, 等. 基于YOLO11改进的水下小目标检测算法CSAF-YOLO[J/OL]. 计算机应用, [2026-01-31]. https://link.cnki.net/urlid/51.1307.TP.20260108.1256.004.
    [14]
    HE K, Gkioxari G, Dollár P, et al. Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2961-2969.
    [15]
    HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778.
    [16]
    Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 7794-7803.
    [17]
    Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 7132-7141.
    [18]
    Glorot X, Bordes A, Bengio Y. Deep sparse rectifier neural networks[C]//Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011: 315-323.
    [19]
    Ioffe S, SzegedY C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[C]//International conference on machine learning, 2015: 448-456.
    [20]
    Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: Optimal speed and accuracy of object detection[PP/OL]. V1. arXiv (2020-04-23)[2026-02-07]. https://doi.org/10.48550/arXiv.2004.10934.
    [21]
    Glenn J. YOLOv5·Github repository[EB/OL]. (2020-06-09)[2021-07-09]. https: //github. com/ultralytics/yolov5.
    [22]
    Zhang S, Wen L, Bian X, et al. Single-shot refinement neural network for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 4203-4212.
    [23]
    Fan B, Chen W, Cong Y, et al. Dual refinement underwater object detection network[C]//European Conference on Computer Vision, 2020: 275-291.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(8)  / Tables(7)

    Article Metrics

    Article Views(37) PDF Downloads(21) Cited by()
    Proportional views
    Related
    Service
    Subscribe

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return