• 中国科技核心期刊
  • Scopus收录期刊
  • DOAJ收录期刊
  • JST收录期刊
  • Euro Pub收录期刊

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于MFLM-FPN 与 GAFF 的水下目标检测算法及类别平衡策略

赵岩 李金鑫 贾如建

赵岩, 李金鑫, 贾如建. 基于MFLM-FPN 与 GAFF 的水下目标检测算法及类别平衡策略[J]. 水下无人系统学报, 2026, 34(3): 1-11 doi: 10.11993/j.issn.2096-3920.2026-0007
引用本文: 赵岩, 李金鑫, 贾如建. 基于MFLM-FPN 与 GAFF 的水下目标检测算法及类别平衡策略[J]. 水下无人系统学报, 2026, 34(3): 1-11 doi: 10.11993/j.issn.2096-3920.2026-0007
ZHAO Yan, LI Jinxin, JIA Rujian. MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies[J]. Journal of Unmanned Undersea Systems. doi: 10.11993/j.issn.2096-3920.2026-0007
Citation: ZHAO Yan, LI Jinxin, JIA Rujian. MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies[J]. Journal of Unmanned Undersea Systems. doi: 10.11993/j.issn.2096-3920.2026-0007

基于MFLM-FPN 与 GAFF 的水下目标检测算法及类别平衡策略

doi: 10.11993/j.issn.2096-3920.2026-0007
详细信息
    作者简介:

    赵岩:赵 岩(1993-), 男, 硕士, 人工智能中级工程师, 主要研究方向为计算机视觉、目标检测、语义分割及无监督缺陷检测

  • 中图分类号: TJ630.34; U674.941

MFLM-FPN and GAFF-driven Underwater Target Detection Algorithms and Class Balancing Strategies

  • 摘要: 针对水下目标特征信息匮乏的问题, 文中提出多特征层映射特征金字塔与全局注意力特征融合机制。该机制将每个建议框分别映射至不同特征层, 经感兴趣区域池化后得到4个尺寸一致、信息互补的特征层, 再通过全局注意力实现特征融合, 可充分利用各层特征信息, 有效缓解水下目标特征稀缺的问题。针对水下数据集类别不平衡问题, 设计复制粘贴类别平衡策略, 提升神经网络对海参、海星、扇贝等稀缺类别的关注程度。针对损失函数惩罚力度不足导致检测精度下降的问题, 在平滑L1损失函数中引入预测框与目标框的归一化距离作为惩罚项, 显著提高水下多尺度目标的定位精度。实验结果表明, 在全国水下机器人大赛数据集上, 所提方法的识别准确率达81.93%, 相较于基线模型Faster R-CNN提升5.71%, 有效改善了水下复杂环境下目标的漏检与误检现象。

     

  • 图  1  特征金字塔结构

    Figure  1.  Feature pyramid structure

    图  2  多特征融合水下目标检测算法

    Figure  2.  Underwater target detection algorithm based on multi-feature fusion

    图  3  GAFF特征融合方案

    Figure  3.  GAFF feature fusion scheme

    图  4  数据增强

    Figure  4.  data enhancement

    图  5  预测框与目标框距离示意图

    Figure  5.  The distance between the predicted and target boxes

    图  6  增强后数据展示

    Figure  6.  Enhanced data presentation

    图  7  检测效果可视化

    Figure  7.  Visualization of detection effect

    图  8  模型可视化对比

    Figure  8.  Visual comparison between models

    表  1  增强前后数据集对比

    Table  1.   Comparison of datasets before and after enhancement

    类别原始数据增强后
    海参5 53716 972
    海胆2 234322 343
    海星6 84118 280
    扇贝6 72018 125
    下载: 导出CSV

    表  2  不同融合方式精度对比

    Table  2.   Accuracy comparison of different fusion methods %

    ResNet50+FPN 相加 拼接 GAFF mAPsmallIoU=
    0.50:0.95
    mAP mediumIoU=
    0.50:0.95
    mAPlargeIoU=
    0.50:0.95
    mAPallIoU=
    0.50
    18.0 35.1 45.6 75.07
    19.0 37.4 47.8 77.54
    18.2 37.0 47.1 77.12
    19.8 37.9 48.6 78.46
    下载: 导出CSV

    表  3  消融实验

    Table  3.   Ablation experiment %

    算法及模型海胆海参扇贝海星精确率召回率mAPall
    (IoU=0.50)
    Faster R-CNN86.2364.0769.1280.8678.273.575.07
    188.5065.8071.3081.4080.175.876.75
    290.7067.1372.5883.4382.377.578.46
    387.2364.8770.9381.7379.474.876.19
    490.3068.4374.3885.4083.978.679.62
    591.170.7478.1087.7885.280.381.93
    下载: 导出CSV

    表  4  不同检测算法精度对比

    Table  4.   Comparison of accuracy of different detection algorithms %

    算法海胆海参扇贝海星mAParea=all
    (IoU=0.50)
    YOLOv488.6061.1066.8085.1075.40
    YOLOv586.6065.8071.0086.6077.50
    SA-FPN74.1074.2483.6775.9676.99
    RefineDet86.1067.1071.8081.1071.80
    FERNet92.0071.9052.7082.5074.70
    YOLOv11n87.9069.8072.7081.878.05
    DETR88.6071.1075.2080.978.95
    Faster R-CNN86.8364.6769.7281.4676.82
    文中算法89.8077.3776.9083.7381.93
    下载: 导出CSV

    表  5  简单场景下单张水下图像的小目标检测数量

    Table  5.   Small object detection counts on a single underwater image in a simple scene

    算法海胆海参扇贝海星
    真实标签33+2(漏标)02+3(漏标)
    YOLOv4[20]5604
    YOLOv5[21]3301
    SA-FPN4404
    RefineDet[23]4303
    FERNet[24]4403
    Faster R-CNN4404
    文中算法5405
    下载: 导出CSV

    表  6  复杂场景下水下图像的多类别检测总数

    Table  6.   Total detection counts per category on underwater images in complex scenes

    算法海胆海参扇贝海星
    真实标签17+1
    (漏标)
    6271+1
    (漏标)
    YOLOv4164142
    YOLOv5164222
    SA-FPN163241
    RefineDet186262
    FERNet184231
    Faster R-CNN181302
    文中算法184242
    下载: 导出CSV

    表  7  基于TrashCan数据集的泛化性实验

    Table  7.   Generalization experiments based on the TrashCan dataset %

    算法精确率召回率mAParea=all
    (IoU=0.50)
    Faster R-CNN87.174.287.1
    文中算法92.380.193.4
    下载: 导出CSV
  • [1] 魏楠, 杨万扣, 周伟杰, 等. 基于小波变换特征增强的水下目标检测方法[J]. 水下无人系统学报, 2025, 33(2): 204-211.

    Wei N, Yang W K, Zhou W J, et al. Underwater object detection method with enhanced wavelet transform features[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 204-211.
    [2] 焦文沛, 李杰, 张春燕, 等. 声呐图像智能感知算法综述[J]. 水下无人系统学报, 2025, 33(3): 559-572.

    Jiao W P, Li J, Zhang C Y, et al. Intelligent perception algorithms for sonar images: A review[J]. Journal of Unmanned Undersea Systems, 2025, 33(3): 559-572.
    [3] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2014: 580-587.
    [4] Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015: 1440-1448.
    [5] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. Advances in neural information processing systems, 2015, 28: 1440-1448. doi: 10.1109/tpami.2016.2577031
    [6] Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector[C]//European Conference on Computer Vision, 2016: 21-37.
    [7] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2016: 779-788.
    [8] 张贺民, 王欣宇, 温显斌, 等. REL-YOLO: 融合边缘增强与多尺度注意力的水下目标检测方法[J/OL]. 光电子·激光. [2026-01-31]. https://link.cnki.net/urlid/12.1182.o4.20260130.1241.004.
    [9] 梁秀满, 张腾, 于海峰, 等. 基于改进YOLOv8的水下目标检测算法[J]. 计算机工程与设计, 2025, 46(9): 2599-2607.

    Liang X M, Zhang T, YU H F, et al. Underwater object detection algorithm based on improved YOLOv8[J]. Computer Engineering and Design, 2025, 46(9): 2599-2607.
    [10] 王若男, 冯春, 赵政钦, 等. 水下低分辨率小目标检测算法分析[J]. 船舶工程, 2026, 48(2): 98-108. doi: 10.13788/j.cnki.cbgc.2026.02.12

    Wang R N, Feng C, Zhao Z Q, et al. Analysis of detection algorithm for underwater low-resolution small targets[J]. Ship Engineering, 2026, 48(2): 98-108. doi: 10.13788/j.cnki.cbgc.2026.02.12
    [11] 李海龙, 黄孙港, 饶兴昌. 跨尺度特征融合的自适应水下目标检测算法[J]. 电子测量技术, 2025, 48(13): 129-138.

    Li J L, Huang S G, Rao X C. Adaptive cross-scale feature fusion for underwater object detection algorithm[J]. Electronic Measurement Technology, 2025, 48(13): 129-138.
    [12] 沈学利, 李东峰. 频域重标定与自适应稀疏金字塔水下实时目标检测[J/OL]. 激光与光电子学进展, [2026-01-31]. https://link.cnki.net/urlid/31.1690.TN.20260121.1736.048.
    [13] 张红瑞, 冯威铭, 杨潞霞, 等. 基于YOLO11改进的水下小目标检测算法CSAF-YOLO[J/OL]. 计算机应用, [2026-01-31]. https://link.cnki.net/urlid/51.1307.TP.20260108.1256.004.
    [14] HE K, Gkioxari G, Dollár P, et al. Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2961-2969.
    [15] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778.
    [16] Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 7794-7803.
    [17] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 7132-7141.
    [18] Glorot X, Bordes A, Bengio Y. Deep sparse rectifier neural networks[C]//Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011: 315-323.
    [19] Ioffe S, SzegedY C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[C]//International conference on machine learning, 2015: 448-456.
    [20] Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: Optimal speed and accuracy of object detection[PP/OL]. V1. arXiv (2020-04-23)[2026-02-07]. https://doi.org/10.48550/arXiv.2004.10934.
    [21] Glenn J. YOLOv5·Github repository[EB/OL]. (2020-06-09)[2021-07-09]. https: //github. com/ultralytics/yolov5.
    [22] Zhang S, Wen L, Bian X, et al. Single-shot refinement neural network for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 4203-4212.
    [23] Fan B, Chen W, Cong Y, et al. Dual refinement underwater object detection network[C]//European Conference on Computer Vision, 2020: 275-291.
  • 加载中
计量
  • 文章访问数:  0
  • HTML全文浏览量:  0
  • PDF下载量:  0
  • 被引次数: 0
出版历程
  • 收稿日期:  2026-01-08
  • 修回日期:  2026-02-08
  • 录用日期:  2026-03-04
  • 网络出版日期:  2026-05-19
图(8) / 表(7)

目录

    /

    返回文章
    返回
    服务号
    订阅号