• 中国科技核心期刊
  • JST收录期刊
  • Scopus收录期刊
  • DOAJ收录期刊
LI Junyi, HE Mingle, LIU Chang, XU Yong. Underwater Visual Multi-Target Tracking Algorithm Integrating Re-parameterization and Attention Mechanism[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 249-260. doi: 10.11993/j.issn.2096-3920.2025-0012
Citation: LI Junyi, HE Mingle, LIU Chang, XU Yong. Underwater Visual Multi-Target Tracking Algorithm Integrating Re-parameterization and Attention Mechanism[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 249-260. doi: 10.11993/j.issn.2096-3920.2025-0012

Underwater Visual Multi-Target Tracking Algorithm Integrating Re-parameterization and Attention Mechanism

doi: 10.11993/j.issn.2096-3920.2025-0012
  • Received Date: 2025-01-15
  • Accepted Date: 2025-02-25
  • Rev Recd Date: 2025-02-20
  • Available Online: 2025-03-10
  • The complex underwater environment can severely impact the stability of imaging devices and the quality of captured images, posing significant challenges for visual multi-target tracking in underwater unmanned autonomous systems. To address the difficulties arising from underwater camera jitter and image degradation, this paper proposed an underwater visual multi-target tracking algorithm that integrated re-parameterization and attention mechanisms, specifically tailored for underwater unmanned autonomous systems. First, to tackle the diversity of underwater targets and image degradation, an improved YOLOv8 algorithm based on re-parameterization and attention mechanism(RA-YOLOv8) was proposed. This algorithm effectively enhanced the network’s multi-scale feature extraction capability and improved the detection accuracy of the model by integrating a structurally re-parameterized multi-scale feature extraction convolutional structure(DBB-RFAConv) and the AMSCE-attention mechanism. Then, to address the challenges of real-time target tracking caused by underwater camera jitter, an Inner-PIoUv2-enhanced ByteTrack algorithm(IP2-ByteTrack) was proposed. Inner-PIoUv2 was used as the similarity measure in the matching process of the tracking algorithm, which enhanced the model’s performance in underwater detection and tracking tasks, improving the accuracy of tracking trajectory matching. Finally, based on the RA-YOLOv8 and IP2-ByteTrack algorithms, an underwater visual multi-target tracking algorithm that integrated re-parameterization and attention mechanisms for underwater autonomous systems was proposed. Experimental results show that the proposed algorithm exhibits excellent performance in complex underwater environments and can effectively address the shortcomings of existing methods in underwater multi-target tracking.

     

  • [1]
    徐正兴, 诸云, 吴祎楠. 基于改进Sigma点的无迹卡尔曼滤波水下目标跟踪算法[J]. 无人系统技术, 2023, 6(4): 22-30.
    [2]
    张博宇, 齐滨, 王晋晋, 等. 密集杂波背景下的水下多目标跟踪方法[J]. 导航定位与授时, 2023, 10(5): 31-39.
    [3]
    王学敏, 于洪波, 张翔宇, 等. 基于Hough变换检测前跟踪的水下多目标被动检测方法[J]. 兵工学报, 2023, 44(7): 2114-2121.
    [4]
    郑繁亭, 邢关生. 基于改进DeepSort的行人多目标跟踪算法[J]. 现代电子技术, 2023, 46(5): 40-46.
    [5]
    何水龙, 张靖佳, 张林俊, 等. 基于Transformer改进的YOLOv5+DeepSORT的车辆跟踪算法[J]. 汽车技术, 2024(7): 9-16.
    [6]
    陈辉, 杜双燕, 连峰, 等. Track-MT3: 一种基于Transformer的新型多目标跟踪算法[J]. 雷达学报, 2024, 13 (6): 1202-1219. doi: 10.12000/JR24164
    [7]
    赵海翔, 崔鸿武, 黄桢铭, 等. 基于Bytetrack的多目标跟踪算法在斑马鱼毒性行为识别中的应用[J]. 渔业科学进展, 2024, 45(2): 136-149.
    [8]
    REDMON J. You only look once: Unified, real-time object detection[C]//Computer Vision & Pattern Recognition. Las Vegas, USA: IEEE, 2016.
    [9]
    JIAN M W, LIU X Y, LUO H J, et al. Underwater image processing and analysis: A review[J]. Signal Processing: Image Communication, 2021, 91(1): 116088.
    [10]
    CHEN S, CHEN S, SHAN W, et al. A self-supervised underwater image denoising method based on pseudo-siamese neural network[C]//2023 5th International Conference on Robotics and Computer Vision. Nanjing, China: ICRCV, 2023: 130-134.
    [11]
    ZHANG X, LIU C, YANG D, et al. RFAConv: Innovating spatial attention and standard convolutional operation[EB/OL]. (2023-04-06) [2025-02-12]. https://doi.org/10.48550/arXiv.2304.03198.
    [12]
    DING X, ZHANG X, HAN J, et al. Diverse branch block: Building a convolution as an inception-like unit[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, Tennessee, USA: IEEE, 2021: 10886-10895.
    [13]
    OUYANG D, HE S, ZHANG G, et al. Efficient multi-scale attention module with cross-spatial learning[C]//ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP). Rhodes, Greece: IEEE, 2023: 1-5.
    [14]
    NARAYANAN M. SENetV2: Aggregated dense layer for channelwise and global representations[EB/OL]. (2023-11-17) [2025-02-12]. https://doi.org/10.48550/arXiv.2311.10807.
    [15]
    CAI X, LAI Q, WANG Y, et al. Poly kernel inception network for remote sensing detection[EB/OL]. (2024-03-10)[2025-02-12]. https://doi.org/10.48550/arXiv.2403.06258.
    [16]
    ZHANG Y, SUN P, JIANG Y, et al. Bytetrack: Multi-object tracking by associating every detection box[C]//European conference on computer vision. Tel Aviv, Israel: ECCV, 2022: 1-21.
    [17]
    LIU C, WANG K, LI Q, et al. Powerful-IoU: More straight forward and faster bounding box regression loss with a nonmonotonic focusing mechanism[J]. Neural Networks, 2024, 170(1): 276-284.
    [18]
    ZHANG H, XU C, ZHANG S. Inner-iou: More effective intersection over union loss with auxiliary bounding box[EB/OL]. (2023-11-06) [2025-02-12]. https://doi.org/10.48550/arXiv.2311.02877.
    [19]
    ZHANG X, ZENG H, LIU X, et al. In situ holothurian noncontact counting system: A general framework for holothurian counting[J]. IEEE Access, 2020, 8: 210041-210053. doi: 10.1109/ACCESS.2020.3038643
    [20]
    YANG L, ZHANG R Y, LI L, et al. A simple, parameter-free attention module for convolutional neural networks[C]//International conference on machine learning.Vienna, Austria: PMLR, 2021: 11863-11874.
    [21]
    WOO S, PARK J, LEE J Y, et al. Cbam: Convolutional block attention module[C]//Proceedings of the European conference on computer vision. Munich, Germany: ECCV, 2018: 3-19.
    [22]
    SUN S, REN W, GAO X, et al. Restoring images in adverse weather conditions via histogram transformer[C]//European Conference on Computer Vision. Shanghai, China: ECCV, 2025: 111-129.
    [23]
    WANG Y, LI Y, WANG G, et al. Multi-scale attention network for single image super-resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 2024.
    [24]
    GUO M H, LU C Z, HOU Q, et al. Segnext: Rethinking convolutional attention design for semantic segmentation[J]. Advances in Neural Information Processing Systems, 2022, 35: 1140-1156.
    [25]
    HUANG H, CHEN Z, ZOU Y, et al. Channel prior convolutional attention for medical image segmentation[EB/OL]. (2023-11-06) [2025-02-12]. https://doi.org/10.48550/arXiv.2306.05196
    [26]
    MAGGIOLINO G, AHMAD A, CAO J, et al. Deep oc-sort: Multi-pedestrian tracking by adaptive re-identification[C]//2023 IEEE International Conference on Image Processing(ICIP). KL, Malaysia: IEEE, 2023: 3025-3029.
  • Relative Articles

    [1]HAN Jingqi, NAN Mingxing, ZHANG Peng, CHEN Jiajie, HU Zhengliang. A Sonar Image Target Detection Method with Low False Alarm Rate Based on Self-Trained YOLO11 Model[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 238-248. doi: 10.11993/j.issn.2096-3920.2024-0165
    [2]HU Qianwei, WANG Daiwei, LI Renjie, YU Xiaofan, KANG Bin, SU Ruoyu. Underwater Visual Object Tracking Method Based on Scene Perception[J]. Journal of Unmanned Undersea Systems, 2025, 33(2): 212-219, 290. doi: 10.11993/j.issn.2096-3920.2025-0007
    [3]GAO Jian, HE Yaozhen, CHEN Yimin, ZHANG Yuanxu, YANG Xubo, LI Yufeng, ZHANG Zhenchi. Review of Visual Control Technology for Undersea Vehicles[J]. Journal of Unmanned Undersea Systems, 2024, 32(2): 282-294. doi: 10.11993/j.issn.2096-3920.2023-0061
    [4]LI Bao-qi, REN Lu-lu, CHEN Fa, QIAN Bin, HUANG Hai-ning. A Method of Erect Rail Barricade Recognition Based on Forward-Looking 3D Sonar[J]. Journal of Unmanned Undersea Systems, 2022, 30(6): 747-753. doi: 10.11993/j.issn.2096-3920.2022-0016
    [5]HAN Ze-kai, ZHU Xing-hua, HAN Xiao-jun, SUN Kai, LIU Xiao-yu. Visual Guidance Algorithm for AUV Recovery Based on CNN Object Tracking[J]. Journal of Unmanned Undersea Systems, 2022, 30(6): 801-808. doi: 10.11993/j.issn.2096-3920.2022-0031
    [6]ZHENG Yi, WANG Ming-zhou. Underwater Bearings-Only Short Time Target Tracking Based on Initial Value Updating[J]. Journal of Unmanned Undersea Systems, 2021, 29(2): 189-195. doi: 10.11993/j.issn.2096-3920.2021.02.009
    [7]ZHANG Wei, PAN Jun, GONG Peng, LI Zi-xuan, GUO Shuai-ke, GAO Sai-bo. Monocular Vision Guided Lamp Array Tracking Method for the UUV Recovery Process[J]. Journal of Unmanned Undersea Systems, 2021, 29(4): 435-441. doi: 10.11993/j.issn.2096-3920.2021.04.010
    [8]LIU Feng, LIANG Xu, MIAO Yi-rani, TU Chao-hua, ZHAO Yan-kai. Parameterized Analysis and Optimization of Undersea Vehicle Resistance[J]. Journal of Unmanned Undersea Systems, 2020, 28(5): 526-531. doi: 10.11993/j.issn.2096-3920.2020.05.008
    [9]SU Jun, LI Ya-an, CHEN Xiao, ZHAO Zhen-yi. Data Association Algorithm for Multi-target Tracking of Underwater Bearings-only Systems with Double Observation Stations[J]. Journal of Unmanned Undersea Systems, 2018, 26(2): 115-121. doi: 10.11993/j.issn.2096-3920.2018.02.003
    [10]Long-xiang, YU Han-jun, SHENG Xue-li, HAN Xiao, HAO Hao-yan, CHEN Yang. Underwater Multi-Target Tracking Based on Collaborative Detection Data Fusion[J]. Journal of Unmanned Undersea Systems, 2018, 26(5): 387-394. doi: 10.11993/j.issn.2096-3920.2018.05.002
    [11]ZHANG Si-yu, HE Xin-yi, ZHANG Chi, ZHU Lin, CHEN Shuang. Present Situation and Prospect of Underwater Multi-Target Tracking Technologies[J]. Journal of Unmanned Undersea Systems, 2018, 26(6): 511-520. doi: 10.11993/j.issn.2096-3920.2018.06.001
    [12]ZHANG Shen-rong, CAI Wei-jun, YANG Chun-wu, CUI Xin-shan. A Parameterization Method for Airfoil Based on Composite Mapping Method and B-Spline[J]. Journal of Unmanned Undersea Systems, 2015, 23(3): 161-165. doi: 10.11993/j.issn.1673-1948.2015.03.001
    [13]GAO Wen-juan, LI Ya-an, CHEN Xiao, CHEN Zhi-guang. Application of IMM to Underwater Maneuver Target Tracking[J]. Journal of Unmanned Undersea Systems, 2015, 23(3): 196-201. doi: 10.11993/j.issn.1673-1948.2015.03.007
    [14]YANG Xiang-feng, YANG Yun-chuan, CHEN Ya-lin. Modeling and Simulation of Target Tracking for Underwater Acoustic Homing Weapon[J]. Journal of Unmanned Undersea Systems, 2013, 21(1): 015-19. doi: 10.11993/j.issn.1673-1948.2013.01.004
    [15]LIU Jian-xin, CHENG Yong-mei, YU Liang, CHEN Ke-zhe. Multi-target Tracking Method of Torpedo in the Environment of Soft Kill Weapons[J]. Journal of Unmanned Undersea Systems, 2013, 21(6): 425-430. doi: 10.11993/j.issn.1673-1948.2013.06.007
    [16]YANG Xiang-feng, YANG Yun-chuan, GUO Lei. A Data Association Method for Torpedo Multiple-Target Tracking[J]. Journal of Unmanned Undersea Systems, 2012, 20(6): 414-418. doi: 10.11993/j.issn.1673-1948.2012.06.004
    [17]SONG Bao-wei, YE Jing, CAO Yong-hui. Parametric Design of Cylinder Shell Based on Spreadsheet[J]. Journal of Unmanned Undersea Systems, 2011, 19(1): 006-9. doi: 10.11993/j.issn.1673-1948.2011.01.002
    [18]HE Hui-jiang, LI Nan. Parametric Three-Dimensional Modeling for Torpedo Contra-Rotating Propeller Based on UG OPEN / API[J]. Journal of Unmanned Undersea Systems, 2011, 19(1): 010-13. doi: 10.11993/j.issn.1673-1948.2011.01.003
    [19]HE Hui-jiang, LI Nan. Parametric Modeling of Torpedo Shell Structure Based on APDL[J]. Journal of Unmanned Undersea Systems, 2010, 18(4): 246-248. doi: 10.11993/j.issn.1673-1948.2010.04.002
    [20]WANG Bai-he, HUANG Jian-guo, ZHANG Qun-fei. Model and Algorithm for Attacked Target Optimization in Underwater Multi-targets Tracing System[J]. Journal of Unmanned Undersea Systems, 2007, 15(4): 022-25. doi: 10.11993/j.issn.1673-1948.2007.04.006
  • Created with Highcharts 5.0.7Amount of accessChart context menuAbstract Views, HTML Views, PDF Downloads StatisticsAbstract ViewsHTML ViewsPDF Downloads2024-072024-082024-092024-102024-112024-122025-012025-022025-032025-042025-052025-06051015202530
    Created with Highcharts 5.0.7Chart context menuAccess Class DistributionFULLTEXT: 37.1 %FULLTEXT: 37.1 %META: 53.1 %META: 53.1 %PDF: 9.8 %PDF: 9.8 %FULLTEXTMETAPDF
    Created with Highcharts 5.0.7Chart context menuAccess Area Distribution其他: 7.7 %其他: 7.7 %其他: 0.7 %其他: 0.7 %Kao-sung: 0.7 %Kao-sung: 0.7 %Morrisville: 0.7 %Morrisville: 0.7 %Naaldwijk: 2.1 %Naaldwijk: 2.1 %Rochester: 0.7 %Rochester: 0.7 %伊斯坦布尔: 0.7 %伊斯坦布尔: 0.7 %加利福尼亚: 1.4 %加利福尼亚: 1.4 %北京: 2.1 %北京: 2.1 %十堰: 1.4 %十堰: 1.4 %合肥: 0.7 %合肥: 0.7 %哈尔滨: 1.4 %哈尔滨: 1.4 %大庆: 1.4 %大庆: 1.4 %天津: 0.7 %天津: 0.7 %威海: 1.4 %威海: 1.4 %宁波: 0.7 %宁波: 0.7 %安卡拉省: 2.1 %安卡拉省: 2.1 %宣城: 0.7 %宣城: 0.7 %山景城: 4.9 %山景城: 4.9 %平顶山: 0.7 %平顶山: 0.7 %广州: 0.7 %广州: 0.7 %弗吉: 0.7 %弗吉: 0.7 %张家口: 4.2 %张家口: 4.2 %德里: 2.8 %德里: 2.8 %扬州: 2.8 %扬州: 2.8 %朝阳: 1.4 %朝阳: 1.4 %杭州: 2.1 %杭州: 2.1 %株洲: 0.7 %株洲: 0.7 %桂林: 2.8 %桂林: 2.8 %武汉: 0.7 %武汉: 0.7 %洛杉矶: 2.8 %洛杉矶: 2.8 %温州: 0.7 %温州: 0.7 %湛江: 1.4 %湛江: 1.4 %漯河: 2.1 %漯河: 2.1 %漳州: 0.7 %漳州: 0.7 %澳门: 1.4 %澳门: 1.4 %石家庄: 2.1 %石家庄: 2.1 %纽约: 0.7 %纽约: 0.7 %芒廷维尤: 25.2 %芒廷维尤: 25.2 %芝加哥: 2.8 %芝加哥: 2.8 %西宁: 2.1 %西宁: 2.1 %西安: 1.4 %西安: 1.4 %运城: 1.4 %运城: 1.4 %邯郸: 0.7 %邯郸: 0.7 %郑州: 0.7 %郑州: 0.7 %重庆: 0.7 %重庆: 0.7 %长沙: 1.4 %长沙: 1.4 %其他其他Kao-sungMorrisvilleNaaldwijkRochester伊斯坦布尔加利福尼亚北京十堰合肥哈尔滨大庆天津威海宁波安卡拉省宣城山景城平顶山广州弗吉张家口德里扬州朝阳杭州株洲桂林武汉洛杉矶温州湛江漯河漳州澳门石家庄纽约芒廷维尤芝加哥西宁西安运城邯郸郑州重庆长沙

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(8)  / Tables(6)

    Article Metrics

    Article Views(76) PDF Downloads(14) Cited by()
    Proportional views
    Related
    Service
    Subscribe

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return