Abstract:
Coverage path planning for unmanned underwater vehicles (UUV) in unknown aquatic environments is a critical task, with wide applications in marine resource exploration, environmental monitoring, and underwater reconnaissance. However, due to environmental uncertainties, motion constraints, and energy limitations, traditional path planning methods struggle to adapt to complex scenarios. This paper proposes an adaptive multi-objective optimization-based UUV coverage path planning method, integrating Proximal Policy Optimization (PPO) with a dynamic weight adjustment mechanism. By analyzing the correlation between reward objectives and employing linear regression estimation, the proposed approach adaptively adjusts the weights of different optimization objectives, enabling UUVs to autonomously plan efficient coverage paths in environments with unknown obstacles and ocean currents. To validate the effectiveness of the proposed method, a two-dimensional simulation environment incorporating a simplified planar motion model based on a six-degree-of-freedom rigid-body motion framework and a sonar detection model was developed. Comparative experiments were conducted under various obstacle distributions and random ocean currents. Experimental results demonstrate that, compared with traditional methods, the proposed approach improves coverage while optimizing mission completion rate, trajectory length, energy consumption, and information latency. Specifically, it increases coverage by 4.03%, enhances mission completion rate by 10%, improves utility by 10.96%, reduces mission completion time by 14.13%, shortens trajectory length by 26.85%, lowers energy consumption by 10.3%, and decreases information latency by 19.34%. These results indicate that the proposed method significantly enhances the adaptability and robustness of UUVs in complex environments, providing a novel optimization strategy for autonomous underwater exploration tasks.