Citation: | ZHANG Shang, YANG Rui, CHEN Zhen, LI Ming. Motion Control Method of Autonomous Surface Vehicle Based on the PILCO Algorithm[J]. Journal of Unmanned Undersea Systems, 2021, 29(5): 541-549. doi: 10.11993/j.issn.2096-3920.2021.05.005 |
[1] |
Park S, Kayacan E, Ratti C, et al. Coordinated Control of a Reconfigurable Multi-vessel Platform: Robust Control Approach[C]//2019 International Conference on Robotics and Automation(ICRA). Montreal, Canada: IEEE, 2019.
|
[2] |
Lu Y, Zhang G, Qiao L, et al. Adaptive Output-feedback Formation Control for Underactuated Surface Vessels[J]. International Journal of Control, 2020, 93(3): 400-409.
|
[3] |
Woo J, Yu C, Kim N. Deep Reinforcement Learning-based Controller for Path Following of an Unmanned Surface Vehicle[J]. Ocean Engineering, 2019, 183: 155-166.
|
[4] |
Paulos J, Eckenstein N, Tosun T, et al. Automated Self-assembly of Large Maritime Structures by a Team of Robotic Boats[J]. IEEE Transactions on Automation Science and Engineering, 2015, 12(3): 958-968.
|
[5] |
Wang W, Mateos L A, Park S, et al. Design, Modeling, and Nonlinear Model Predictive Tracking Control of a Novel Autonomous Surface Vehicle[C]//2018 IEEE International Conference on Robotics and Automation(ICRA). Brisbane, Australia: IEEE, 2018: 6189-6196.
|
[6] |
Mnih V, Kavukcuoglu K, Silver D, et al. Playing Atari with Deep Reinforcement Learning[J]. arXiv, (2013-12-19) [2021-09-01]. https://arxiv.org/abs/1312.5602.
|
[7] |
Deisenroth M, Rasmussen C E. PILCO: A Model-based and Data-efficient Approach to Policy Search[C]// Proceedings of the 28th International Conference on Machine Learning(ICML-11). Bellevue, Washington, USA: ICML, 2011: 465-472.
|
[8] |
Ramirez W A, Leong Z Q, Nguyen H D, et al. Exploration of the Applicability of Probabilistic Inference for Learning Control in Underactuated Autonomous Underwater Vehicles[J]. Autonomous Robots, 2020, 44(6): 1121-1134.
|
[9] |
郭宪. 深入浅出强化学习: 原理入门[M]. 北京: 电子工业出版社, 2018.
|
[10] |
Fossen T I. Guidance and Control of Ocean Vehicles[M]. New Jersey: John Wiley & Sons, 1994.
|
[11] |
陈虹, 刘志远, 解小华. 非线性模型预测控制的现状与问题[J]. 控制与决策, 2001, 16(4): 385-391.
Chen Hong, Liu Zhi-yuan, Xie Xiao-hua. Nonlinear Model Predictive Control: The State and Open Problems[J]. Control and Decision, 2001, 16(4): 385-391.
|
[12] |
Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous Control with Deep Reinforcement Learning[EB/OL]. ArXiv, (2015 -09-01) [2021-09-01]. https://www.researchgate.net/publ- ication/281670459_Continuous_control_with_deep_rein- forcement_learning.
|