引用本文:伍瑞卓,张兴龙,徐昕,张昌昕.基于高斯过程建模的移动机器人学习预测控制方法[J].控制理论与应用,2023,40(12):2236~2246.[点击复制]
WU Rui-zhuo,ZHANG Xing-long,XU Xin,ZHANG Chang-xin.Learning predictive tracking control method with Gaussian process modeling for mobile robots[J].Control Theory and Technology,2023,40(12):2236~2246.[点击复制]
基于高斯过程建模的移动机器人学习预测控制方法
Learning predictive tracking control method with Gaussian process modeling for mobile robots
摘要点击 901  全文点击 294  投稿时间:2023-04-20  修订日期:2023-12-11
查看全文  查看/发表评论  下载PDF阅读器
DOI编号  10.7641/CTA.2023.30250
  2023,40(12):2236-2246
中文关键词  高斯过程  学习预测控制  滚动时域强化学习  环境和模型不确定性  无人系统控制技术
英文关键词  Guassian process  learning predictive control  receding horizon reinforcement learning  environment and model uncertainty  control technique of unmanned system
基金项目  国家自然科学基金项目(61825305, 62003361, U21A20518)资助.
作者单位E-mail
伍瑞卓 国防科技大学 zhangxinglong18@nudt.edu.cn 
张兴龙* 国防科技大学 zhangxinglong18@nudt.edu.cn 
徐昕 国防科技大学  
张昌昕 国防科技大学  
中文摘要
      移动机器人在复杂地形条件下面临环境和模型不确定性的挑战, 例如草地、陡坡等环境会对移动机器人的 高精度控制造成影响. 本文提出了一种基于高斯过程建模的移动机器人学习预测控制方法, 能够对环境和模型不确 定性进行实时的建模和预测, 并将该模型用于最优控制策略的学习中, 完成在模型和环境不确定下的机器人运动控 制. 该方法利用高斯过程回归对环境和模型不确定性进行建模, 并结合系统运动学方程得到误差状态模型, 并将该 模型用于滚动时域强化学习中, 通过迭代优化学习最优控制策略. 最后, 针对移动机器人在椭圆和8字形轨迹上的横 向跟踪控制问题, 进行了仿真实验, 并与非线性模型预测控制进行比较. 结果表明, 本文提出的方法能够有效提升 复杂地形条件下控制器的控制性能, 在性能指标上相比未采用高斯过程建模的滚动时域强化学习方法提高20%, 比 非线性模型预测控制方法提高36%, 验证了所提方法的有效性和优越性
英文摘要
      Due to environmental and model uncertainty, mobile robots face significant challenges in tracking control in complex environments. Dynamic environment, such as meadows, and deep slopes, would result in performance degradation. This paper proposes a learning predictive control method with Gaussian process modeling, that can effectively model and predict environmental and model uncertainty, then design optimal control strategies utilizing the uncertainty model. The paper uses Gaussian process regression to model uncertainty and utilize the model to learn the optimal policy in the receding horizon reinforcement learning algorithm, iterating to learn the optimal control strategy. Aiming at the lateral tracking control problem of wheeled robots on elliptical and eight-shaped trajectories, simulation experiments were carried out and compared with nonlinear model predictive control methods. The results indicate that the proposed algorithm effectively enhances the control performance of the controller in complex scenarios, showing a 20% improvement in performance indicators compared to receding horizon reinforcement learning method and a 36% improvement in performance indicators compared to nonlinear model predictive control method. This verifies the effectiveness and superiority of the proposed method.