引用本文:屈秋霞,罗艳红,张化光.针对时变轨迹的非线性仿射系统的鲁棒近似最优跟踪控制[J].控制理论与应用,2016,33(1):77~84.[点击复制]
QU Qiu-xia,LUO Yan-hong,ZHANG Hua-guang.Robust approximate optimal tracking control of time-varying trajectory for nonlinear affine systems[J].Control Theory and Technology,2016,33(1):77~84.[点击复制]
针对时变轨迹的非线性仿射系统的鲁棒近似最优跟踪控制
Robust approximate optimal tracking control of time-varying trajectory for nonlinear affine systems
摘要点击 3285  全文点击 1286  投稿时间:2014-10-15  修订日期:2015-06-08
查看全文  查看/发表评论  下载PDF阅读器
DOI编号  10.7641/CTA.2016.40963
  2016,33(1):77-84
中文关键词  非线性仿射系统  时变轨迹  最优控制  跟踪问题  渐近稳定
英文关键词  nonlinear affine systems  time-varying trajectory  optimal control  tracking problem  asymptotic stability
基金项目  国家自然科学基金项目(61273029, 61273027), 辽宁省自然科学基金(2013020037), 高等学校博士学科点专项科研基金(20110042120032), 中央高 校基本科研基金项目(N130504004, N140404004)资助.
作者单位E-mail
屈秋霞 东北大学信息科学与工程学院 quqiuxia2010@163.com 
罗艳红* 东北大学信息科学与工程学院 neuluo@gmail.com 
张化光 东北大学信息科学与工程学院  
中文摘要
      针对非线性连续系统难以跟踪时变轨迹的问题, 本文首先通过系统变换引入新的状态变量从而将非线性 系统的最优跟踪问题转化为一般非线性时不变系统的最优控制问题, 并基于近似动态规划算法(ADP)获得近似最 优值函数与最优控制策略. 为有效地实现该算法, 本文利用评价网与执行网来估计值函数及相应的控制策略, 并且 在线更新二者. 为了消除神经网络近似过程中产生的误差, 本文在设计控制器时增加一个鲁棒项; 并且通过Lyapunov 稳定性定理来证明本文提出的控制策略可保证系统跟踪误差渐近收敛到零, 同时也验证在较小的误差范围内, 该控制策略能够接近于最优控制策略. 最后给出两个时变跟踪轨迹实例来证明该方法的可行性与有效性.
英文摘要
      For continuous time nonlinear systems, it is difficult to track their time-varying trajectory. To deal with this problem, we use a system transformation to introduce a new state variable for converting the optimal tracking problem of nonlinear systems into optimal control problem of general nonlinear time-invariant systems. For this system, we obtain the approximate optimal value function and the approximate optimal control policy based on approximate dynamic programming (ADP). Then, we use the critic network and the actor network to estimate the value function and the corresponding control strategy, and update both of them online. Besides, a robust control term is added to the controller to eliminate the residual errors generated in the process of neural network approximation. By using the Lyapunov stability theorem, we prove that the proposed control strategy can guarantee the tracking error to converge asymptotically to zero, and the control strategy is close to the optimal control strategy when the error is in a small bound. Finally, simulations of two time-varying trajectory tracking examples show the feasibility and effectiveness of the proposed method.