引用本文:阮晓钢,蔡建羡.模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制[J].控制理论与应用,2010,27(7):960~964.[点击复制]
RUAN Xiao-gang,CAI Jian-xian.Fuzzy operant conditioning probabilistic automaton bionic autonomous learning system and robot self-balancing control[J].Control Theory and Technology,2010,27(7):960~964.[点击复制]
模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制
Fuzzy operant conditioning probabilistic automaton bionic autonomous learning system and robot self-balancing control
摘要点击 1713  全文点击 1284  投稿时间:2009-04-11  修订日期:2009-08-10
查看全文  查看/发表评论  下载PDF阅读器
DOI编号  
  2010,27(7):960-964
中文关键词  操作条件反射  概率自动机  模糊推理  仿生自主学习系统    自平衡控制
英文关键词  operant conditioning  probabilistic automaton  fuzzy inference  bionic autonomous learning system  entropy  self-balancing control
基金项目  国家自然科学基金资助项目(60774077); 国家“863”计划资助项目(2007AA04Z226); 北京市教委重点资助项目(KZ200810005002).
作者单位E-mail
阮晓钢 北京工业大学 电子信息与控制学院  
蔡建羡* 北京工业大学 电子信息与控制学院 cjxlaq@163.com 
中文摘要
      为了实现两轮机器人的自平衡控制, 利用Skinner操作条件反射机理, 以概率自动机为平台, 融入模糊推理, 构造了模糊操作条件概率自动机(OCPA)仿生自主学习系统. 该学习系统是一个从状态集合到操作行为集合的随机映射, 采用操作条件反射学习机制, 从操作行为集合中随机学习作为控制系统控制信号的最优行为, 并利用学习到的操作行为取向值信息, 调整操作条件反射学习算法. 此外, 学习系统还引入行为熵, 以验证其自学习和自组织能力. 应用于两轮机器人自平衡控制的仿真结果, 验证了模糊OCPA学习系统的可行性.
英文摘要
      A fuzzy operant conditioning probabilistic automaton(OCPA) bionic autonomous learning system is constructed based on Skinner operant conditioning theory and combined with the probabilistic automaton and fuzzy inference for realizing a two-wheeled robot self-balancing control. The learning system is a stochastic mapping from state sets to operant action sets. The optimal action for controlling the system is stochastically learned from the operant action set by adopting operant conditioning learning algorithm; in the same time the orientation value information of the learned operant action is used to adjust the operant conditioning learning algorithm. In addition, the action entropy is added to verify the self-learning and self-organizing ability of the learning system. In the simulation, a two-wheeled robot self-balancing control is realized, demonstrating the feasibility of the fuzzy OCPA learning system.