模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制

阮晓钢; 蔡建羡

引用本文:	阮晓钢,蔡建羡.模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制[J].控制理论与应用,2010,27(7):960~964.[点击复制]
	RUAN Xiao-gang,CAI Jian-xian.Fuzzy operant conditioning probabilistic automaton bionic autonomous learning system and robot self-balancing control[J].Control Theory and Technology,2010,27(7):960~964.[点击复制]

模糊操作条件概率自动机仿生自主学习系统和机器人自平衡控制

Fuzzy operant conditioning probabilistic automaton bionic autonomous learning system and robot self-balancing control

摘要点击 1713 全文点击 1284 投稿时间：2009-04-11 修订日期：2009-08-10

查看全文查看/发表评论下载PDF阅读器

DOI编号

2010,27(7):960-964

中文关键词操作条件反射概率自动机模糊推理仿生自主学习系统熵自平衡控制

英文关键词 operant conditioning probabilistic automaton fuzzy inference bionic autonomous learning system entropy self-balancing control

基金项目国家自然科学基金资助项目(60774077); 国家“863”计划资助项目(2007AA04Z226); 北京市教委重点资助项目(KZ200810005002).

作者	单位	E-mail
阮晓钢	北京工业大学电子信息与控制学院
蔡建羡^*	北京工业大学电子信息与控制学院	cjxlaq@163.com

中文摘要

为了实现两轮机器人的自平衡控制, 利用Skinner操作条件反射机理, 以概率自动机为平台, 融入模糊推理, 构造了模糊操作条件概率自动机(OCPA)仿生自主学习系统. 该学习系统是一个从状态集合到操作行为集合的随机映射, 采用操作条件反射学习机制, 从操作行为集合中随机学习作为控制系统控制信号的最优行为, 并利用学习到的操作行为取向值信息, 调整操作条件反射学习算法. 此外, 学习系统还引入行为熵, 以验证其自学习和自组织能力. 应用于两轮机器人自平衡控制的仿真结果, 验证了模糊OCPA学习系统的可行性.

英文摘要

A fuzzy operant conditioning probabilistic automaton(OCPA) bionic autonomous learning system is constructed based on Skinner operant conditioning theory and combined with the probabilistic automaton and fuzzy inference for realizing a two-wheeled robot self-balancing control. The learning system is a stochastic mapping from state sets to operant action sets. The optimal action for controlling the system is stochastically learned from the operant action set by adopting operant conditioning learning algorithm; in the same time the orientation value information of the learned operant action is used to adjust the operant conditioning learning algorithm. In addition, the action entropy is added to verify the self-learning and self-organizing ability of the learning system. In the simulation, a two-wheeled robot self-balancing control is realized, demonstrating the feasibility of the fuzzy OCPA learning system.