Average optimality and constrained average optimality for controlled queuing systems

ZHANG Lan-lan; GUO Xian-ping

Cite this article:	ZHANG Lan-lan, GUO Xian-ping. Average optimality and constrained average optimality for controlled queuing systems[J]. Control Theory & Applications (控制理论与应用), 2009, 26(2): 139-144.

Received: 2007-07-12; Revised: 2008-05-16

Year, volume(issue), pages: 2009, 26(2): 139-144

Keywords: continuous-time Markov decision processes; average criterion; controlled queuing systems; average optimal stationary policy; constrained average optimal policy

Funding: National Natural Science Foundation of China (60874004); Doctoral Program Foundation of the Ministry of Education of China (20050558022).

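Author	Affiliation	E-mail
ZHANG Lan-lan	School of Public Health and Tropical Medicine, Southern Medical University, Guangzhou, Guangdong 510515, China	katiezll@yahoo.com.cn
GUO Xian-ping	School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou, Guangdong 510275, China	mcsgxp@mail.sysu.edu.cn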

Abstract

For controlled queuing systems, a special class of continuous-time Markov decision processes, new conditions are given for average optimality and constrained average optimality under the average criterion. The conditions are stated only in terms of the primitive data of the model, and they rely on the ergodicity theory of birth-and-death processes. By the Lagrange multiplier approach, the existence of an average optimal stationary policy and of a constrained average optimal policy is established.
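
The role of birth-and-death ergodicity can be illustrated with a small numerical sketch. It is not the paper's construction: it assumes a queue truncated at a finite capacity `n_max`, a constant arrival rate, and a stationary policy that picks a service rate in each state. The names `stationary_distribution`, `average_cost`, `slow`, `fast`, and the cost function are hypothetical and chosen only for illustration. Under these assumptions the stationary distribution follows from detailed balance, and the long-run average cost of a stationary policy is the stationary expectation of the cost rate.

```python
from typing import Callable, List


def stationary_distribution(birth: List[float], death: List[float]) -> List[float]:
    """Stationary law of a finite birth-and-death chain on states 0..N.

    birth[i] is the rate of the jump i -> i+1 and death[i] is the rate of
    the jump i+1 -> i, for i = 0..N-1. Detailed balance gives
    pi[i+1] = pi[i] * birth[i] / death[i].
    """
    weights = [1.0]
    for b, d in zip(birth, death):
        weights.append(weights[-1] * b / d)
    total = sum(weights)
    return [w / total for w in weights]


def average_cost(
    n_max: int,
    arrival_rate: float,
    policy: Callable[[int], float],
    cost: Callable[[int, float], float],
) -> float:
    """Long-run average cost of a stationary policy (illustrative assumption:
    the queue is truncated at n_max and policy(i) is the service rate in state i)."""
    birth = [arrival_rate] * n_max
    death = [policy(i) for i in range(1, n_max + 1)]
    pi = stationary_distribution(birth, death)
    return sum(p * cost(i, policy(i)) for i, p in enumerate(pi))


if __name__ == "__main__":
    # Hypothetical cost rate: holding cost i plus a charge proportional to the service rate.
    def cost(i: int, rate: float) -> float:
        return float(i) + 0.5 * rate

    def slow(i: int) -> float:
        return 1.5

    def fast(i: int) -> float:
        return 3.0

    for name, policy in (("slow", slow), ("fast", fast)):
        print(name, average_cost(50, 1.0, policy, cost))
```

Comparing the printed values for the two policies shows how a stationary policy is scored under the average criterion in this toy setting. A constrained version would score a second cost function in the same way and require it to stay below a bound.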