相似文献/References:
[1]李龙澍 葛瑞峰 王慧萍.基于神经网络的批强化学习在Robocup中的应用[J].计算机技术与发展,2009,(07):98.
LI Long-shu,GE Rui-feng,WANG Hui-ping.Application of Batch Reinforcement Learning Based on NN to Robocup[J].,2009,(01):98.
[2]于东超 耿祥义 刘泮青.5vs5仿真机器人足球比赛——防守算法研究[J].计算机技术与发展,2008,(02):59.
YU Dong-chao,GENG Xiang-yi,LIU Pan-qing.5vs5 Simulation Robot Soccer Competition: Defence Algorithm Research[J].,2008,(01):59.
[3]周勇 刘锋.基于改进的Q学习的RoboCup传球策略研究[J].计算机技术与发展,2008,(04):63.
ZHOU Yong,LIU Feng.Research of RoboCup Pass Strategy Based on Improved Q- Learning[J].,2008,(01):63.
[4]马勇 李龙澍 李学俊.基于Q学习的Agent智能防守策略研究与应用[J].计算机技术与发展,2008,(12):106.
MA Yong,LI Long-shu,LI Xue-jun.Research and Application about Defensive Strategy Based on Q Learning[J].,2008,(01):106.
[5]朱志强 王建元 王芳.基于Agent的核心计算机操作机制研究[J].计算机技术与发展,2007,(07):8.
ZHU Zhi-qiang,WANG Jian-yuan,WANG Fang.Agent - Based Research of Operational Mechanism for Core Computer[J].,2007,(01):8.
[6]刘丹 谢益武.面向智能体的信息系统开发方法研究[J].计算机技术与发展,2006,(03):101.
LIU Dan,XIE Yi-wu.Research on Development Methods in Agent-Oriented IS[J].,2006,(01):101.
[7]吴智威 刘东峰 程昱 孙粤辉.基于智能体方法的人群疏散三维仿真[J].计算机技术与发展,2012,(11):108.
WU Zhi-wei,LIU Dong-feng,CHENG Yu,et al.Three-dimensional Crowd Simulation of Agent,based Method[J].,2012,(01):108.
[8]李文振,万晓冬,李育岭,等.基于XML的作战仿真想定的研究与实现[J].计算机技术与发展,2013,(06):183.
LI Wen-zhen,WAN Xiao-dong,LI Yu-ling,et al.Research and Implementation of Operation Simulation Scenario Based on XML[J].,2013,(01):183.
[9]赵春,方敏. 基于区域分割的交通仿真死锁处理算法研究[J].计算机技术与发展,2017,27(05):25.
ZHAO Chun,FANG Min. Investigation on Deadlock Resolution Algorithm for Traffic Simulation with Region Segmentation[J].,2017,27(01):25.
[10]雷 莹,许道云.一种合作 Markov 决策系统[J].计算机技术与发展,2020,30(12):8.[doi:10. 3969 / j. issn. 1673-629X. 2020. 12. 002]
LEI Ying,XU Dao-yun.A Cooperation Markov Decision Process System[J].,2020,30(01):8.[doi:10. 3969 / j. issn. 1673-629X. 2020. 12. 002]