earticle

논문검색

Using Modified UCT Algorithm Basing on Risk Estimation Methods in Imperfect Information Games

초록

영어

Risk dominance and payoff dominance strategy are two complementary parts of the game theory decision strategy. While payoff dominance is still the basic principle in perfect information, two player games, risk dominance has shown its advantages in imperfect information conditions. In this paper, we first review the related work in the area of estimation methods and the influence of risk factors on computing game equilibrium. Then a new algorithm, UCT-Risk is proposed in this paper, which is a modification of UCT (UCB apply to Trees) algorithm based on risk estimation methods. Finally, we implement the proposed algorithm in SiGuo game, a popular imperfect information game in China. The experimental result of the new algorithm shows it correctness and effectiveness.

목차

Abstract
 1. Introduction
 2. Monte Carlo Method and UCT
 3. Estimating Risks in Imperfect Information Conditions
 4. UCT-Risk Algorithm
 5. Experiments and Performance Evaluation
  5.1. Parameters Set in Experiments
  5.2. Experiments against Other UCT Policies
 6. Conclusions
 Acknowledgements
 References

저자정보

  • Jiajia Zhang Intelligence Computing Research Center Harbin Institute of Technology Shenzhen Graduate School C302, HIT Campus Shenzhen University Town, NanShan District, XiLi, Shenzhen 518055, P. R. China
  • Xuan Wang Intelligence Computing Research Center Harbin Institute of Technology Shenzhen Graduate School C302, HIT Campus Shenzhen University Town, NanShan District, XiLi, Shenzhen 518055, P. R. China

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.