Applying the AMAF Heuristic to Tic-Tac-Toe

Byung-Doo Lee

Article information

All-Moves-As-First Heuristic applied to the game of Tic-Tac-Toe

Korean Society for Computer Game, Journal of the Korean Society for Computer Game, Vol. 30, No. 2, June 2017, pp. 1-7 (KCI-indexed)

Abstract

Monte-Carlo Tree Search (MCTS) is a best-first search algorithm that evaluates states of the game tree in game playing, and it has been successfully applied to various games, especially the game of Go. Upper Confidence Bounds for Trees (UCT), a variant of MCTS, uses the UCB1 formula as its selection policy and balances exploitation and exploration of states. Rapid Action-Value Estimation (RAVE), an All-Moves-As-First (AMAF) heuristic, treats all moves in a simulation as the first move and therefore updates the statistics of all children of the root node. In this paper, we evaluate the performance of RAVE and UCT playing against each other in the game of Tic-Tac-Toe. The experimental results show that RAVE as the first player is much inferior to UCT as the second player (13.0±0.7%); on the other hand, UCT as the first player is far superior to RAVE (99.9±0.1%).
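The two ideas named in the abstract, UCB1 selection and the AMAF update, can be illustrated with a minimal Python sketch. It is not the paper's implementation. The function names `ucb1` and `amaf_update`, the exploration constant `c = sqrt(2)`, the reward convention (1.0 for a win), and the numbering of board squares 0 to 8 are all assumptions made for illustration.

```python
import math


def ucb1(child_wins, child_visits, parent_visits, c=math.sqrt(2)):
    """UCB1 score of a child: mean reward plus an exploration bonus.

    The constant c is an assumed default, not a value from the paper.
    """
    if child_visits == 0:
        return math.inf  # unvisited children are tried first
    exploit = child_wins / child_visits
    explore = c * math.sqrt(math.log(parent_visits) / child_visits)
    return exploit + explore


def amaf_update(amaf_stats, root_player_moves, reward):
    """AMAF update at the root (hypothetical helper).

    Every move the root player made anywhere in the simulation is
    credited as if it had been played first. amaf_stats maps a move
    to a (wins, visits) pair.
    """
    for move in set(root_player_moves):
        wins, visits = amaf_stats.get(move, (0.0, 0))
        amaf_stats[move] = (wins + reward, visits + 1)
    return amaf_stats


# One simulated game in which the root player took squares 4, 0 and 8 and won.
stats = amaf_update({}, [4, 0, 8], reward=1.0)
```

After this single simulation, squares 4, 0 and 8 each hold one win from one visit, although only square 4 was actually played first. A full RAVE player would typically blend these AMAF statistics with the ordinary Monte-Carlo statistics of each child; that blending is not shown here.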

Author information

Byung-Doo Lee. Assistant Professor, Department of Baduk (Go) Studies, Division of Physical Education, Sehan University
