원문정보
초록
영어
Effective warehouse management requires advanced resource planning to optimize profits and space. Robots offer a promising solution, but their effectiveness relies on embedded artificial intelligence. Multi-agent reinforcement learning (MARL) enhances robot intelligence in these environments. This study explores various MARL algorithms using the Multi-Robot Warehouse Environment (RWARE) to determine their suitability for warehouse resource planning. Our findings show that cooperative MARL is essential for effective warehouse management. IA2C outperforms MAA2C and VDA2C on smaller maps, while VDA2C excels on larger maps. IA2C’s decentralized approach, focusing on cooperation over collaboration, allows for higher reward collection in smaller environments. However, as map size increases, reward collection decreases due to the need for extensive exploration. This study highlights the importance of selecting the appropriate MARL algorithm based on the specific warehouse environment's requirements and scale.
목차
1. Introduction
2. Literature Review
2.1 Multi-agent Reinforcement Learning (MARL)
2.2 Cooperative and Competitive Behavior in MARL
2.3 Distinguishing Collaboration and Cooperation Goals in Cooperative Behavior
3. Methodology
3.1 System and Environment
3.2 Independent Synchronous Advantage Actor-Critic
3.3 Multi-Agent Advantage Actor-Critic
3.4 Value Decomposition Advantage Actor-Critic
4. Result and Discussion
5. Conclusion
Acknowledgment
References