Multi-Agent Reinforcement Learning for Autonomous On Demand Vehicles

Boyali A., Hashimoto N., John V., ACARMAN T.

30th IEEE Intelligent Vehicles Symposium (IV), Paris, Fransa, 9 - 12 Haziran 2019, ss.1461-1468

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/ivs.2019.8813876
Basıldığı Şehir: Paris
Basıldığı Ülke: Fransa
Sayfa Sayıları: ss.1461-1468
Galatasaray Üniversitesi Adresli: Evet

Özet

In this study, we elaborate the procedure of designing a supervisory controller for the Autonomous Transit on Demand Vehicle (ATODV) system. Reinforcement learning is implemented to reduce the mean waiting time of the passengers, and a cost function is introduced to penalize the energy consumption of the electric vehicles. A stochastic simulation environment for an ATODV pilot project is coded in the Python environment to train the autonomous cart decision process as agents with artificial intelligence. Passenger group behavior, get-on and getoff times, destinations are modeled as random variables. A single Deep Q-Learning Network is trained subject to multi-agent settings. The ATODV system's independent decision making for the carts to reduce the passenger's waiting time while constraining the energy consumption and empty vehicle motion is evaluated.