Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning

Authors: Jiawei Wang, Lijun Sun

IJCAI 2021

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | We evaluate the proposed framework on real-world bus services and actual passenger demand derived from smart card data. Our results show that the proposed model outperforms both traditional headway-based control methods and existing MARL methods.
Researcher Affiliation | Academia | Jiawei Wang and Lijun Sun, McGill University, Montreal, Canada; jiawei.wang4@mail.mcgill.ca, lijun.sun@mcgill.ca
Pseudocode | No | The paper does not contain a clearly labeled pseudocode or algorithm block.
Open Source Code | No | The paper does not contain an explicit statement or link indicating that the source code for the described methodology is publicly available.
Open Datasets | No | The paper mentions using 'real-world data' and 'smart card data' from 'Four bus routes (R1-R4) in an anonymous city', but does not provide any specific link, DOI, or citation for public access to this dataset.
Dataset Splits | No | The paper mentions training on R1 and testing on R1, R2, R3, and R4 for transferability, but does not provide specific percentages or counts for train/validation/test splits within any of the bus routes' data. It states 'training the model on R1 for 250 episodes' but gives no data splits.
Hardware Specification | No | The paper states: 'All models are implemented with Python and PyTorch 1.7.0 on Ubuntu 18.04 LTS, and experiments are conducted on a server with 256GB RAM.' This mentions RAM but not specific CPU or GPU models.
Software Dependencies | Yes | All models are implemented with Python and PyTorch 1.7.0 on Ubuntu 18.04 LTS.
Experiment Setup | Yes | We set hyper-parameter w = 0.2 in reward function Eq. (2) to place priority on system stability. We start our experiment by training the model on R1 for 250 episodes. In this simulation, the alighting and boarding times per passenger are set to t_a = 1.8 s/pax and t_b = 3.0 s/pax, respectively. To simulate the uncertainty of road conditions, buses are given a random speed v × U(0.6, 1.2) km/h when travelling between every two consecutive stops, where v is set to 30 km/h and U denotes a continuous uniform distribution. The capacity of the bus is set to 120 pax.
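
The simulation parameters quoted in the Experiment Setup row can be illustrated with a minimal Python sketch. Only the numeric values (30 km/h base speed, the U(0.6, 1.2) factor, 1.8 s/pax alighting, 3.0 s/pax boarding, 120 pax capacity) come from the text above. The function names, the additive dwell-time rule, and the way capacity limits boarding are assumptions made for illustration and may differ from the authors' simulator.

```python
import random

# Values quoted in the Experiment Setup row.
V_BASE_KMH = 30.0                 # v
SPEED_FACTOR_RANGE = (0.6, 1.2)   # bounds of the uniform factor U
T_ALIGHT_S = 1.8                  # t_a, seconds per alighting passenger
T_BOARD_S = 3.0                   # t_b, seconds per boarding passenger
CAPACITY_PAX = 120                # bus capacity


def sample_speed_kmh(rng: random.Random) -> float:
    """Draw one link speed as v * U(0.6, 1.2), i.e. roughly 18 to 36 km/h."""
    return V_BASE_KMH * rng.uniform(*SPEED_FACTOR_RANGE)


def travel_time_s(distance_km: float, rng: random.Random) -> float:
    """Travel time between two consecutive stops for a hypothetical link length."""
    return distance_km / sample_speed_kmh(rng) * 3600.0


def serve_stop(onboard: int, alighting: int, waiting: int) -> tuple[int, float]:
    """Return (passengers on board after the stop, dwell time in seconds).

    Assumption: alighting happens first, boarding is limited by the free
    capacity, and dwell time is the sum of alighting and boarding times.
    The paper excerpt does not state the dwell-time rule.
    """
    alighting = min(alighting, onboard)
    free_space = CAPACITY_PAX - (onboard - alighting)
    boarding = min(waiting, free_space)
    dwell_s = T_ALIGHT_S * alighting + T_BOARD_S * boarding
    return onboard - alighting + boarding, dwell_s


if __name__ == "__main__":
    rng = random.Random(0)
    print(f"link speed: {sample_speed_kmh(rng):.1f} km/h")
    print(f"0.5 km link: {travel_time_s(0.5, rng):.0f} s")
    print(serve_stop(onboard=100, alighting=10, waiting=50))
```

Under these assumptions, a bus carrying 100 passengers that drops 10 and meets 50 waiting passengers can board only 30 of them before reaching the 120 pax capacity, and the remaining 20 wait for the next bus.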