Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning
Authors: Jiawei Wang, Lijun Sun
IJCAI 2021 | Conference PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | We evaluate the proposed framework on real-world bus services and actual passenger demand derived from smart card data. Our results show that the proposed model outperforms both traditional headway-based control methods and existing MARL methods. |
| Researcher Affiliation | Academia | Jiawei Wang and Lijun Sun McGill University, Montreal, Canada jiawei.wang4@mail.mcgill.ca, lijun.sun@mcgill.ca |
| Pseudocode | No | The paper does not contain a clearly labeled pseudocode or algorithm block. |
| Open Source Code | No | The paper does not contain an explicit statement or link indicating that the source code for the described methodology is publicly available. |
| Open Datasets | No | The paper mentions using 'real-world data' and 'smart card data' from 'Four bus routes (R1-R4) in an anonymous city', but does not provide any specific link, DOI, or citation for public access to this dataset. |
| Dataset Splits | No | The paper mentions training on R1 and testing on R1, R2, R3, R4 for transferability, but does not provide specific percentages or counts for train/validation/test splits within any of the bus routes' data. It states 'training the model on R1 for 250 episodes' but no data splits. |
| Hardware Specification | No | The paper states: 'All models are implemented with python and Py Torch 1.7.0 on Ubuntu 18.04 LTS, and experiments are conducted on a server with 256GB RAM.' This mentions RAM but not specific CPU or GPU models. |
| Software Dependencies | Yes | All models are implemented with python and Py Torch 1.7.0 on Ubuntu 18.04 LTS |
| Experiment Setup | Yes | We set hyper-parameter w = 0.2 in reward function Eq. (2) to place priority on system stability. We start our experiment by training the model on R1 for 250 episodes. In this simulation, the alighting and boarding times per passenger are set to ta = 1.8 s/pax and tb = 3.0 s/pax, respectively. To simulate the uncertainty of road conditions, buses are given a random speed v U(0.6, 1.2) km/h when travelling between every two consecutive stops, where v is set to 30 km/h and U denotes a continuous uniform distribution. The capacity of the bus is set to 120 pax. |