Weighted Ensemble Methods for Predicting Train Delays

Al Ghamdi, Mostafa, Parr, Gerard ORCID: https://orcid.org/0000-0002-9365-9132 and Wang, Wenjia (2020) Weighted Ensemble Methods for Predicting Train Delays. In: Computational Science and Its Applications – ICCSA 2020 - 20th International Conference, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) . UNSPECIFIED, 978-3-030-58798-7, pp. 586-600. ISBN 9783030587987

Full text not available from this repository. (Request a copy)


Train delays have become a serious and common problem in the rail services due to the increasing number of passengers and limited rail network capacity, so being able to predict train delays accurately is essential for train controllers to devise appropriate plans to prevent or reduce some delays. This paper presents a machine learning ensemble framework to improve the accuracy and consistency of train delay prediction. The basic idea is to train many different types of machine learning models for each station along a chosen journey of train service using historical data and relevant weather data, and then with certain criteria to choose some models to build an ensemble. It then combines the outputs from its member models with an aggregation function to produce the final prediction. Two aggregation functions were devised to combine the outputs of individual models: averaging and weighted averaging. These ensembles were implemented with a framework and their performance was tested with the data from an intercity train service as a case study. The accuracy was measured by the percentages of correct prediction of the arrival time for a train and correct prediction within one minute to the actual arrival time. The mean accuracies and standard deviations are 42.3%(±11.24) from the individual models, 57.8%(±3.56) from the averaging ensembles, and 72.8%(±0.99) from the weighted ensembles. For the predictions within one minute of the actual times, they are 86.4%(±14.05), 94.6%(±1.34) and 96.0%(±0.47) respectively. So overall, the ensembles significantly improved not only the prediction accuracies but also the consistency and the weighted ensembles are clearly the best.

Item Type: Book Section
Uncontrolled Keywords: ensemble,machine learning,train delays,theoretical computer science,computer science(all) ,/dk/atira/pure/subjectarea/asjc/2600/2614
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Data Science and Statistics
Faculty of Science > Research Groups > Smart Emerging Technologies
Faculty of Science > Research Groups > Cyber Security Privacy and Trust Laboratory
Related URLs:
Depositing User: LivePure Connector
Date Deposited: 04 Dec 2020 00:21
Last Modified: 03 Aug 2023 15:33
URI: https://ueaeprints.uea.ac.uk/id/eprint/77887
DOI: 10.1007/978-3-030-58799-4_43

Actions (login required)

View Item View Item