Quantifying variable contributions to bus operation delays considering causal relationships

Bus services often face operational delays due to dynamic conditions such as traffic congestion, which can propagate through bus routes, affecting overall system performance. Understanding the causes of bus arrival delays is crucial for effective public transport management and control. Moreover, un...

Full description

Bibliographic Details
Published in:Transportation Research Part E: Logistics and Transportation Review
Main Authors: Zhang, Qi, Ma, Zhenliang, Wu, Yuanyuan, Liu, Yang, Qu, Xiaobo
Format: Text
Language:unknown
Published: World Transit Research 2025
Subjects:
Online Access:https://www.worldtransitresearch.info/research/10713
https://doi.org/10.1016/j.tre.2024.103881
_version_ 1833102870758555648
author Zhang, Qi
Ma, Zhenliang
Wu, Yuanyuan
Liu, Yang
Qu, Xiaobo
author_facet Zhang, Qi
Ma, Zhenliang
Wu, Yuanyuan
Liu, Yang
Qu, Xiaobo
author_sort Zhang, Qi
collection Monash University, Institute of Transport Studies: World Transit Research (WTR)
container_start_page 103881
container_title Transportation Research Part E: Logistics and Transportation Review
container_volume 194
description Bus services often face operational delays due to dynamic conditions such as traffic congestion, which can propagate through bus routes, affecting overall system performance. Understanding the causes of bus arrival delays is crucial for effective public transport management and control. Moreover, understanding the contribution of each factor to bus delays not only aids in developing targeted strategies to mitigate delays but is also crucial for effective decision-making and planning. Traditional research primarily focuses on correlation-based analysis, lacking the ability to reveal the underlying causal mechanisms. Additionally, no studies have considered the complex causal relationships between factors when quantifying their contributions to outcomes in public transport. This study aims to analyze the factors causing bus arrival delays from a causal perspective, focusing on quantifying the causal contribution of each factor while considering their causal relationships. Quantifying a factor’s causal contribution poses challenges due to computational complexity and statistical bias from the limited sample size. Using a causal discovery method, this study generates a causal graph for bus arrival delays and employs the causality-based Shapley value to quantify the contribution of each variable. The study further uses the Double Machine Learning (DML) approach to estimate the causal contributions, which provides a consistent and computationally feasible method. A case study was conducted using Google Transit Feed Specification (GTFS) data, focusing on high-frequency bus routes in Stockholm, Sweden. To validate the model, cross-validation was performed by comparing variable importance rankings with traditional models, including Linear Regression (LR) and Structural Equation Modeling (SEM). The comparison shows that results from the causality-based Shapley value significantly differ from those obtained by traditional methods in terms of importance rankings and influence magnitudes. The findings underscore the ...
format Text
genre DML
genre_facet DML
id ftmonashits:oai:www.worldtransitresearch.info:research-11977
institution Open Polar
language unknown
op_collection_id ftmonashits
op_doi https://doi.org/10.1016/j.tre.2024.103881
op_relation https://www.worldtransitresearch.info/research/10713
https://doi.org/10.1016/j.tre.2024.103881
op_rights Permission to publish the abstract has been given by Elsevier, copyright remains with them.
op_source World Transit Research
publishDate 2025
publisher World Transit Research
record_format openpolar
spelling ftmonashits:oai:www.worldtransitresearch.info:research-11977 2025-05-25T13:49:23+00:00 Quantifying variable contributions to bus operation delays considering causal relationships Zhang, Qi Ma, Zhenliang Wu, Yuanyuan Liu, Yang Qu, Xiaobo 2025-01-01T08:00:00Z https://www.worldtransitresearch.info/research/10713 https://doi.org/10.1016/j.tre.2024.103881 unknown World Transit Research https://www.worldtransitresearch.info/research/10713 https://doi.org/10.1016/j.tre.2024.103881 Permission to publish the abstract has been given by Elsevier, copyright remains with them. World Transit Research Explainable AI Causal graph discovery Shapley value Urban transit GTFS data mode - bus place - europe place - urban operations - performance text 2025 ftmonashits https://doi.org/10.1016/j.tre.2024.103881 2025-04-28T23:47:07Z Bus services often face operational delays due to dynamic conditions such as traffic congestion, which can propagate through bus routes, affecting overall system performance. Understanding the causes of bus arrival delays is crucial for effective public transport management and control. Moreover, understanding the contribution of each factor to bus delays not only aids in developing targeted strategies to mitigate delays but is also crucial for effective decision-making and planning. Traditional research primarily focuses on correlation-based analysis, lacking the ability to reveal the underlying causal mechanisms. Additionally, no studies have considered the complex causal relationships between factors when quantifying their contributions to outcomes in public transport. This study aims to analyze the factors causing bus arrival delays from a causal perspective, focusing on quantifying the causal contribution of each factor while considering their causal relationships. Quantifying a factor’s causal contribution poses challenges due to computational complexity and statistical bias from the limited sample size. Using a causal discovery method, this study generates a causal graph for bus arrival delays and employs the causality-based Shapley value to quantify the contribution of each variable. The study further uses the Double Machine Learning (DML) approach to estimate the causal contributions, which provides a consistent and computationally feasible method. A case study was conducted using Google Transit Feed Specification (GTFS) data, focusing on high-frequency bus routes in Stockholm, Sweden. To validate the model, cross-validation was performed by comparing variable importance rankings with traditional models, including Linear Regression (LR) and Structural Equation Modeling (SEM). The comparison shows that results from the causality-based Shapley value significantly differ from those obtained by traditional methods in terms of importance rankings and influence magnitudes. The findings underscore the ... Text DML Monash University, Institute of Transport Studies: World Transit Research (WTR) Transportation Research Part E: Logistics and Transportation Review 194 103881
spellingShingle Explainable AI
Causal graph discovery
Shapley value
Urban transit
GTFS data
mode - bus
place - europe
place - urban
operations - performance
Zhang, Qi
Ma, Zhenliang
Wu, Yuanyuan
Liu, Yang
Qu, Xiaobo
Quantifying variable contributions to bus operation delays considering causal relationships
title Quantifying variable contributions to bus operation delays considering causal relationships
title_full Quantifying variable contributions to bus operation delays considering causal relationships
title_fullStr Quantifying variable contributions to bus operation delays considering causal relationships
title_full_unstemmed Quantifying variable contributions to bus operation delays considering causal relationships
title_short Quantifying variable contributions to bus operation delays considering causal relationships
title_sort quantifying variable contributions to bus operation delays considering causal relationships
topic Explainable AI
Causal graph discovery
Shapley value
Urban transit
GTFS data
mode - bus
place - europe
place - urban
operations - performance
topic_facet Explainable AI
Causal graph discovery
Shapley value
Urban transit
GTFS data
mode - bus
place - europe
place - urban
operations - performance
url https://www.worldtransitresearch.info/research/10713
https://doi.org/10.1016/j.tre.2024.103881