SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Wang Z, Jiao P, Wang J, Huang Q, Li R, Lu H. Int. J. Crashworthiness 2023; 28(6): 839-854.

Copyright

(Copyright © 2023, Informa - Taylor and Francis Group)

DOI

10.1080/13588265.2022.2130624

PMID

unavailable

Abstract

Traffic crashes cause serious traffic delay and have some unobserved heterogeneity in different areas. Using Texas accident data in 2020, this article aims to predict the level of delay caused by crashes (LDC) accurately and efficiently and discuss the difference between metropolitan and non-metropolitan areas. A framework based on Random Forests (RF) and LightGBM (LGBM) is developed to measure the association between LDC and its possible risk factors. At first, the most relevant variables in different areas were recognised through recursive feature elimination based on logistic regression. Then, LDC were forecasted by classifiers after grid search hyper parameters. To resolve data imbalance, two threshold moving methods of maximisation G-mean and F1-score were used. Finally, SHapley Additive explanation was employed to interpret the best model. The results indicate that the improved RF performs better in metropolitan areas and the improved LGBM performs better in non-metropolitan areas. In addition, Highway, spring and sunrise are the main risk factors of higher LDC in the two areas. And excessive wind speed and temperature in metropolitan areas can lead to higher LDC while in non-metropolitan areas it is pressure and apparent temperature.


Language: en

Keywords

feature learning; imbalanced data; machine learning; regional heterogeneity; Road crashes; the level of delay caused by crashes (LDC)

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print