SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Liu Y, Wang J, Tang S, Zhang J, Wan J. Buildings (Basel) 2023; 13(7): e1831.

Copyright

(Copyright © 2023, MDPI: Multidisciplinary Digital Publications Institute)

DOI

10.3390/buildings13071831

PMID

unavailable

Abstract

Construction accident investigation reports contain critical information, but extracting useful insights from the voluminous Chinese text is challenging. Traditional methods rely on expert judgment, which leads to time-consuming and potentially inaccurate results. To overcome this problem, we propose a novel approach that combines text mining techniques and latent Dirichlet allocation (LDA) models to analyze standardized accident investigation reports in the Chinese construction industry. The proposed method integrates an information entropy term frequency-inverse document frequency (TF-IDF) weighting scheme to evaluate term importance and accounts for word and model uncertainty. The method was applied to a set of construction industry accident reports to identify the key factors leading to safety accidents. The results show that the causal factors of accidents in Chinese accident investigation reports consist of keywords and negative expressions, including "failure to timely identify safety hazards" and "inadequate site safety management". Failure to timely identify safety hazards is the most common factor in accident investigation reports, and the negative expressions commonly used in the reports include "not timely" and "not in place". The information entropy TF-IDF method is superior to traditional methods in terms of accuracy and efficiency, and the LDA model that considers word frequency and feature weights is better able to capture the underlying themes in the Chinese corpus. And the subject terms that make up the themes contain more information about the causes of accidents. This approach helps site managers more quickly and effectively understand the causal factors and key messages that lead to accidents from incident reports. It gives site managers insight into common patterns and themes associated with safety incidents, such as unsafe practices, hazardous work environments, and non-compliance with safety regulations. This enables them to make informed decisions to improve safety management practices.


Language: en

Keywords

accident investigation report; construction industry; information entropy TF-IDF; latent Dirichlet allocation (LDA); text mining

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print