TY - JOUR PY - 2014// TI - Variations on a theme: topic modeling of naturalistic driving data JO - Proceedings of the Human Factors and Ergonomic Society annual meeting A1 - McLaurin, Elease A1 - McDonald, Anthony D. A1 - Lee, John D. A1 - Aksan, Nazan A1 - Dawson, Jeffrey A1 - Tippin, Jon A1 - Rizzo, Matthew SP - 2107 EP - 2111 VL - 58 IS - 1 N2 - This paper introduces Probabilistic Topic Modeling (PTM) as a promising approach to naturalistic driving data analyses. Naturalistic driving data present an unprecedented opportunity to understand driver behavior. Novel strategies are needed to achieve a more complete picture of these datasets than is provided by the local event-based analytic strategy that currently dominates the field. PTM is a text analysis method for uncovering word-based themes across documents. In this application, documents were represented by drives and words were created from speed and acceleration data using Symbolic Aggregate approximation (SAX). A twenty-topic Latent Dirichlet Allocation (LDA) topic model was developed using words from 10,705 documents (real-world drives) by 26 drivers. The resulting LDA model clustered the drives into meaningful topics. Topic membership probabilities were successfully used as features in subsequent analyses to differentiate between healthy drivers and those suffering from Obstructive Sleep Apnea.
Language: en
LA - en SN - 2169-5067 UR - http://dx.doi.org/10.1177/1541931214581443 ID - ref1 ER -