Using contextual data to predict risky driving events: A novel methodology from explainable artificial intelligence
Usage-based insurance has allowed insurers to dynamically tailor insurance premiums by understanding when and how safe policyholders drive. However, telematics information can also be used to understand the driving contexts experienced by the driver within each trip (e.g., road types, weather, traffic). Since different ombinations of these conditions affect exposure to accidents, this understanding introduces predictive opportunities in driving risk assessment. This paper investigates the relationships between driving context combinations and risk using a naturalistic driving dataset of 77,859 km. In particular, XGBoost and Random Forests are used to determine the predictive significance of driving contexts for near-misses, speeding and distraction events. Moreover, the most important contextual factors in predicting these risky events are identified and ranked through Shapley Additive Explanations. The results show that the driving context has significant power in predicting driving risk. Speed limit, weather temperature, wind speed, traffic conditions and road slope appear in the top ten most relevant features for most risky events. Analysing contextual feature variations and their influence on risky events showed that low-speed limits increase the predicted frequency of speeding and phone unlocking events, whereas high-speed limits decrease harsh accelerations. Low temperatures decrease the expected frequency of harsh manoeuvres, and precipitations increase harsh acceleration, harsh braking, and distraction events. Furthermore, road slope, intersections and pavement quality are the most critical factors among road layout attributes. The methodology presented in this study aims to support road safety stakeholders and insurers by providing insights to study the contextual risk factors that influence road accident frequency and driving risk.
History
Publication
Accident Analysis & Prevention, 2023, 184, 106997Publisher
ElsevierOther Funding information
This project was supported by the Fonds National de la Recherche, Luxembourg (Project Code: 14614423) and the Spanish Ministry of Science and Innovation, NextGenerationEU (Project Codes: TED2021-130187B-I00 and PID2019-105986GB-C21.Sustainable development goals
- (3) Good Health and Well-being
- (11) Sustainable Cities and Communities
External identifier
Department or School
- Accounting & Finance