Driving risk assessment using near-crash database through data mining of tree-based model

Jianqiang Wang; Yang Zheng; Xiaofei Li; Chenfei Yu; Kenji Kodaka; Keqiang Li

doi:10.1016/j.aap.2015.07.007

Accid Anal Prev. 2015 Nov:84:54-64. doi: 10.1016/j.aap.2015.07.007. Epub 2015 Aug 27.

Authors

Jianqiang Wang¹, Yang Zheng¹, Xiaofei Li¹, Chenfei Yu¹, Kenji Kodaka², Keqiang Li³

Affiliations

¹ State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing 100084, China.
² Honda R&D Co. Ltd., Automobile R&D Center, Tochigi 321-3393, Japan.
³ State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing 100084, China. Electronic address: likq@tsinghua.edu.cn.

PMID: 26319604
DOI: 10.1016/j.aap.2015.07.007

Abstract

This paper describes a comprehensive naturalistic driving experiment that collected driving data under potential threats on actual Chinese roads. From the acquired real-world data, a near-crash database is built that contains vehicle status, potential crash objects, driving environment and road types, weather conditions, and driver information and actions. The study has two aims: (1) to cluster the different driving-risk levels involved in near-crashes, and (2) to identify the factors that most strongly influence the driving-risk level. A novel method is proposed to quantify the driving-risk level of a near-crash scenario by clustering three characteristics of the braking process: maximum deceleration, average deceleration, and percentage reduction in vehicle kinetic energy. A classification and regression tree (CART) is then employed to reveal the relationships among driving risk, driver/vehicle characteristics, and the road environment. The results indicate that the velocity when braking, triggering factors, potential object type, and potential crash type exert the greatest influence on driving-risk levels in near-crashes.
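The abstract names the three braking features and the clustering step but gives no implementation details. The following is a minimal illustrative sketch, not the authors' procedure: it derives the three features from a sampled speed trace and groups events with a plain Lloyd-style k-means. The function names, the sample traces, the z-score scaling, the sampling interval, and the choice of two clusters are all assumptions made for illustration. Because kinetic energy is proportional to the square of speed, the percentage reduction can be computed without knowing the vehicle mass.

```python
import random

def braking_features(speeds_mps, dt):
    """Return (max decel, mean decel, % kinetic-energy reduction) for one event."""
    # Deceleration between consecutive samples, positive when slowing down.
    decels = [(a - b) / dt for a, b in zip(speeds_mps, speeds_mps[1:])]
    v0, v1 = speeds_mps[0], speeds_mps[-1]
    # Kinetic energy is proportional to v^2, so the vehicle mass cancels out.
    ke_drop_pct = 100.0 * (v0 ** 2 - v1 ** 2) / v0 ** 2
    return (max(decels), sum(decels) / len(decels), ke_drop_pct)

def standardize(rows):
    """Z-score each feature column (an assumption: the features have different units)."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [(sum((x - m) ** 2 for x in c) / len(c)) ** 0.5 or 1.0
            for c, m in zip(cols, means)]
    return [tuple((x - m) / s for x, m, s in zip(r, means, stds)) for r in rows]

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd-style k-means; returns (labels, centers)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # pick k distinct events as starting centers
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda j: sum((p - c) ** 2 for p, c in zip(pt, centers[j])))
            for pt in points
        ]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for j in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(tuple(sum(col) / len(members)
                                         for col in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers

# Hypothetical speed traces (m/s) sampled every 1 s: two mild and two harsh events.
events = [
    [15.0, 14.5, 14.0, 13.5],
    [16.0, 15.4, 14.9, 14.5],
    [20.0, 16.0, 11.0, 6.0],
    [18.0, 13.0, 8.0, 4.0],
]
features = [braking_features(trace, dt=1.0) for trace in events]
labels, centers = kmeans(standardize(features), k=2)
```

Cluster indices are arbitrary, so a risk level would have to be assigned afterwards, for example by ranking clusters on their mean deceleration. The resulting labels could then serve as the target variable for a classification tree.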

Keywords: Classification and regression tree (CART); Driving risk; K-mean cluster; Naturalistic driving study; Near-crash.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Accidents, Traffic / statistics & numerical data*
Automobile Driving / statistics & numerical data*
China
Data Mining
Databases, Factual
Humans
Models, Theoretical
Risk Assessment / methods*