Driving risk assessment using near-crash database through data mining of tree-based model

Accid Anal Prev. 2015 Nov:84:54-64. doi: 10.1016/j.aap.2015.07.007. Epub 2015 Aug 27.

Abstract

This paper considers a comprehensive naturalistic driving experiment to collect driving data under potential threats on actual Chinese roads. Using acquired real-world naturalistic driving data, a near-crash database is built, which contains vehicle status, potential crash objects, driving environment and road types, weather condition, and driver information and actions. The aims of this study are summarized into two aspects: (1) to cluster different driving-risk levels involved in near-crashes, and (2) to unveil the factors that greatly influence the driving-risk level. A novel method to quantify the driving-risk level of a near-crash scenario is proposed by clustering the braking process characteristics, namely maximum deceleration, average deceleration, and percentage reduction in vehicle kinetic energy. A classification and regression tree (CART) is employed to unveil the relationship among driving risk, driver/vehicle characteristics, and road environment. The results indicate that the velocity when braking, triggering factors, potential object type, and potential crash type exerted the greatest influence on the driving-risk levels in near-crashes.

Keywords: Classification and regression tree (CART); Driving risk; K-mean cluster; Naturalistic driving study; Near-crash.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Accidents, Traffic / statistics & numerical data*
  • Automobile Driving / statistics & numerical data*
  • China
  • Data Mining
  • Databases, Factual
  • Humans
  • Models, Theoretical
  • Risk Assessment / methods*