Development of New Performance Measures Based on Data Mining Weights for Hotspot Identification
- Authors
- Son, Seung-oh; Park, Juneyoung; Lee, Gunwoo; Abdel-Aty, Mohamed
- Issue Date
- Aug-2022
- Publisher
- US National Research Council
- Keywords
- data and data science; safety; calibration; crash analysis; data mining; network screening
- Citation
- Transportation Research Record, v.2676, no.8, pp 1 - 15
- Pages
- 15
- Indexed
- SCIE
SCOPUS
- Journal Title
- Transportation Research Record
- Volume
- 2676
- Number
- 8
- Start Page
- 1
- End Page
- 15
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/107908
- DOI
- 10.1177/03611981221084682
- ISSN
- 0361-1981
2169-4052
- Abstract
- In this study, new performance measures are proposed for hotspot identification in urban intersections. These measures reflect severity factor weights, which are determined based on data mining. To estimate the severity factor weights of crashes at urban intersections, the study utilizes tree-based random forest (RF) and extreme gradient boosting (XGB) methods. The importance of variables in the severity classification model is standardized and utilized for calculating the score of each crash, which is aggregated into intersections. The aggregated score is used as a dependent variable for the safety performance functions (SPFs) in the network screening process. To illustrate the under-dispersed severity score aggregation data, SPFs that follow the COM-Poisson distribution as well as the negative binomial (NB) are developed. Independent variables in SPFs set up intersection geometry elements that can be collected from online GIS services. Four additional performance measures are proposed, each reflecting a severity weight. Data about a total of 42,513 intersection crashes from 2017 to 2018 in South Korea were collected for crash injury severity analysis. Hotspot identification was performed on 81 intersections, and three consistency tests were conducted to validate the four measures. Tests show that the RF-based weighted N-pred,(RF) and N-EPDO(,)RF have the best consistency. Since the severity factor weights of each crash are reflected, intersections vulnerable to dangerous crashes can be analyzed in more detail. Using this method, it is expected that effective safety improvement project plans can be established with the input of safety managers in the future.
- Files in This Item
-
Go to Link
- Appears in
Collections - COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF TRANSPORTATION AND LOGISTICS ENGINEERING > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.