Interpreting the influential factors in ship detention using a novel random forest algorithm considering dataset imbalance and uncertainty
- Authors
- Xiao, Yi; Jin, Mengjie; Qi, Guanqiu; Shi, Wenming; Li, Kevin X.; Du, Xianping
- Issue Date
- Jul-2024
- Publisher
- Elsevier Ltd
- Keywords
- Data imbalance; Data uncertainty; Port state control; PSC; Prediction; Ship inspection; Uncertain random forest
- Citation
- Engineering Applications of Artificial Intelligence, v.133
- Journal Title
- Engineering Applications of Artificial Intelligence
- Volume
- 133
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/73856
- DOI
- 10.1016/j.engappai.2024.108369
- ISSN
- 0952-1976
1873-6769
- Abstract
- Port State Control inspects foreign ships in national ports to verify that ships' conditions and equipment obey international regulations and that the ships are crewed and operated in accordance with these regulations. Port State Control has proven useful in ensuring a “safer ship and cleaner ocean.” To support the effectiveness and efficiency of inspections, targeted ships should only be considered if they are at a high risk for accidents. The key factors for ship selection have been included in inspection regimes, but their combined effect on causing ship detention is unclear. Meanwhile, certain factors are characterized by data uncertainty that may influence inspection results and even the time window of an inspection. Furthermore, although tens of thousands of inspection data items are produced yearly, the probability of ship detention is around 3%. Therefore, a new uncertain random forest algorithm has been developed to address factor uncertainty and data imbalances. This algorithm generates rules for the relationships between the multi-factors and ship detention with high accuracy and robustness performance. Based on uncertain random forest models, the following three results are presented. First, the optimal data balancing strategy is a detention ratio of 30% rather than 50%, which could better balance inspection accuracy and efficiency. Second, data uncertainty influences the prediction probability of ship detention; as the uncertainty interval range increases, the prediction probability decreases. Third, the uncertain random forest algorithm generates Port State Control's association rules. Thus, this algorithm can help port authorities identify substandard vessels more efficiently. © 2024 Elsevier Ltd
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - ETC > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/73856)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.