Multi-modal dataset and fusion network for simultaneous semantic segmentation of on-road dynamic objects
- Authors
- Cho, Jieun; Ha, Jinsu; Song, Hamin; Jang, Sungmoon; Jo, Kichun
- Issue Date
- Mar-2025
- Publisher
- Elsevier Ltd
- Keywords
- Autonomous driving; Deep learning; Perception; Semantic segmentation; Sensor fusion
- Citation
- Engineering Applications of Artificial Intelligence, v.143, pp 1 - 11
- Pages
- 11
- Indexed
- SCIE; SCOPUS
- Journal Title
- Engineering Applications of Artificial Intelligence
- Volume
- 143
- Start Page
- 1
- End Page
- 11
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212176
- DOI
- 10.1016/j.engappai.2025.110024
- ISSN
- 0952-1976; 1873-6769
- Abstract
- An accurate and robust perception system is essential for autonomous vehicles to interact with various dynamic objects on the road. Applying semantic segmentation to camera and light detection and ranging (LiDAR) data classifies dynamic objects at the pixel level and the point level, respectively. However, relying on a single sensor is challenging, especially under adverse lighting conditions or with sparse point densities. To address these challenges, this paper proposes a sensor-fusion network that performs point cloud and image semantic segmentation simultaneously. The proposed network adopts a modality-specific architecture to fully leverage the characteristics of each sensor's data and achieves geometrically accurate matching through an image, point, and voxel feature fusion module. Additionally, we introduce a dataset that provides semantic labels for synchronized images and point clouds. Experimental results show that the proposed fusion approach outperforms uni-modal methods and demonstrates robust performance even in challenging real-world scenarios. The dataset is publicly available at https://github.com/ailab-konkuk/Multi-Modal-Dataset.
- Appears in Collections
- College of Engineering (Seoul) > Department of Automotive Engineering (Seoul) > 1. Journal Articles
