PIDDNet: RGB-Depth Fusion Network for Real-time Semantic Segmentation
- Authors
- Shin, Yunsik; Lee, Chaehyun; Son, Yongho; Kim, YangGon; Park, Junghee; Choi, Jun Won
- Issue Date
- Oct-2023
- Publisher
- IEEE Computer Society
- Keywords
- Deep Learning; RGB-Depth Fusion; Semantic Segmentation
- Citation
- International Conference on ICT Convergence, pp 1049 - 1052
- Pages
- 4
- Indexed
- SCOPUS
- Journal Title
- International Conference on ICT Convergence
- Start Page
- 1049
- End Page
- 1052
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196350
- DOI
- 10.1109/ICTC58733.2023.10393276
- ISSN
- 2162-1233
2162-1241
- Abstract
- For RGB semantic segmentation, a two-branch network was proposed to effectively utilize both local detail information and global contextual information within an RGB image. This architecture combines a shallow spatial path with a deeper context path, resulting in high performance and FPS. Research on RGB-Depth segmentation has shown the performance gain that the depth map could provide complementary information to the RGB model. However, the advantage of fusing RGB and depth map within a two-branch network framework is unclear due to the distinct characteristics of these modalities. To address this, we present a novel fusion RGB-Depth architecture that takes into account the attributes of local context, global context, RGB, and depth map. Through the bidirectional image depth fusion technique, we effectively leverage each of the modalities, achieving a performance of 81.23 mIoU. This marks a gain of 1.27% when compared to the RGB-only model and 0.45% when contrasted with the element-wise feature addition fusion baseline.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.