PIDDNet: RGB-Depth Fusion Network for Real-time Semantic Segmentation

Shin, Yunsik; Lee, Chaehyun; Son, Yongho; Kim, YangGon; Park, Junghee; Choi, Jun Won

doi:10.1109/ICTC58733.2023.10393276

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

PIDDNet: RGB-Depth Fusion Network for Real-time Semantic Segmentation

Authors: Shin, Yunsik; Lee, Chaehyun; Son, Yongho; Kim, YangGon; Park, Junghee; Choi, Jun Won

Issue Date: Oct-2023

Publisher: IEEE Computer Society

Keywords: Deep Learning; RGB-Depth Fusion; Semantic Segmentation

Citation: International Conference on ICT Convergence, pp 1049 - 1052

Pages: 4

Indexed: SCOPUS

Journal Title: International Conference on ICT Convergence

Start Page: 1049

End Page: 1052

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196350

DOI: 10.1109/ICTC58733.2023.10393276

ISSN: 2162-1233
2162-1241

Abstract: For RGB semantic segmentation, a two-branch network was proposed to effectively utilize both local detail information and global contextual information within an RGB image. This architecture combines a shallow spatial path with a deeper context path, resulting in high performance and FPS. Research on RGB-Depth segmentation has shown the performance gain that the depth map could provide complementary information to the RGB model. However, the advantage of fusing RGB and depth map within a two-branch network framework is unclear due to the distinct characteristics of these modalities. To address this, we present a novel fusion RGB-Depth architecture that takes into account the attributes of local context, global context, RGB, and depth map. Through the bidirectional image depth fusion technique, we effectively leverage each of the modalities, achieving a performance of 81.23 mIoU. This marks a gain of 1.27% when compared to the RGB-only model and 0.45% when contrasted with the element-wise feature addition fusion baseline.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE