PIFNet: 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving

Zheng, Wenqi; Xie, Han; Chen, Yunfan; Roh, Jeongjin; Shin, Hyunchul

doi:10.3390/app12073686

PIFNet: 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving (open access)

Authors: Zheng, Wenqi; Xie, Han; Chen, Yunfan; Roh, Jeongjin; Shin, Hyunchul

Issue Date: Apr-2022

Publisher: MDPI

Keywords: 3D object detection; lidar point cloud; camera images; object detection

Citation: Applied Sciences-Basel, vol. 12, no. 7, pp. 1-11

Pages: 11

Indexed: SCIE; SCOPUS

Journal Title: Applied Sciences-basel

Volume: 12

Number: 7

Start Page: 1

End Page: 11

URI: https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/107904

DOI: 10.3390/app12073686

ISSN: 2076-3417

Abstract: Owing to its wide range of applications, 3D object detection has attracted increasing attention in computer vision. Most existing 3D object detection methods are based on Light Detection and Ranging (LiDAR) point cloud data. However, these methods are limited in localization consistency and classification confidence because LiDAR point cloud data are irregular and sparse. Inspired by the complementary characteristics of LiDAR and camera sensors, we propose a new end-to-end learnable framework named Point-Image Fusion Network (PIFNet) to integrate the LiDAR point cloud and camera images. To resolve the inconsistency between localization and classification, we designed an Encoder-Decoder Fusion (EDF) module that extracts image features effectively while maintaining the fine-grained localization information of objects. Furthermore, a new fusion module is proposed to integrate the color and texture features from images with the depth information from the point cloud. This module mitigates the irregularity and sparsity of the point cloud features by capitalizing on the fine-grained information from camera images. In PIFNet, each intermediate feature map is fed into the fusion module to be integrated with its corresponding point-wise features. In addition, point-wise features are used instead of voxel-wise features to reduce information loss. Extensive experiments on the KITTI dataset demonstrate the superiority of PIFNet over other state-of-the-art methods. Compared with several state-of-the-art methods, our approach improves mean Average Precision (mAP) by 1.97% and Average Precision (AP) for the hard cases by 2.86% on the KITTI 3D object detection benchmark.
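The abstract does not say how image features are matched to individual points, so the following is only a minimal sketch of one common way to do point-wise fusion, not the authors' fusion module. It assumes the points are already expressed in the camera frame, that a 3 x 4 projection matrix is available, and that fusion is plain concatenation with nearest-pixel sampling. The function names `project_points` and `fuse_point_image` are hypothetical.

```python
import numpy as np


def project_points(points_xyz, proj):
    """Project N x 3 camera-frame points to pixel coordinates with a 3 x 4 matrix."""
    n = points_xyz.shape[0]
    homogeneous = np.hstack([points_xyz, np.ones((n, 1))])  # N x 4
    uvw = homogeneous @ proj.T                               # N x 3
    depth = uvw[:, 2]
    valid = depth > 1e-6                                     # keep points in front of the camera
    uv = np.zeros((n, 2))
    uv[valid] = uvw[valid, :2] / depth[valid, None]
    return uv, valid


def fuse_point_image(point_feats, points_xyz, image_feats, proj):
    """Concatenate each point's features with the image features at its projection.

    point_feats: N x Cp array, image_feats: H x W x Ci array.
    Points that fall outside the image receive zero image features.
    This concatenation is an assumed stand-in for a learned fusion layer.
    """
    height, width, channels = image_feats.shape
    uv, valid = project_points(points_xyz, proj)
    cols = np.rint(uv[:, 0]).astype(int)
    rows = np.rint(uv[:, 1]).astype(int)
    inside = valid & (cols >= 0) & (cols < width) & (rows >= 0) & (rows < height)
    sampled = np.zeros((points_xyz.shape[0], channels), dtype=image_feats.dtype)
    sampled[inside] = image_feats[rows[inside], cols[inside]]
    return np.concatenate([point_feats, sampled], axis=1)     # N x (Cp + Ci)
```

In a network with several intermediate image feature maps, a function like this would be called once per map, with the projection matrix scaled to that map's resolution. Because every point keeps its own fused feature vector, no points are merged into voxels at this stage.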

Files in This Item

PIFNet 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving.pdf 1.29 MB

Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

Related Researcher

Roh, Jeong jin: ERICA College of Engineering (SCHOOL OF ELECTRICAL ENGINEERING)
