Multi-layer fusion techniques using a CNN for multispectral pedestrian detection

Chen, Yunfan; Xie, Han; Shin, Hyunchul

doi:10.1049/iet-cvi.2018.5315

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Multi-layer fusion techniques using a CNN for multispectral pedestrian detection

Authors: Chen, Yunfan; Xie, Han; Shin, Hyunchul

Issue Date: Dec-2018

Publisher: INST ENGINEERING TECHNOLOGY-IET

Keywords: pedestrians; feature extraction; image matching; image fusion; object detection; feedforward neural nets; learning (artificial intelligence); multilayer fusion techniques; CNN; multispectral pedestrian detection; multilayer fused convolution neural network; pedestrian detectors; adverse illumination circumstances; shadows; overexposure; nighttime; deep learning; visible information; thermal information; MLF region proposal network; summation fusion method; convolutional layers; adverse illumination; feature extraction; feature maps; scale matching; fused ROI pooling layers; detection miss rate reduction; KAIST multispectral pedestrian dataset

Citation: IET COMPUTER VISION, v.12, no.8, pp.1179 - 1187

Indexed: SCIE
SCOPUS

Journal Title: IET COMPUTER VISION

Volume: 12

Number: 8

Start Page: 1179

End Page: 1187

URI: https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/5067

DOI: 10.1049/iet-cvi.2018.5315

ISSN: 1751-9632

Abstract: In this study, a novel multi-layer fused convolution neural network (MLF-CNN) is proposed for detecting pedestrians under adverse illumination conditions. Currently, most existing pedestrian detectors are very likely to be stuck under adverse illumination circumstances such as shadows, overexposure, or nighttime. To detect pedestrians under such conditions, the authors apply deep learning for effective fusion of the visible and thermal information in multispectral images. The MLF-CNN consists of a proposal generation stage and a detection stage. In the first stage, they design an MLF region proposal network and propose to use summation fusion method for integration of the two convolutional layers. This combination can detect pedestrians in different scales, even in adverse illumination. Furthermore, instead of extracting features from a single layer, they extract features from three feature maps and match the scale using the fused ROI pooling layers. This new multiple-layer fusion technique can significantly reduce the detection miss rate. Extensive evaluations of several challenging datasets well demonstrate that their approach achieves state-of-the-art performance. For example, their method performs 28.62% better than the baseline method and 11.35% better than the well-known faster R-CNN halfway fusion method in detection accuracy on KAIST multispectral pedestrian dataset.

Files in This Item: Go to Link

Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > SCHOOL OF ELECTRICAL ENGINEERING > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE