Dual aggregated feature pyramid network for multi label classification

Yun, Dongjoo; Ryu, Jongbin; Lim, Jongwoo

doi:10.1016/j.patrec.2021.01.013

Detailed Information

Cited 3 time in webofscience

Cited 2 time in scopus

Metadata Downloads

Dual aggregated feature pyramid network for multi label classification

Authors: Yun, Dongjoo; Ryu, Jongbin; Lim, Jongwoo

Issue Date: Apr-2021

Publisher: Elsevier B.V.

Keywords: Aggregation; Deep learning; Multi-label classification

Citation: Pattern Recognition Letters, v.144, pp.75 - 81

Indexed: SCIE
SCOPUS

Journal Title: Pattern Recognition Letters

Volume: 144

Start Page: 75

End Page: 81

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/142135

DOI: 10.1016/j.patrec.2021.01.013

ISSN: 0167-8655

Abstract: While many deep convolutional neural networks show promising performance in various classification tasks, multiple objects appearing in very different sizes, shapes, and appearances cause difficulty in multilabel classification using conventional neural networks. In this paper, we introduce a dual aggregated network on pyramidal convolutional features for multi-label classification. The proposed method includes both featureand classifier-level aggregation to learn discriminant multi-scale information of various target objects in the image. First, the feature-level aggregation collects the convolutional activation maps from the multi-scale pyramid network, and then it densely pools them to take localized features of each object. We elaborately design the feature aggregation method so that the responses from the objects with different sizes, aspect ratios, and shapes are properly reflected the aggregated activation map. Unlike conventional methods, this process does not require the region proposal step, which reduces the computational burden significantly. Second, we introduce the classifier level aggregation algorithm for integrating the multi-object classifier modules. To maximize the discrimination power of each class, we train one-vs-all classifiers for individual classes using the class-wise loss function. For each test image, the scores from the class-wise classifiers are aggregated to get the final multi-label classification result. By combining the above featureand classifier-level aggregation methods, our network can be trained in an end-to-end fashion, which is not possible for the conventional multi-label classification algorithms using region proposals. Extensive evaluations on PASCAL VOC 2007 and PASCAL VOC 2012 demonstrate that the proposed algorithm outperforms the state-of-the-art methods.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Lim, Jongwoo photo

Lim, Jongwoo: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :5,980,979; Today View :6,952

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE