Dual aggregated feature pyramid network for multi label classification
- Authors
- Yun, Dongjoo; Ryu, Jongbin; Lim, Jongwoo
- Issue Date
- Apr-2021
- Publisher
- Elsevier B.V.
- Keywords
- Aggregation; Deep learning; Multi-label classification
- Citation
- Pattern Recognition Letters, v.144, pp.75 - 81
- Indexed
- SCIE
SCOPUS
- Journal Title
- Pattern Recognition Letters
- Volume
- 144
- Start Page
- 75
- End Page
- 81
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/142135
- DOI
- 10.1016/j.patrec.2021.01.013
- ISSN
- 0167-8655
- Abstract
- While many deep convolutional neural networks show promising performance in various classification tasks, multiple objects appearing in very different sizes, shapes, and appearances cause difficulty in multilabel classification using conventional neural networks. In this paper, we introduce a dual aggregated network on pyramidal convolutional features for multi-label classification. The proposed method includes both featureand classifier-level aggregation to learn discriminant multi-scale information of various target objects in the image. First, the feature-level aggregation collects the convolutional activation maps from the multi-scale pyramid network, and then it densely pools them to take localized features of each object. We elaborately design the feature aggregation method so that the responses from the objects with different sizes, aspect ratios, and shapes are properly reflected the aggregated activation map. Unlike conventional methods, this process does not require the region proposal step, which reduces the computational burden significantly. Second, we introduce the classifier level aggregation algorithm for integrating the multi-object classifier modules. To maximize the discrimination power of each class, we train one-vs-all classifiers for individual classes using the class-wise loss function. For each test image, the scores from the class-wise classifiers are aggregated to get the final multi-label classification result. By combining the above featureand classifier-level aggregation methods, our network can be trained in an end-to-end fashion, which is not possible for the conventional multi-label classification algorithms using region proposals. Extensive evaluations on PASCAL VOC 2007 and PASCAL VOC 2012 demonstrate that the proposed algorithm outperforms the state-of-the-art methods.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/142135)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.