DAFA: Diversity-Aware Feature Aggregation for Attention-Based Video Object Detection

Full metadata record
DC Field | Value | Language
dc.contributor.author | Roh, Si-Dong | -
dc.contributor.author | Chung, Ki-Seok | -
dc.date.accessioned | 2024-01-10T02:06:01Z | -
dc.date.available | 2024-01-10T02:06:01Z | -
dc.date.issued | 2022-09 | -
dc.identifier.issn | 2169-3536 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/193867 | -
dc.description.abstract | We present a framework for attention-based video object detection using a simple yet effective external memory management algorithm. An attention mechanism has been adopted in video object detection task to enrich the features of key frames using adjacent frames. Although several recent studies utilized frame-level first-in-first-out (FIFO) memory to collect global video information, such a memory structure suffers from collection inefficiency, which results in low attention performance and high computational cost. To address this issue, we developed a novel scheme called diversity-aware feature aggregation (DAFA). Whereas other methods do not store sufficient feature information without expanding memory capacity, DAFA efficiently collects diverse features while avoiding redundancy using a simple Euclidean distance-based metric. Experimental results on the ImageNet VID dataset demonstrate that our lightweight model with global attention achieves 83.5 mAP on the ResNet-101 backbone, which exceeds the accuracy levels of most existing methods with a minimum runtime. Our method with global and local attention stages obtains 84.5 and 85.9 mAP on ResNet-101 and ResNeXt-101, respectively, thus achieving state-of-the-art performance without requiring additional post-processing methods. | -
dc.format.extent | 11 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | DAFA: Diversity-Aware Feature Aggregation for Attention-Based Video Object Detection | -
dc.type | Article | -
dc.publisher.location | United States | -
dc.identifier.doi | 10.1109/ACCESS.2022.3203399 | -
dc.identifier.scopusid | 2-s2.0-85137603207 | -
dc.identifier.wosid | 000853793200001 | -
dc.identifier.bibliographicCitation | IEEE ACCESS, v.10, pp 93453 - 93463 | -
dc.citation.title | IEEE ACCESS | -
dc.citation.volume | 10 | -
dc.citation.startPage | 93453 | -
dc.citation.endPage | 93463 | -
dc.type.docType | Article | -
dc.description.isOpenAccess | Y | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Telecommunications | -
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Telecommunications | -
dc.subject.keywordAuthor | Feature extraction | -
dc.subject.keywordAuthor | Memory management | -
dc.subject.keywordAuthor | Object detection | -
dc.subject.keywordAuthor | Video recording | -
dc.subject.keywordAuthor | Termination of employment | -
dc.subject.keywordAuthor | Neural networks | -
dc.subject.keywordAuthor | Micromechanical devices | -
dc.subject.keywordAuthor | Attention mechanism | -
dc.subject.keywordAuthor | diversity-aware | -
dc.subject.keywordAuthor | neural networks | -
dc.subject.keywordAuthor | spatio-temporal | -
dc.subject.keywordAuthor | video object detection | -
dc.identifier.url | https://ieeexplore.ieee.org/document/9874741 | -
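
The abstract contrasts a frame-level FIFO memory with a memory that keeps diverse features and avoids redundancy using a Euclidean distance-based metric. The record does not give the paper's actual update rule, so the following is only a minimal sketch of the general idea under stated assumptions: features are toy 2-D tuples, the function names `fifo_update` and `diversity_update` are hypothetical, and the eviction rule (drop the entry whose nearest neighbour is closest) is an illustrative choice, not the authors' method.

```python
import math

def fifo_update(memory, feature, capacity):
    """Baseline: append the new feature and evict the oldest when over capacity."""
    memory.append(feature)
    if len(memory) > capacity:
        memory.pop(0)
    return memory

def nearest_distance(feature, others):
    """Euclidean distance from `feature` to its closest neighbour in `others`."""
    return min(math.dist(feature, other) for other in others)

def diversity_update(memory, feature, capacity):
    """Assumed rule: when full, evict the most redundant entry, i.e. the one
    whose nearest neighbour (among stored features plus the new one) is closest.
    Ties are broken in favour of evicting the older entry."""
    if len(memory) < capacity:
        memory.append(feature)
        return memory
    candidates = memory + [feature]
    scores = [
        nearest_distance(c, candidates[:i] + candidates[i + 1:])
        for i, c in enumerate(candidates)
    ]
    del candidates[scores.index(min(scores))]
    memory[:] = candidates
    return memory

# Toy stream: one distinctive feature followed by several near-duplicates.
stream = [(0, 0), (10, 10), (0, 1), (0, 2), (0, 3)]

fifo_memory, diverse_memory = [], []
for f in stream:
    fifo_update(fifo_memory, f, capacity=3)
    diversity_update(diverse_memory, f, capacity=3)

print("FIFO memory:     ", fifo_memory)
print("Diversity memory:", diverse_memory)
```

On this toy stream the FIFO memory ends up holding only the three near-duplicate features, while the distance-based rule still retains the distinctive `(10, 10)` entry at the same capacity. This is the kind of collection inefficiency the abstract attributes to FIFO memories, shown here on made-up data rather than real frame features.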
Appears in Collections
Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Ki Seok
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)