Joint Representation of Temporal Image Sequences and Object Motion for Video Object Detection
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Koh, Junho | - |
dc.contributor.author | Kim, Jaekyum | - |
dc.contributor.author | Shin, Younji | - |
dc.contributor.author | Lee, Byeongwon | - |
dc.contributor.author | Yang, Seungji | - |
dc.contributor.author | Choi, Jun Won | - |
dc.date.accessioned | 2022-07-06T17:44:08Z | - |
dc.date.available | 2022-07-06T17:44:08Z | - |
dc.date.created | 2022-05-04 | - |
dc.date.issued | 2021-05 | - |
dc.identifier.issn | 1050-4729 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/141863 | - |
dc.description.abstract | In this paper, we propose a new video object detection (VoD) method, referred to as temporal feature aggregation and motion-aware VoD (TM-VoD), that produces a joint representation of temporal image sequences and object motion. The TM-VoD generates strong spatio-temporal features for VoD by exploiting temporally redundant information in an image sequence together with the motion context. These features are produced at the feature level in the region proposal stage and at the instance level in the refinement stage. In the region proposal stage, visual features are temporally fused with appropriate weights at the pixel level via a gated attention model. Furthermore, pixel-level motion features are obtained by capturing the changes between adjacent visual feature maps. In the refinement stage, the visual features are aligned and aggregated at the instance level. We propose a novel feature alignment method, which uses the initial region proposals as anchors to predict the box coordinates for all video frames. Moreover, the instance-level motion features are obtained by applying region of interest (RoI) pooling to the pixel-level motion features and by encoding the sequential changes in the box coordinates. Finally, all these instance-level features are concatenated to produce a joint representation of the objects. Experiments on the ImageNet VID dataset demonstrate that the proposed method outperforms existing VoD methods and achieves performance comparable to that of state-of-the-art approaches. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEEE | - |
dc.title | Joint Representation of Temporal Image Sequences and Object Motion for Video Object Detection | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Choi, Jun Won | - |
dc.identifier.doi | 10.1109/ICRA48506.2021.9561778 | - |
dc.identifier.scopusid | 2-s2.0-85123767748 | - |
dc.identifier.wosid | 000771405405004 | - |
dc.identifier.bibliographicCitation | 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), v.2021-May, pp.13370 - 13376 | - |
dc.relation.isPartOf | 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | - |
dc.citation.title | 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021) | - |
dc.citation.volume | 2021-May | - |
dc.citation.startPage | 13370 | - |
dc.citation.endPage | 13376 | - |
dc.type.rims | ART | - |
dc.type.docType | Proceedings Paper | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Automation & Control Systems | - |
dc.relation.journalResearchArea | Robotics | - |
dc.relation.journalWebOfScienceCategory | Automation & Control Systems | - |
dc.relation.journalWebOfScienceCategory | Robotics | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/9561778 | - |
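The abstract describes two feature-level operations in the region proposal stage: fusing visual feature maps across time with per-pixel gating weights, and deriving pixel-level motion features from the differences between adjacent feature maps. The sketch below is a minimal, hypothetical illustration of those two ideas, not the paper's implementation: the gate logits here come from a simple channel-mean reduction, whereas TM-VoD uses a learned gated attention model, and the tensor shapes are assumed.

```python
import numpy as np

def gated_temporal_fusion(feature_maps):
    """Fuse T visual feature maps of shape (C, H, W) with per-pixel
    temporal weights (softmax over time), mimicking gated attention.

    The gate logits below are a stand-in (channel mean); the paper
    learns them with a gating network.
    """
    feats = np.stack(feature_maps)                    # (T, C, H, W)
    logits = feats.mean(axis=1)                       # (T, H, W) stand-in logits
    logits -= logits.max(axis=0, keepdims=True)       # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=0, keepdims=True)     # softmax over time axis
    fused = (feats * weights[:, None]).sum(axis=0)    # (C, H, W)
    return fused, weights

def pixel_motion_features(feature_maps):
    """Pixel-level motion cues as changes between adjacent feature maps."""
    feats = np.stack(feature_maps)                    # (T, C, H, W)
    return feats[1:] - feats[:-1]                     # (T-1, C, H, W)
```

In a full detector these outputs would feed the region proposal network (fused features) and, after RoI pooling, the instance-level refinement stage; here they only demonstrate the temporal weighting and differencing steps named in the abstract.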