LiveCap: Live Video Captioning with Sequential Encoding Network

Choi, Wangyu; Yoon, Jungwon

doi:10.1109/ICTC55196.2022.9952747

Full metadata record

DC Field	Value	Language
dc.contributor.author	Choi, Wangyu	-
dc.contributor.author	Yoon, Jungwon	-
dc.date.accessioned	2023-01-25T10:09:30Z	-
dc.date.available	2023-01-25T10:09:30Z	-
dc.date.issued	2022-10	-
dc.identifier.issn	2162-1233	-
dc.identifier.issn	2162-1241	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/182239	-
dc.description.abstract	Video captioning frameworks are useful in settings such as video surveillance systems. Most of these systems require real-time captioning; however, existing video captioning frameworks have limitations on live video. Specifically, they need the whole video before they can describe it. In this paper, we propose LiveCap, a framework that generates sentences describing the current scene in real time from live video. LiveCap consists of three modules: a sequential encoding network, a captioning network, and a context gating network. The sequential encoding network accumulates context over sequentially arriving video segments, and the captioning network generates sentences from that context. The context gating network controls the flow between the two networks to determine when to generate sentences. We train and test LiveCap on the ActivityNet Captions dataset and verify that LiveCap generates fluent and coherent captions on live video.	-
dc.format.extent	3	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	IEEE Computer Society	-
dc.title	LiveCap: Live Video Captioning with Sequential Encoding Network	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1109/ICTC55196.2022.9952747	-
dc.identifier.scopusid	2-s2.0-85143254312	-
dc.identifier.bibliographicCitation	International Conference on ICT Convergence, v.2022-October, pp 1894 - 1896	-
dc.citation.title	International Conference on ICT Convergence	-
dc.citation.volume	2022-October	-
dc.citation.startPage	1894	-
dc.citation.endPage	1896	-
dc.type.docType	Conference Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	Encoding (symbols)	-
dc.subject.keywordPlus	Network coding	-
dc.subject.keywordPlus	Security systems	-
dc.subject.keywordPlus	Statistical tests	-
dc.subject.keywordPlus	Video signal processing	-
dc.subject.keywordPlus	Real time systems	-
dc.subject.keywordPlus	current	-
dc.subject.keywordPlus	Encodings	-
dc.subject.keywordPlus	Fluents	-
dc.subject.keywordPlus	Live video	-
dc.subject.keywordPlus	Network-control	-
dc.subject.keywordPlus	Real-time	-
dc.subject.keywordPlus	Sentence-based	-
dc.subject.keywordPlus	Video segments	-
dc.subject.keywordPlus	Video surveillance systems	-
dc.identifier.url	https://ieeexplore.ieee.org/document/9952747	-
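
The abstract describes three cooperating modules: an encoder that accumulates context over incoming segments, a gate that decides when to speak, and a captioner that turns context into a sentence. The control flow between them can be sketched roughly as below. This is only an illustration of the data flow, not the authors' model: the running-average encoder, the distance-based gate, the `decay` and `threshold` parameters, and the `caption_fn` stub are all assumptions made for this sketch, and the record does not specify how any of the three networks is implemented.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass
class LiveCaptionLoop:
    """Toy streaming loop: accumulate context, gate, then caption."""

    caption_fn: Callable[[List[float]], str]  # hypothetical captioner stub
    decay: float = 0.5       # assumed weight kept from the old context
    threshold: float = 0.5   # assumed gate threshold on context change
    context: Optional[List[float]] = None
    last_emitted: Optional[List[float]] = None

    def encode(self, segment: Sequence[float]) -> List[float]:
        # Stand-in for the sequential encoder: blend each new segment's
        # features into a running context vector.
        if self.context is None:
            self.context = list(segment)
        else:
            self.context = [
                self.decay * c + (1.0 - self.decay) * s
                for c, s in zip(self.context, segment)
            ]
        return self.context

    def gate_open(self) -> bool:
        # Stand-in for the gate: open when the context has moved far
        # enough from the context used for the previous caption.
        if self.last_emitted is None:
            return True
        dist = sum(
            (a - b) ** 2 for a, b in zip(self.context, self.last_emitted)
        ) ** 0.5
        return dist > self.threshold

    def step(self, segment: Sequence[float]) -> Optional[str]:
        # Process one segment; return a caption or None if the gate is closed.
        self.encode(segment)
        if not self.gate_open():
            return None
        self.last_emitted = list(self.context)
        return self.caption_fn(self.context)


# Usage: three made-up 2-D segment features arriving one at a time.
loop = LiveCaptionLoop(caption_fn=lambda ctx: f"caption for {len(ctx)}-d context")
stream = [[0.0, 0.0], [0.1, 0.0], [2.0, 2.0]]
outputs = [loop.step(seg) for seg in stream]
```

In this toy run the first segment always triggers a caption, the second changes the context too little to open the gate, and the third (a large change in features) opens it again. The point is only that captions are produced from accumulated context without waiting for the whole video.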

Appears in Collections: Seoul School of Industrial Convergence > Seoul School of Industrial Convergence > 1. Journal Articles

Related Researcher

Yoon, Jungwon: Seoul School of Industrial Convergence
