Cited 0 time in
LiveCap: Live Video Captioning with Sequential Encoding Network
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Choi, Wangyu | - |
| dc.contributor.author | Yoon, Jungwon | - |
| dc.date.accessioned | 2023-01-25T10:09:30Z | - |
| dc.date.available | 2023-01-25T10:09:30Z | - |
| dc.date.issued | 2022-10 | - |
| dc.identifier.issn | 2162-1233 | - |
| dc.identifier.issn | 2162-1241 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/182239 | - |
| dc.description.abstract | Today, video captioning frameworks are very useful in places such as video surveillance systems. Most of these systems require real-time captioning, however existing video captioning frameworks still have some limitations in live video. Specifically, they require the whole video to describe. In this paper, we propose LiveCap, a framework for generating sentences corresponding to the current scene in real time from live video. LiveCap consists of three modules: sequential encoding network, captioning network, and context gating network. Our framework accumulates context for sequentially given video segments (sequential encoding network) and generates sentences based on it (captioning network). Furthermore, the context gating network controls the flow between the two networks to determine when to generate sentences. We train and test LiveCap on the ActivityNet Captions dataset and verify that LiveCap generates fluent and coherent captions in live video. | - |
| dc.format.extent | 3 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE Computer Society | - |
| dc.title | LiveCap: Live Video Captioning with Sequential Encoding Network | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ICTC55196.2022.9952747 | - |
| dc.identifier.scopusid | 2-s2.0-85143254312 | - |
| dc.identifier.bibliographicCitation | International Conference on ICT Convergence, v.2022-October, pp 1894 - 1896 | - |
| dc.citation.title | International Conference on ICT Convergence | - |
| dc.citation.volume | 2022-October | - |
| dc.citation.startPage | 1894 | - |
| dc.citation.endPage | 1896 | - |
| dc.type.docType | Conference Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Encoding (symbols) | - |
| dc.subject.keywordPlus | Network coding | - |
| dc.subject.keywordPlus | Security systems | - |
| dc.subject.keywordPlus | Statistical tests | - |
| dc.subject.keywordPlus | Video signal processing | - |
| dc.subject.keywordPlus | Real time systems | - |
| dc.subject.keywordPlus | current | - |
| dc.subject.keywordPlus | Encodings | - |
| dc.subject.keywordPlus | Fluents | - |
| dc.subject.keywordPlus | Live video | - |
| dc.subject.keywordPlus | Network-control | - |
| dc.subject.keywordPlus | Real- time | - |
| dc.subject.keywordPlus | Sentence-based | - |
| dc.subject.keywordPlus | Video segments | - |
| dc.subject.keywordPlus | Video surveillance systems | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/9952747 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
