Video retrieval of human interactions using model-based motion tracking and multi-layer finite state automata

Park, S.; Park, J.; Aggarwal, J.K.

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Video retrieval of human interactions using model-based motion tracking and multi-layer finite state automata

Authors: Park, S.; Park, J.; Aggarwal, J.K.

Issue Date: 2003

Publisher: Springer Verlag

Citation: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v.2728, pp.394 - 403

Journal Title: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume: 2728

Start Page: 394

End Page: 403

URI: https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/26597

DOI: 10.1007/3-540-45113-7_39

ISSN: 0302-9743

Abstract: Recognition of human interactions in a video is useful for video annotation, automated surveillance, and content-based video retrieval. This paper presents a model-based approach to motion tracking and recognition of human interactions using multi-layer finite state automata (FA). The system is used for widely-available, static-background monocular surveillance videos. A three-dimensional human body model is built using a sphere and cylinders and is projected on a two-dimensional image plane to fit the foreground image silhouette. We convert the human motion tracking problem into a parameter optimization problem without the need to compute inverse kinematics. A cost functional is used to estimate the degree of the overlap between the foreground input image silhouette and a projected three-dimensional body model silhouette. Motion data obtained from the tracker is analyzed in terms of feet, torso, and hands by a behavior recognition system. The recognition model represents human behavior as a sequence of states that register the configuration of individual body parts in space and time. In order to overcome the exponential growth of the number of states that usually occurs in single-level FA, we propose a multi-layer FA that abstracts states and events from motion data at multiple levels: low-level FA analyzes body parts only, and high-level FA analyzes the human interaction. Motion tracking results from video sequences are presented. Our recognition framework successfully recognizes various human interactions such as approaching, departing, pushing, pointing, and handshaking. © Springer-Verlag Berlin Heidelberg 2003.

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Park, Ji hun photo

Park, Ji hun: Engineering (Department of Computer Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :2,456,597; Today View :2,188

RSS_1.0 RSS_2.0 ATOM_1.0

94, Wausan-ro, Mapo-gu, Seoul, 04066, Korea02-320-1314

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE