Model-based viewpoint invariant human activity recognition from uncalibrated monocular video sequence

Bibliographic Details
Main Authors: Htike@Muhammad Yusof, Zaw Zaw, Egerton, Simon, Kuang, Ye Chow
Format: Article
Language: English
Published: Springer-Verlag, Berlin, Germany 2010
Online Access: http://irep.iium.edu.my/43203/
http://irep.iium.edu.my/43203/4/43203_Model_based_viewpoint.pdf
http://irep.iium.edu.my/43203/5/43203_Model_based_viewpoint_CoverTOC.pdf
Description
Summary: There is growing interest in human activity recognition systems, motivated by their numerous promising applications in many domains. Despite much progress, most researchers have narrowed the problem to a fixed camera viewpoint, owing to the inherent difficulty of training their systems across all possible viewpoints. Fixed-viewpoint systems are impractical in real scenarios. Therefore, we relax the fixed-viewpoint assumption and present a novel and simple framework to recognize and classify human activities from an uncalibrated monocular video source from any viewpoint. The proposed framework comprises two stages: 3D human pose estimation and human activity recognition. In the pose estimation stage, we estimate 3D human pose using a simple search-based and tracking-based technique. In the activity recognition stage, we use Nearest Neighbor, with Dynamic Time Warping as the distance measure, to classify the multivariate time series formed by streams of pose vectors from multiple video frames. We performed experiments to evaluate the accuracy of the two stages separately. The encouraging experimental results demonstrate the effectiveness of our framework.
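
The activity recognition stage described in the summary (Nearest Neighbor classification with Dynamic Time Warping as the distance) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Euclidean per-frame distance, the function names (frame_distance, dtw_distance, classify_1nn), and the two-dimensional toy "pose vectors" are all assumptions made for illustration, since the record does not specify them.

```python
import math

def frame_distance(a, b):
    """Euclidean distance between two pose vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dtw_distance(seq_a, seq_b):
    """Classic O(n*m) Dynamic Time Warping cost between two sequences
    of pose vectors, with no warping-window constraint."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # repeat frame of seq_b
                                 cost[i][j - 1],      # repeat frame of seq_a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m]

def classify_1nn(query, training_set):
    """Return the label of the training sequence closest to query under DTW."""
    best_label, best_dist = None, float("inf")
    for sequence, label in training_set:
        d = dtw_distance(query, sequence)
        if d < best_dist:
            best_label, best_dist = label, d
    return best_label

# Hypothetical toy data: each sequence is a list of 2-D "pose vectors".
training = [
    ([(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)], "walk"),
    ([(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)], "jump"),
]
# Same motion as "walk", but slower at the start and slightly noisy.
query = [(0.0, 0.0), (0.0, 0.0), (1.0, 0.1), (2.0, 0.0)]
print(classify_1nn(query, training))  # prints "walk"
```

Because DTW lets one frame align with several frames of the other sequence, the slower query still matches the "walk" template even though the two sequences differ in length, which is the property that makes it suitable for activities performed at varying speeds.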