Abstract:
This paper addresses the problem of recognizing human actions from video. Particularly, the case of recognizing events in tennis game videos is analyzed. Driven by our domain knowledge, a robust player segmentation algorithm is developed for real video data. Further, we introduce a number of novel features to be extracted for our particular application. Different feature combinations are investigated in order to find the optimal one. Finally, recognition results for different classes of tennis strokes using automatic learning capability of Hidden Markov Models (HMMs) are presented. The experimental results demonstrate that our method is close to realizing statistics of tennis games automatically using ordinary TV broadcast videos. 1.
Citations
|
4344
|
Maximum likelihood from incomplete data via the EM algorithm
– Dempster, Laird, et al.
- 1977
|
|
905
|
Robust Statistics
– Huber
- 1981
|
|
196
|
Visual Information Retrieval
– Bimbo, Alberto
- 1999
|
|
187
|
Visual perception of biological motion and a model for its analysis
– Johansson
- 1973
|
|
172
|
Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding
– Ayer, Sawhney
- 1995
|
|
82
|
Knowledge-assisted content-based retrieval for multimedia databases
– Yoshitaka, Kishida, et al.
- 1994
|
|
73
|
Multiresolution Image Processing and Analysis
– Rosenfeld
- 1984
|
|
60
|
Inferring body pose without tracking body parts
– Rosales, Sclaro
|
|
59
|
Real-time human motion analysis by image skeletonization
– Fujiyoshi, Lipton
- 1998
|
|
46
|
Automatic Classification of Tennis Video for High-level Content-based Retrieval
– Sudhir, Lee, et al.
|
|
20
|
Elmagarmid. Spatial and temporal content-based access to hyper video databases
– Jiang, K
- 1998
|
|
18
|
Computer Vision for Human-Machine Interaction
– Cipolla, Pentland
- 1998
|
|
12
|
Video annotation for content-based retrieval using human behavior analysis and domain knowledge
– Miyamori, Iisaku
- 2000
|
|
9
|
A Framework for Video Modelling
– Petkovic, Jonker
- 2000
|
|
6
|
Overview of Data Models and Query Languages for Content-based Video
– Petkovic, Jonker
- 2000
|
|
4
|
LucentVision: Converting Real World Events into Multimedia Experiences
– Pingali, Jean, et al.
- 2000
|
|
3
|
Hidden Markov Models for Speech Recognition
– Michaelson, Steedman
- 1990
|
|
2
|
Recognizing Human Action in Time-Seuential Images using Hidden Markov Model
– Yamato, Ohya, et al.
- 1992
|