by Fabrice Souvannavong, Bernard Merialdo, Benoît Huet
In Proceedings of the ACM International Workshop on Multimedia Information Retrieval
http://bmgroup.eurecom.fr/papers/souvfa-041015.pdf
Add To MetaCart
Abstract:
We present a complete and efficient framework for video shot indexing and retrieval. Video shots are described by their key-frame, themselves described by their regions. Regionbased approaches suffer from the complexity of segmentation and comparison tasks. A compact region-based shot representation is usually obtained thanks to vector-quantization method. We thus introduce LSA to reduce the noise inherent to the segmentation and the quantization processes. Then to better capture the content of video shots, we propose two original methods. The first takes advantage of a multi-scale segmentation of frames while the second uses multiple frames to represent a shot. Both approaches require more computation time during the pre-processing but not for indexing and comparison tasks. Indeed the extra information is included in the original signatures of shots. Finally we introduce a relevance feedback loop to optimize the search and propose a new method to optimize the effect of LSA. In the experimental section, we make an evaluation of latent semantic analysis and proposed approaches on two problems, namely object retrieval and semantic content estimation. Categories and Subject Descriptors H.3.1 [Information storage and retrieval]: Content analysis and indexing—Indexing methods; H.3.3 [Information storage and retrieval]: Information search and retrieval— Relevance feedback
Citations
|
1463
|
Indexing by Latent Semantic Analysis
– Deerwester, Dumais, et al.
- 1990
|
|
559
|
Relevance feedback in information retrieval, The
– Rocchio
- 1971
|
|
220
|
A practical approach to feature selection
– Rendell, Kira
- 1992
|
|
202
|
Fast multiresolution image querying
– Jacobs, Finkelstein, et al.
- 1995
|
|
195
|
Blobworld: A system for region-based image indexing and retrieval
– Carson, Thomas, et al.
- 1999
|
|
182
|
Object recognition as machine translation; learning a lexicon for a fixed image vocabulary
– Duygulu, Barnard, et al.
- 2002
|
|
129
|
Intelligent access to digital video: informedia project
– Wactlar, Kanade, et al.
- 1996
|
|
69
|
A fully automated content-based video search engine supporting spatiotemporal queries
– Chang, Chen, et al.
- 1998
|
|
44
|
Probabilistic multimedia objects Multijects: A novel approach to video indexing and retrieval in multimedia systems
– Naphade, Kristjansson, et al.
- 1998
|
|
35
|
Automatic Video Database Indexing and Retrieval
– Ardizzone, Cascia
- 1997
|
|
33
|
Video Collaborative Annotation Forum: Establishing
– Lin, Tseng, et al.
- 2003
|
|
30
|
Spatio-temporal segmentation of video by hierarchical mean shift analysis
– DeMenthon
- 2002
|
|
21
|
From feature to semantics: some preliminary results
– Zhao, Grosky
|
|
20
|
Efficiently computing a good segmentation
– Felzenschwab, Huttenlocher
|
|
19
|
An effective region-based image retrieval framework,” presented at
– Jing, Li, et al.
- 2002
|
|
14
|
Indexing audio documents by using latent semantic analysis and som
– Kurimo
- 1999
|
|
4
|
Latent semantic indexing for video content modeling and analysis
– Souvannavong, Merialdo, et al.
- 2003
|
|
2
|
Learning visual keywords for content-based retrieval
– Lim
- 1999
|
|
2
|
Benoît Huet, “Video content modeling with latent semantic analysis
– Souvannavong, Merialdo
- 2003
|
|
2
|
Perceptual image indexing and retrieval
– Mirmehdi, Perissamy
- 2002
|
|
1
|
Benoît Huet. Latent semantic analysis for semantic content detection of video shots
– Souvannavong, Merialdo
- 2004
|
|
1
|
Sushil Bhattacharjee, and MuratKunt. Spatio-temporal segmentation based on region merging
– Moscheni
- 1998
|