Speaker recognition using MPEG-7 descriptors, Eurospeech 2003 (2003)

by Hyoung-gook Kim, Edgar Berdahl, Nicolas Moreau, Thomas Sikora
Proceedings of EUROSPEECH
http://www.nue.tu-berlin.de/publications/papers/EuSp03-539.pdf

Abstract:

Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers. The feature extraction based on MPEG-7 descriptors consists of three main stages: Normalized Audio Spectrum Envelope (NASE), Principal Component Analysis (PCA) and Independent Component Analysis (ICA). An experimental study is presented in which speaker recognition rates are compared for different feature extraction methods. Using ICA, we achieved better results than with NASE and PCA in a speaker recognition system.
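
The first two feature extraction stages named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`nase_features`, `pca_reduce`), the frame length, hop size, and number of bands are hypothetical, and the bands here are equal-width groups of FFT bins rather than the logarithmically spaced bands an MPEG-7 extractor would use. The ICA stage and the hidden Markov model training are not shown; they would operate on the PCA-reduced features.

```python
import numpy as np

def nase_features(signal, frame_len=512, hop=256, n_bands=16):
    """Simplified normalized log-spectral envelope (assumed parameters)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop:i * hop + frame_len] * window for i in range(n_frames)
    ])
    # Power spectrum per frame, shape (n_frames, frame_len // 2 + 1).
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Assumption: equal-width bands over the non-DC bins, for brevity.
    bands = np.array_split(power[:, 1:], n_bands, axis=1)
    envelope = np.stack([b.sum(axis=1) for b in bands], axis=1)
    # Convert to a decibel scale, then normalize each frame to unit L2 norm.
    log_env = 10.0 * np.log10(envelope + 1e-12)
    norms = np.linalg.norm(log_env, axis=1, keepdims=True)
    return log_env / np.maximum(norms, 1e-12), norms.ravel()

def pca_reduce(features, n_components):
    """Project mean-centered features onto the top principal directions."""
    centered = features - features.mean(axis=0)
    # Rows of vt are the principal directions, ordered by singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components].T
    return centered @ basis, basis

# Synthetic noise stands in for a speech recording.
rng = np.random.default_rng(0)
signal = rng.standard_normal(16000)
nase, norms = nase_features(signal)
projected, basis = pca_reduce(nase, n_components=4)
```

Each row of `nase` is one frame's normalized envelope, and `projected` holds the reduced-dimension features that a later ICA step could transform further.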
