See this document in CiteSeerX!

Comparison of MPEG-7 Basis Projection Features and MFCC (2004)  (Make Corrections)  
applied to Robust Speaker Recognition Hyoung-Gook Kim, Martin Haller, Thomas...



  Home/Search   Context   Related

 
View or download:
ftsu01.nue.tuberlin....0778Kim2004.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  ftsu01.nue.tub...t=pubs&sort=rhc (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Our purpose is to evaluate the efficiency of MPEG-7 basis projection (BP) features vs. Mel-scale Frequency Cepstrum Coefficients (MFCC) for speaker recognition in noisy environments. The MPEG-7 feature extraction mainly consists of a Normalized Audio Spectrum Envelope (NASE), a basis decomposition algorithm and a spectrum basis projection. Prior to the feature extraction the noise reduction algorithm is performed by using a modified log spectral amplitude speech estimator (LSA) and a minima... (Update)

Similar documents (at the sentence level):
5.6%:   Comparison of MPEG-7 Basis Projection Features and MFCC.. - Kim, Haller, Sikora (2004)   (Correct)

Active bibliography (related documents):   More   All
0.4:   Comparison Of Mpeg-7 Audio Spectrum Projection Features And.. - Applied To Speaker (2004)   (Correct)
0.1:   Audio Classification Based on MPEG-7 Spectral - Basis Representations.. (2004)   (Correct)
0.1:   Learning Articulation from Cepstral Coefficients - Toutios, Margaritis (2005)   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

BibTeX entry:   (Update)

@misc{ robust-comparison,
  author = "Applied To Robust",
  title = "Comparison of MPEG-7 Basis Projection Features and MFCC",
  url = "citeseer.ist.psu.edu/760145.html" }
Citations (may not include all citations):
653   Fundamentals of speech recognition (context) - Rabiner, Juang - 1993
16   Tracking speech presence uncertainty to improve speech enahn.. - Malah, Cox et al. - 1999
14   Speech enhancement for non-stationary noise environments (context) - Cohen, Berdugo - 2001
10   MPEG-7 sound recognition tools (context) - Casey - 2001
7   Independent component analysis: algorithms and applications (context) - Hyvarinen, Oja - 2000
4   Speaker recognition using MPEG-7 descriptors (context) - Kim, Berdahl et al. - 2003

Documents on the same site (http://ftsu01.nue.tu-berlin.de/elvera/en/list.php?list=pubs&sort=rhc):   More
Lossless and Perceptual Coding of Digital Audio - Noll, Liebchen (2005)   (Correct)
Enhancement of Noisy Speech for Noise Robust Front-End.. - Kim, Schwab, Moreau.. (2003)   (Correct)
An Integrated System For Face Detection And Tracking - Goldmann Krinidis Nikolaidis (2005)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC