See this document in CiteSeerX!

Audio-Visual Speaker Recognition for Video Broadcast News: Some Fusion Techniques (1999)  (Make Corrections)  (1 citation)
Benoit Maison, Chalapathy Neti, Andrew Senior



  Home/Search   Context   Related

 
View or download:
ibm.com/AVSTG/mmsp99_spid.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  ibm.com/AVSTG/cnetipubs (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Audio-based speaker identi#cation degrades severely when there is a mismatchbetween training and test conditions either due to channel or noise. In this paper, we explore various techniques to fuse video based speaker identi#cation with audio-based speaker identi#cation to improve the performance under mismatched conditions. Speci#cally,we explore techniques to optimally determine the relativeweights of the independent decisions based on audio and video to achieve the best combination.... (Update)

Similar documents based on text:   More   All
3.8:   On The Use Of Visual Information For Improving Audio-Based .. - Senior, Neti, Maison (1999)   (Correct)
0.5:   Selective Use Of The Speech Spectrum And A Vqgmm.. - Lin, Jan, Che, Yuk.. (1996)   (Correct)
0.2:   ngerprint, face and speech", in Audio- and Video-based.. - Chalapathy Neti And   (Correct)

BibTeX entry:   (Update)

Benoit Maison, Chalapathy Neti, and Andrew Senior, "Audio-visual speaker recognition for video broadcast news: some fusion techniques, " in IEEE Multimedia Signal Processing (MMSP99), Denmark, September 1999. http://citeseer.ist.psu.edu/maison99audiovisual.html   More

@misc{ maison99audiovisual,
  author = "B. Maison and C. Neti and A. Senior",
  title = "Audio-visual speaker recognition for video broadcast news: some fusion
    techniques",
  text = "Benoit Maison, Chalapathy Neti, and Andrew Senior, Audio-visual speaker
    recognition for video broadcast news: some fusion techniques,  in IEEE Multimedia
    Signal Processing (MMSP99), Denmark, September 1999.",
  year = "1999",
  url = "citeseer.ist.psu.edu/maison99audiovisual.html" }
Citations not processed or no citations identified.

Documents on the same site (http://www.research.ibm.com/AVSTG/cnetipubs.html):   More
On The Use Of Visual Information For Improving Audio-Based .. - Senior, Neti, Maison (1999)   (Correct)
Detection Of Faces Under Shadows And Lighting Variations - Iyengar Neti Ibm (2001)   (Correct)
A Vision-based Microphone Switch for Speech Intent Detection - Giridharan Iyengar And (2001)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC