(Enter summary)
Abstract: Audio-based speaker identi#cation degrades severely when there is a
mismatchbetween training and test conditions either due to channel or
noise. In this paper, we explore various techniques to fuse video based
speaker identi#cation with audio-based speaker identi#cation to improve
the performance under mismatched conditions. Speci#cally,we explore
techniques to optimally determine the relativeweights of the independent
decisions based on audio and video to achieve the best combination.... (Update)
Context of citations to this paper: More
.... 4, 5] Joint processing of audio and visual information have been used successfully in speaker change, speaker identification etc as well[6, 7]. In this paper, we propose to use the visual channel for establishment of speech intent. One can argue that rather than using the...
Cited by: More
A Vision-based Microphone Switch for Speech Intent Detection - Giridharan Iyengar And (2001)
(Correct)
Similar documents (at the sentence level):
69.1%: On The Use Of Visual Information For Improving Audio-Based .. - Senior, Neti, Maison (1999)
(Correct)
11.2%: Audio-Visual Speaker Recognition for Video Broadcast News - Neti, Senior (1999)
(Correct)
5.6%: Joint Processing of Audio and Visual Information.. - Neti, Maison.. (2000)
(Correct)
System load high. Please wait...
Timeout. Please try your query later.
Similar documents based on text: More All
0.5: Selective Use Of The Speech Spectrum And A Vqgmm.. - Lin, Jan, Che, Yuk.. (1996)
(Correct)
0.2: ngerprint, face and speech", in Audio- and Video-based.. - Chalapathy Neti And
(Correct)
0.2: Perceptual Interfaces For Information Interaction: .. - Neti, Iyengar.. (2000)
(Correct)
BibTeX entry: (Update)
Benoit Maison, Chalapathy Neti, and Andrew Senior, "Audio-visual speaker recognition for video broadcast news: some fusion techniques, " in IEEE Multimedia Signal Processing (MMSP99), Denmark, September 1999. http://citeseer.ist.psu.edu/maison99audiovisual.html More
@misc{ maison99audiovisual,
author = "B. Maison and C. Neti and A. Senior",
title = "Audio-visual speaker recognition for video broadcast news: some fusion
techniques",
text = "Benoit Maison, Chalapathy Neti, and Andrew Senior, Audio-visual speaker
recognition for video broadcast news: some fusion techniques, in IEEE Multimedia
Signal Processing (MMSP99), Denmark, September 1999.",
year = "1999",
url = "citeseer.ist.psu.edu/maison99audiovisual.html" }
Citations not processed or no citations identified.
Documents on the same site (http://www.research.ibm.com/AVSTG/cnetipubs.html): More
On The Use Of Visual Information For Improving Audio-Based .. - Senior, Neti, Maison (1999)
(Correct)
Detection Of Faces Under Shadows And Lighting Variations - Iyengar Neti Ibm (2001)
(Correct)
A Vision-based Microphone Switch for Speech Intent Detection - Giridharan Iyengar And (2001)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC