Results 1 - 10
of
11
SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word
- IEEE Trans. Audio, Speech, and Language Processing
, 2005
"... Abstract—Advances in formulating spoken document retrieval for a new National Gallery of the Spoken Word (NGSW) are addressed. NGSW is the first large-scale repository of its kind, consisting of speeches, news broadcasts, and recordings from the 20th century. After presenting an overview of the audi ..."
Abstract
-
Cited by 22 (5 self)
- Add to MetaCart
(Show Context)
Abstract—Advances in formulating spoken document retrieval for a new National Gallery of the Spoken Word (NGSW) are addressed. NGSW is the first large-scale repository of its kind, consisting of speeches, news broadcasts, and recordings from the 20th century. After presenting an overview of the audio stream content of the NGSW, with sample audio files from U.S. Presidents from 1893 to the present, an overall system diagram is proposed with a discussion of critical tasks associated with effective audio in-formation retrieval. These include advanced audio segmentation, speech recognition model adaptation for acoustic background noise and speaker variability, and information retrieval using natural language processing for text query requests that include document and query expansion. For segmentation, a new eval-uation criterion entitled fused error score (FES) is proposed, followed by application of the CompSeg segmentation scheme on
Dialogue strategy to clarify user’s queries for document retrieval system with speech interface
- Speech Comm
"... interface ..."
Collecting spontaneously spoken queries for information retrieval
- In Proceedings of 4th International Conference on Language Resources and Evaluation
, 2004
"... Motivated to realize the speech-driven information retrieval systems that accept spontaneously spoken queries, we developed a method to collect such speech data derived from the pre-defined search topics that had been systematically constructed for IR research. In order to evaluate both our method a ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
Motivated to realize the speech-driven information retrieval systems that accept spontaneously spoken queries, we developed a method to collect such speech data derived from the pre-defined search topics that had been systematically constructed for IR research. In order to evaluate both our method and the performance of the document retrieval by using the spontaneously spoken queries, we took place two experiments of collecting the speech data by our method using publicly available test collections of evaluating document retrieval. The first preliminary experiment took place with relatively small number of search topics selected from the NTCIR-3 Web retrieval collection, which had been constructed for the TREC-style evaluation workshop, in order to test our method. The second experiment took place with all of the search topics released from the NTCIR-4 Web task to participate the formal run of the evaluation. The information about the collected data and the result of the evaluation with respect to both the speech recognition accuracy and the precision of document retrieval by using the collected data are presented in this paper. 1.
Speech-based information retrieval system with clarification dialogue strategy
- in Proc. Human Language Technology Conf. (HLT/EMNLP
, 2005
"... This paper addresses a dialogue strategy to clarify and constrain the queries for speech-driven document retrieval systems. In spoken dialogue interfaces, users often make utterances before the query is completely generated in their mind; thus input queries are often vague or fragmental. As a result ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
This paper addresses a dialogue strategy to clarify and constrain the queries for speech-driven document retrieval systems. In spoken dialogue interfaces, users often make utterances before the query is completely generated in their mind; thus input queries are often vague or fragmental. As a result, usually many items are matched. We propose an efficient dialogue framework, where the system dynamically selects an optimal question based on information gain (IG), which represents reduction of matched items. A set of possible questions is prepared using various knowledge sources. As a bottom-up knowledge source, we extract a list of words that can take a number of objects and potentially causes ambiguity, using a dependency structure analysis of the document texts. This is complemented by top-down knowledge sources of metadata and handcrafted questions. An experimental evaluation showed that the method significantly improved the success rate of retrieval, and all categories of the prepared questions contributed to the improvement. 1
Construction and Analysis of Corpus of Japanese Classroom Lecture Speech Contents
"... ..."
(Show Context)
Dialogue Strategy to Clarify User’s Queries for Document Retrieval System with Speech Interface
"... ..."
(Show Context)
Development of a dialogue system for Web retrieval *
"... Recently automatic speech recognition has become a practical technology, and is now used in real-world applications, such as information retrieval. Speech-driven Web retrieval, in which spoken queries ..."
Abstract
- Add to MetaCart
(Show Context)
Recently automatic speech recognition has become a practical technology, and is now used in real-world applications, such as information retrieval. Speech-driven Web retrieval, in which spoken queries
Experiments on Web Retrieval Driven by Spontaneously Spoken Queries
, 2003
"... Motivated to realize the speech-driven information retrieval systems that accept spontaneously spoken queries, we developed a method to collect such speech data derived from the pre-defined search topics that had been systematically constructed for IR research. In order to evaluate both our method a ..."
Abstract
- Add to MetaCart
(Show Context)
Motivated to realize the speech-driven information retrieval systems that accept spontaneously spoken queries, we developed a method to collect such speech data derived from the pre-defined search topics that had been systematically constructed for IR research. In order to evaluate both our method and the performance of the document retrieval by using the spontaneously spoken queries, we took place two experiments of collecting the speech data by our method using publicly available test collections of evaluating document retrieval. The first preliminary experiment took place with relatively small number of search topics selected from the NTCIR-3 Web retrieval collection, in order to test our method. The second experiment took place with all of the search topics released from the NTCIR-4 Web task to participate the formal run of the evaluation. The information about the collected data and the result of the evaluation with respect to both the speech recognition accuracy and the precision of document retrieval by using the collected data are presented in this paper. 1
unknown title
"... Abstract The objective of this research is to construct a video searching mechanism and speech interface on the multimedia cross-platform, namely TV and Internet, which requires the capability to deal with dynamic contents. Current NetTv enables users to search both recorded TV contents and news on ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract The objective of this research is to construct a video searching mechanism and speech interface on the multimedia cross-platform, namely TV and Internet, which requires the capability to deal with dynamic contents. Current NetTv enables users to search both recorded TV contents and news on the Internet by simply speaking keywords as a query; hence the videos related to the keyword spoken are retrieved. Also, the system provides a simple keyword based QA system to answer various questions that may occur to users whilst watching retrieved videos. In this way, NetTv improves the usability of video searching and viewing in a hands free way.