  Phonetic Confusion Based Document Expansion for Spoken Document Retrieval

unknown authors
http://www.nue.tu-berlin.de/publications/papers/ICSLP2004_Moreau.header.pdf

Abstract:

This paper presents a phone-based approach to spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. We describe an indexing and retrieval system that uses phonetic information only. The retrieval method is based on the vector space IR model, using phone N-grams as indexing terms. We propose a technique to expand the representation of documents by means of phone confusion probabilities in order to improve the retrieval performance. This method is tested on a collection of short German spoken documents, using 10 city names as queries.
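
The approach summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual weighting or expansion formulas, so everything below is an assumption made for illustration: the function names (`phone_ngrams`, `expand`, `cosine`), the single-phone substitution scheme, the `threshold` parameter, the phone symbols, and the confusion probability are all hypothetical. The sketch indexes a phone sequence by its N-grams, adds confusable variants of each document N-gram weighted by a confusion probability, and ranks by cosine similarity as in a vector space model.

```python
from collections import Counter
from math import sqrt


def phone_ngrams(phones, n=3):
    """Return the overlapping phone N-grams of a phone sequence as tuples."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def expand(counts, confusion, threshold=0.1):
    """Expand a document's N-gram weights with confusable variants.

    confusion[p][q] is an assumed probability that a recognized phone p
    actually corresponds to phone q. Each variant replaces one phone and
    receives the original weight scaled by that probability.
    (Hypothetical scheme, not taken from the paper.)
    """
    expanded = Counter(counts)
    for gram, weight in counts.items():
        for pos, phone in enumerate(gram):
            for alt, prob in confusion.get(phone, {}).items():
                if alt != phone and prob >= threshold:
                    variant = gram[:pos] + (alt,) + gram[pos + 1:]
                    expanded[variant] += weight * prob
    return expanded


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(term, 0.0) for term, w in a.items())
    norm = sqrt(sum(w * w for w in a.values())) * sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0


# Toy example: the query is a city name; the document contains a
# recognition error on the first phone ("p" instead of "b").
query = Counter(phone_ngrams(["b", "E", "r", "l", "i", "n"]))
document = Counter(phone_ngrams(["p", "E", "r", "l", "i", "n"]))
confusion = {"p": {"b": 0.3}}  # made-up value for illustration

plain_score = cosine(query, document)
expanded_score = cosine(query, expand(document, confusion))
print(plain_score, expanded_score)
```

In this toy case the expanded document regains a partial match on the misrecognized trigram, so its score against the query is higher than without expansion. How the real system estimates confusion probabilities and weights the added terms is not stated in the abstract.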

Citations

44 Introduction to MPEG-7 – Manjunath, Salembier, et al. - 2002
31 The Application of Classical Information Retrieval Techniques to Spoken Documents – James - 1995
22 Subword-based Approaches for Spoken Document Retrieval – Ng - 2000
3 SpokenContent Representation in MPEG-7 – Charlesworth, Garner - 2001
3 Using Syllable-based Indexing Features and Language Models to Improve German Spoken Document Retrieval, Eurospeech'03 – Larson, Eickeler - 2003
3 Combination of Phone N-Grams for a MPEG-7-based Spoken Document Retrieval System – Moreau, Kim, Sikora
2 New Techniques for Open-Vocabulary Spoken Document Retrieval, SIGIR'98 – Wechsler, Munteanu, Schäuble - 1998
2 Evaluation Measures, 10th Text Retrieval Conference – TREC - 2001