Phonetic Confusion Based Document Expansion for Spoken Document Retrieval
Abstract:
This paper presents a phone-based approach of spoken document retrieval (SDR), developed in the framework of the emerging MPEG-7 standard. We describe an indexing and retrieval system that uses phonetic information only. The retrieval method is based on the vector space IR model, using phone N-grams as indexing terms. We propose a technique to expand the representation of documents by means of phone confusion probabilities in order to improve the retrieval performance. This method is tested on a collection of short German spoken documents, using 10 city names as queries. 1.
Citations
| 44 | Introduction to MPEG-7 – Manjunath, Salembier, et al. - 2002 |
| 31 | The Application of Classical Information Retrieval Techniques to Spoken Documents – James - 1995 |
| 22 | Subword-based Approaches for Spoken Document Retrieval – Ng - 2000 |
| 3 | SpokenContent Representation in MPEG-7 – Charlesworth, Garner - 2001 |
| 3 | Using Syllable-based Indexing Features and Language Models to improve German Spoken Document Retrieval", Eurospeech’03 – Larson, Eickeler - 2003 |
| 3 | Sikora T., "Combination of Phone N-Grams for a MPEG-7-based Spoken Document Retrieval System – Moreau, Kim |
| 2 | Schäuble P., "New Techniques for Open-Vocabulary Spoken Document Retrieval", SIGIR'98 – Wechsler, Munteanu - 1998 |
| 2 | Evaluation Measures", 10th Text Retrieval Conference – TREC - 2001 |

