Abstract:
Phonetic matching is used in applications such as name retrieval, where the spelling of a name is used to identify other strings that are likely to be of similar pronunciation. In this paper we explain the parallels between information retrieval and phonetic matching, and describe our new phonetic matching techniques. Our experimental comparison with existing techniques such as Soundex and edit distances, which is based on recall and precision, demonstrates that the new techniques are superior. In addition, reasoning from the similarity of phonetic matching and information retrieval, we have applied combination of evidence to phonetic matching. Our experiments with combining demonstrate that it leads to substantial improvements in effectiveness.
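For readers unfamiliar with the two baseline techniques named above, the following is a minimal sketch of Soundex and of edit distance. It is not the new technique described in the paper. It assumes the commonly used American Soundex variant (four-character codes, with h and w not separating repeated codes) and the unit-cost Levenshtein edit distance; the function names `soundex` and `edit_distance` are illustrative choices, not taken from the paper.

```python
def soundex(name: str) -> str:
    """Four-character Soundex code (common American variant, assumed here)."""
    codes = {}
    for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in group:
            codes[ch] = digit

    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    prev = codes.get(letters[0], "")  # the first letter's code also suppresses repeats
    for ch in letters[1:]:
        code = codes.get(ch, "")      # vowels, y, h, w have no code
        if code and code != prev:
            result += code
        if ch not in "hw":            # h and w do not separate repeated codes
            prev = code
    return (result + "000")[:4]       # pad with zeros, truncate to four characters


def edit_distance(a: str, b: str) -> int:
    """Unit-cost Levenshtein distance: insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # delete ca
                            curr[j - 1] + 1,             # insert cb
                            prev[j - 1] + (ca != cb)))   # substitute or match
        prev = curr
    return prev[-1]
```

Under this sketch, two names are treated as a Soundex match when their codes are equal, whereas edit distance yields a ranking: candidate names can be sorted by increasing distance from the query, which is what makes evaluation by recall and precision natural.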