Title :
Improving precision and recall for Soundex retrieval
Author :
Holmes, David ; McCabe, M. Catherine
Abstract :
We present a phonetic algorithm for name searches that fuses existing techniques [the Soundex system of Russell and the techniques of J. Celko (1995) and U. Pfeifer et al.] and that introduces new features. This combination offers improved precision and recall. The described experiments assign multiple phonetic codes to each name. Counting common phonetic codes and digrams, the experiments implement the Dice coefficient to assign a similarity score between names. We use the Pfeifer corpus and relevance assessments to compare and contrast our experimental results with traditional techniques.
Keywords :
linguistics; relevance feedback; Dice coefficient; Pfeifer corpus; Soundex retrieval; digrams; name searches; phonetic algorithm; phonetic codes; precision; recall; relevance assessments; similarity score; Computer errors; Cultural differences; Fuses; Government; Information retrieval; Information systems; Information technology; Libraries;
Conference_Titel :
Information Technology: Coding and Computing, 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1506-1
DOI :
10.1109/ITCC.2002.1000354