• DocumentCode
    302175
  • Title

    An efficient voice retrieval system for very-large-vocabulary Chinese textual databases with a clustered language model

  • Author

    Lin, Sung-Chien ; Chien, Lee-Feng ; Chen, Keh-Jiann ; Lee, Lin-shan

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    287
  • Abstract
    This paper presents an accurate and efficient voice retrieval system for very-large-vocabulary Chinese textual databases with a specially-designed clustered language model. To reduce the problems resulted from the complexity of unconstrained speech-input queries for retrieval, the system is completely syllable-based in both speech recognition and database retrieval by properly utilizing the mono-syllabic structure of Chinese language. In addition, it partitions the records in the database into clusters and trains the clustered language model using the clustering results. The proposed clustered language model with its augmented search algorithm are very useful to improve accuracy and speed of the speech retrieval system. In the preliminary tests using an experimental database with about 30,000 bibliographical records, it was found that the present system can accept unconstrained speech-input queries and achieve very good performance
  • Keywords
    bibliographic systems; information retrieval system evaluation; natural languages; speech recognition; augmented search algorithm; bibliographical records; clustered language model; database retrieval; experimental database; monosyllabic structure; speech recognition; speech retrieval system; syllable based system; system performance; unconstrained speech input queries; very large vocabulary Chinese textual databases; voice retrieval system; Clustering algorithms; Computer science; Data engineering; Information retrieval; Information science; Natural languages; Partitioning algorithms; Spatial databases; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.540414
  • Filename
    540414