• DocumentCode
    2256081
  • Title

    Data collection of Japanese dialects and its influence into speech recognition

  • Author

    Kudo, Ikuo ; Nakama, Takao ; Watanabe, Tomoko ; Kameyama, Reiko

  • Author_Institution
    Tsukuba R&D Center Ltd., Texas Instruments, Ibaraki, Japan
  • Volume
    4
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    2021
  • Abstract
    Reports the successful completion of a Japanese POLYPHONE project, the Voice Across Japan (VAJ) data collection project. The database has the following characteristics: (1) a large speaker database (8,866 speakers) collected over a telephone line, (2) the participants' personal information, such as gender, age, and the place where they grew up, and (3) data segmented by phone or word boundaries. This paper describes several aspects of Japanese dialects and reports the results of experiments on how much dialect influences speech recognition. In our results, dialect influences the speech recognition rate by 2-4%. The results are useful for building practical speech recognition systems as well as for data collection.
  • Keywords
    data acquisition; database management systems; languages; linguistics; speech processing; speech recognition; Japanese POLYPHONE project; Japanese dialects; Voice Across Japan project; data collection; data segmentation; personal information; phone boundaries; speaker database; speech recognition; telephone line; word boundaries; Databases; Information analysis; Instruments; Research and development; Sampling methods; Speech analysis; Speech recognition; Telephony; Testing; Training data;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607195
  • Filename
    607195