DocumentCode
2256081
Title
Data collection of Japanese dialects and its influence into speech recognition
Author
Kudo, Ikuo ; Nakama, Takao ; Watanabe, Tomoko ; Kameyama, Reiko
Author_Institution
Tsukuba R&D Center Ltd., Texas Instrum., Ibaraki, Japan
Volume
4
fYear
1996
fDate
3-6 Oct 1996
Firstpage
2021
Abstract
Reports the successful completion of a Japanese POLYPHONE project-the Voice Across Japan (VAJ) data collection project. The database has the following characteristics: (1) a large speaker database (8,866 speakers) through a telephone line, (2) gathering of the participants´ personal information such as gender, age, place where they grew up, and so on, and (3) data segmented by phone or word boundaries. This paper describes several aspects of Japanese dialects and also reports the results of experiments. How much does dialect influence speech recognition? In our results, dialect influences the speech recognition rate by 2-4%. The results are useful information for building practical speech recognition systems as well as for data collection
Keywords
data acquisition; database management systems; languages; linguistics; speech processing; speech recognition; Japanese POLYPHONE project; Japanese dialects; Voice Across Japan project; data collection; data segmentation; personal information; phone boundaries; speaker database; speech recognition; telephone line; word boundaries; Databases; Information analysis; Instruments; Research and development; Sampling methods; Speech analysis; Speech recognition; Telephony; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607195
Filename
607195
Link To Document