DocumentCode :
2256081
Title :
Data collection of Japanese dialects and its influence into speech recognition
Author :
Kudo, Ikuo ; Nakama, Takao ; Watanabe, Tomoko ; Kameyama, Reiko
Author_Institution :
Tsukuba R&D Center Ltd., Texas Instrum., Ibaraki, Japan
Volume :
4
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
2021
Abstract :
Reports the successful completion of a Japanese POLYPHONE project-the Voice Across Japan (VAJ) data collection project. The database has the following characteristics: (1) a large speaker database (8,866 speakers) through a telephone line, (2) gathering of the participants´ personal information such as gender, age, place where they grew up, and so on, and (3) data segmented by phone or word boundaries. This paper describes several aspects of Japanese dialects and also reports the results of experiments. How much does dialect influence speech recognition? In our results, dialect influences the speech recognition rate by 2-4%. The results are useful information for building practical speech recognition systems as well as for data collection
Keywords :
data acquisition; database management systems; languages; linguistics; speech processing; speech recognition; Japanese POLYPHONE project; Japanese dialects; Voice Across Japan project; data collection; data segmentation; personal information; phone boundaries; speaker database; speech recognition; telephone line; word boundaries; Databases; Information analysis; Instruments; Research and development; Sampling methods; Speech analysis; Speech recognition; Telephony; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607195
Filename :
607195
Link To Document :
بازگشت