Title :
Automatic detection and segmentation of pronunciation variants in German speech corpora
Author :
Kipp, Andreas ; Wesenick, Maria-Barbara ; Schiel, Florian
Author_Institution :
Inst. fur Phonetik und Sprachliche Kommunikation, Munchen Univ., Germany
Abstract :
In this paper we present a hybrid statistical and rule-based segmentation system which takes into account phonetic variation of German. Input to the system is the orthographic representation and the speech signal of an utterance to be segmented. The output is the transcription (SAM-PA) with the highest overall likelihood and the corresponding segmentation of the speech signal. The system consists of three main parts: In a first stage the orthographic representation is converted into a linear string of phonetic units by lexicon lookup. Phonetic rules are applied yielding a graph that contains the canonic form and presumed variations. In a second HMM-based stage the speech signal of the concerning utterance is time-aligned by a Viterbi search which is constrained by the graph of the first stage. The outcome of this stage is a string of phonetic labels and the corresponding segment boundaries. A rule-based refinement of the segment boundaries using phonetic knowledge takes place in a third stage
Keywords :
speech processing; speech recognition; German speech corpora; HMM-based stage; Viterbi search; automatic detection and segmentation; lexicon lookup; orthographic representation; phonetic knowledge; phonetic units; pronunciation variants; rule-based refinement; rule-based segmentation system; statistical segmentation system; Automatic speech recognition; Databases; Dictionaries; Hidden Markov models; Humans; Natural languages; Speech analysis; Speech processing; Speech synthesis; Viterbi algorithm;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607048