مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards multi-speaker unsupervised speech pattern discovery

DocumentCode :

2800329

Title :

Towards multi-speaker unsupervised speech pattern discovery

Author :

Zhang, Yaodong ; Glass, James R.

Author_Institution :

Comput. Sci. & Artificial Intell. Lab., MIT, Cambridge, MA, USA

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4366

Lastpage :

4369

Abstract :

In this paper, we explore the use of a Gaussian posteriorgram based representation for unsupervised discovery of speech patterns. Compared with our previous work, the new approach provides significant improvement towards speaker independence. The framework consists of three main procedures: a Gaussian posteriorgram generation procedure which learns an unsupervised Gaussian mixture model and labels each speech frame with a Gaussian posteriorgram representation; a segmental dynamic time warping procedure which locates pairs of similar sequences of Gaussian posteriorgram vectors; and a graph clustering procedure which groups similar sequences into clusters. We demonstrate the viability of using the posteriorgram approach to handle many talkers by finding clusters of words in the TIMIT corpus.

Keywords :

Gaussian processes; graph theory; speaker recognition; speech processing; Gaussian posteriorgram based representation; TIMIT corpus; graph clustering; multi-speaker unsupervised speech pattern discovery; unsupervised Gaussian mixture; unsupervised discovery; Artificial intelligence; Computer science; Glass; Laboratories; Mel frequency cepstral coefficient; Signal processing; Speech processing; Speech recognition; Testing; Unsupervised learning; language acquisition; unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495637

Filename :

5495637

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2800329