Regional Information Center for Science and Technology - Spoken WordCloud: Clustering recurrent patterns in speech

DocumentCode :

2580622

Title :

Spoken WordCloud: Clustering recurrent patterns in speech

Author :

Flamary, Rémi ; Anguera, Xavier ; Oliver, Nuria

Author_Institution :

LITIS EA 4108, Univ. de Rouen, St. Etienne-du-Rouvray, France

fYear :

2011

fDate :

13-15 June 2011

Firstpage :

133

Lastpage :

138

Abstract :

The automatic summarization of speech recordings is typically carried out as a two-step process: the speech is first decoded using an automatic speech recognition system, and the resulting text transcripts are then processed to create a summary. However, this approach might not be suitable in adverse acoustic conditions or when applied to languages with limited training resources. To address these limitations, we propose an automatic speech summarization method based on the automatic discovery of recurrent patterns in the speech: recurrent acoustic patterns are first extracted from the audio and are then clustered and ranked according to the number of repetitions, creating an approximate acoustic summary of what was spoken. This approach allows us to build what we call a “Spoken WordCloud”, named for its similarity to text-based word clouds. We present an algorithm that achieves a cluster purity of up to 90% and an inverse purity of 71% in preliminary experiments using a small dataset of connected spoken words.
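
The two figures quoted in the abstract, cluster purity and inverse purity, can be illustrated with a minimal sketch. It assumes the common textbook definitions (purity: the fraction of items that share the majority reference label of their cluster; inverse purity: the same measure with clusters and reference classes swapped); the record does not confirm that the authors used exactly these formulas. The function names `purity`, `inverse_purity`, and `rank_by_repetitions` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def purity(clusters, labels):
    """Fraction of items carrying the majority label of their cluster.

    clusters: list of lists of item ids (assumed non-overlapping).
    labels:   dict mapping item id -> reference label.
    """
    total = sum(len(c) for c in clusters)
    # For each non-empty cluster, count items with its most common label.
    matched = sum(
        Counter(labels[i] for i in c).most_common(1)[0][1]
        for c in clusters
        if c
    )
    return matched / total


def inverse_purity(clusters, labels):
    """Purity with the roles of clusters and reference classes swapped."""
    # Map every item to the index of the cluster that contains it.
    cluster_of = {i: k for k, c in enumerate(clusters) for i in c}
    # Group the clustered items by reference label.
    classes = {}
    for i in cluster_of:
        classes.setdefault(labels[i], []).append(i)
    return purity(list(classes.values()), cluster_of)


def rank_by_repetitions(clusters):
    """Order clusters from most to fewest members (assumed ranking rule)."""
    return sorted(clusters, key=len, reverse=True)
```

For example, with seven pattern occurrences labelled `a, a, a, a, b, b, b` and grouped as `[[0, 1, 2], [3], [4, 5], [6]]`, every cluster is pure (purity 1.0), while the over-segmentation lowers inverse purity to 5/7. The two measures are usually reported together for this reason.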

Keywords :

pattern clustering; speech recognition; Spoken WordCloud; automatic speech recognition system; automatic speech summarization method; clustering recurrent patterns; recurrent acoustic patterns; speech recordings; text-based word-clouds; Acoustic measurements; Acoustics; Clustering algorithms; Databases; Speech; Speech recognition;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Content-Based Multimedia Indexing (CBMI), 2011 9th International Workshop on

Conference_Location :

Madrid

ISSN :

1949-3983

Print_ISBN :

978-1-61284-432-9

Electronic_ISBN :

1949-3983

Type :

conf

DOI :

10.1109/CBMI.2011.5972534

Filename :

5972534

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2580622