Title :
Semantic annotations for conversational speech: From speech transcriptions to predicate argument structures
Author :
Bisazza, Arianna ; Dinarelli, Marco ; Quarteroni, Silvia ; Tonelli, Sara ; Moschitti, Alessandro ; Riccardi, Giuseppe
Author_Institution :
DISI, Univ. of Trento, Trento
Abstract :
In this paper, we describe the semantic content, which can be automatically generated, for the design of advanced dialog systems. Since the latter will be based on machine learning approaches, we created training data by annotating a corpus with the needed content. Given a sentence of our transcribed corpus, domain concepts and other linguistic levels ranging from basic ones, i.e. part-of-speech tagging and constituent chunking level, to more advanced ones, i.e. syntactic and predicate argument structure (PAS) levels are annotated. In particular, the proposed PAS and taxonomy of dialog acts appear to be promising for the design of more complex dialog systems. Statistics about our semantic annotation are reported.
Keywords :
interactive systems; learning (artificial intelligence); speech processing; advanced dialog systems; constituent chunking level; conversational speech; machine learning; part-of-speech tagging; predicate argument structures; speech transcriptions; Contracts; Humans; Machine learning; Man machine systems; Natural languages; Speech; Statistics; Tagging; Taxonomy; Training data;
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
DOI :
10.1109/SLT.2008.4777841