Title :
An activity based spoken language corpus of Nepali
Author :
Allwood, Julian ; Regmi, Bhim Narayan ; Dhakhwa, Sagun ; Uranw, R.K.
Author_Institution :
Univ. of Gothenburg, Gothenburg, Sweden
Abstract :
Language is used for communication and communication facilitates social activities. If we want to capture this, linguistic investigation has to be carried out within a wider context. Examination of linguistic communication in a wider context shows that it is multimodal. In order to study naturalistic multimodal communication using a corpus, the corpus should contain a combination of recordings, documentation, and transcription of multimodal communication from different social activities in naturalistic settings, preserving unedited conversation. This paper presents a brief account of the principles, methodology, current status, and preliminary findings, based on an incrementally growing and multimodal activity based spoken language corpus of Nepali.
Keywords :
linguistics; natural language processing; Nepali language; activity based spoken language corpus; multimodal communication documentation; multimodal communication recording; multimodal communication transcription; naturalistic multimodal communication; Feature extraction; Hospitals; Interviews; Pragmatics; Seminars; Speech; TV; NSC; Nepali language; activity based; multimodal; spoken language corpus;
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
DOI :
10.1109/ICSDA.2012.6422472