مرکز منطقه ای اطلاع رساني علوم و فناوري - A Corpus-Based Analysis of Mixed Code in Hong Kong Speech

DocumentCode :

3490110

Title :

A Corpus-Based Analysis of Mixed Code in Hong Kong Speech

Author :

Lee, Jeyull

Author_Institution :

Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Hong Kong, China

fYear :

2012

fDate :

13-15 Nov. 2012

Firstpage :

165

Lastpage :

168

Abstract :

We present a corpus-based analysis of the use of mixed code in Hong Kong speech. From transcriptions of Cantonese television programs, we identify English words embedded within Cantonese utterances, and investigate the motivations for such code-switching. Among the many motivations observed in previous research, we found that four alone account for more than 95% of the use of English words in our speech data across genres, genders, and age groups. We performed analyses over more than 60 hours of transcribed speech, resulting in one of the largest empirical studies to-date on this linguistic phenomenon.

Keywords :

linguistics; natural language processing; speech coding; text analysis; Cantonese television program transcription; Cantonese utterances; English words identification; Hong Kong speech; code-switching; corpus linguistics; corpus-based analysis; mixed code; speech data; Educational institutions; Pragmatics; Registers; Speech; Speech coding; TV; Taxonomy; Cantonese; English; code-mixing; code-switching; corpus linguistics;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Asian Language Processing (IALP), 2012 International Conference on

Conference_Location :

Hanoi

Print_ISBN :

978-1-4673-6113-2

Electronic_ISBN :

978-0-7695-4886-9

Type :

conf

DOI :

10.1109/IALP.2012.10

Filename :

6473722

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3490110