Title :
A Corpus-Based Analysis of Mixed Code in Hong Kong Speech
Author_Institution :
Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Hong Kong, China
Abstract :
We present a corpus-based analysis of the use of mixed code in Hong Kong speech. From transcriptions of Cantonese television programs, we identify English words embedded within Cantonese utterances, and investigate the motivations for such code-switching. Among the many motivations observed in previous research, we found that four alone account for more than 95% of the use of English words in our speech data across genres, genders, and age groups. We performed analyses over more than 60 hours of transcribed speech, resulting in one of the largest empirical studies to-date on this linguistic phenomenon.
Keywords :
linguistics; natural language processing; speech coding; text analysis; Cantonese television program transcription; Cantonese utterances; English words identification; Hong Kong speech; code-switching; corpus linguistics; corpus-based analysis; mixed code; speech data; Educational institutions; Pragmatics; Registers; Speech; Speech coding; TV; Taxonomy; Cantonese; English; code-mixing; code-switching; corpus linguistics;
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
DOI :
10.1109/IALP.2012.10