DocumentCode :
3490110
Title :
A Corpus-Based Analysis of Mixed Code in Hong Kong Speech
Author :
Lee, Jeyull
Author_Institution :
Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Hong Kong, China
fYear :
2012
fDate :
13-15 Nov. 2012
Firstpage :
165
Lastpage :
168
Abstract :
We present a corpus-based analysis of the use of mixed code in Hong Kong speech. From transcriptions of Cantonese television programs, we identify English words embedded within Cantonese utterances, and investigate the motivations for such code-switching. Among the many motivations observed in previous research, we found that four alone account for more than 95% of the use of English words in our speech data across genres, genders, and age groups. We performed analyses over more than 60 hours of transcribed speech, resulting in one of the largest empirical studies to-date on this linguistic phenomenon.
Keywords :
linguistics; natural language processing; speech coding; text analysis; Cantonese television program transcription; Cantonese utterances; English words identification; Hong Kong speech; code-switching; corpus linguistics; corpus-based analysis; mixed code; speech data; Educational institutions; Pragmatics; Registers; Speech; Speech coding; TV; Taxonomy; Cantonese; English; code-mixing; code-switching; corpus linguistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
Type :
conf
DOI :
10.1109/IALP.2012.10
Filename :
6473722
Link To Document :
بازگشت