Title :
An approach to automatic acquisition of translation templates based on phrase structure extraction and alignment
Author :
Hu, Rile ; Zong, Chengqing ; Xu, Bo
Abstract :
In this paper, we propose a new approach for automatically acquiring translation templates from unannotated bilingual spoken language corpora. Two basic algorithms are adopted: a grammar induction algorithm, and an alignment algorithm using bracketing transduction grammar. The approach is unsupervised, statistical, and data-driven, and employs no parsing procedure. The acquisition procedure consists of two steps. First, semantic groups and phrase structure groups are extracted from both the source language and the target language. Second, an alignment algorithm based on bracketing transduction grammar aligns the phrase structure groups. The aligned phrase structure groups are post-processed, yielding translation templates. Preliminary experimental results show that the algorithm is effective
Keywords :
grammars; language translation; linguistics; natural languages; automatic acquisition; bracketing transduction grammar; grammar induction algorithm; phrase structure alignment; phrase structure extraction; phrase structure groups; semantic groups; source language; target language; translation templates acquisition; unannotated bilingual spoken language corpora; Associate members; Automation; Knowledge acquisition; Laboratories; Natural language processing; Natural languages; Pattern recognition; Statistical analysis; Tagging; Veins; Bilingual grammar induction; machine translation; structure alignment; translation template acquisition;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TSA.2005.858521