DocumentCode
1991529
Title
A multisample criterion for changepoint analysis of texts
Author
Zakrevskaya, N.S.
Author_Institution
Novosibirsk State Tech. Univ., Russia
fYear
2005
fDate
26 June-2 July 2005
Firstpage
749
Lastpage
750
Abstract
We construct a criterion to differ homogeneous and non-homogeneous texts. This criterion is based on triplets´ frequencies analysis: we find the most deviated corresponding empirical bridge and analyze its deviation. The approach can differ homogeneous and non-homogeneous texts.
Keywords
natural languages; text analysis; homogeneous texts; nonhomogeneous texts; text changepoint analysis; text identification; triplet frequencies analysis; Bridges; Frequency conversion; Libraries; Sections; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Science and Technology, 2005. KORUS 2005. Proceedings. The 9th Russian-Korean International Symposium on
Print_ISBN
0-7803-8943-3
Type
conf
DOI
10.1109/KORUS.2005.1507893
Filename
1507893
Link To Document