DocumentCode
2765877
Title
E-mail signature block analysis
Author
Chen, Hao ; Hu, Jianying ; Sproat, Richard W.
Author_Institution
CENPARMI, Concordia Univ., Montreal, Que., Canada
Volume
2
fYear
1998
fDate
16-20 Aug 1998
Firstpage
1153
Abstract
The signature block is a common structured component found in e-mail messages. Accurate identification and analysis of signature blocks are important in many multimedia messaging and information retrieval applications such as e-mail text-to-speech rendering. Traditional text analysis methods designed to deal with sequential text cannot handle 2D structures, while the highly unconstrained nature of signature blocks makes the application of 2D grammars very difficult. In this paper we describe an algorithm for signature block analysis which combines 2D structural segmentation with 1D grammatical constraints. The information obtained from both geometrical and linguistic analysis are integrated in a form of weighted finite state transducers, and the final solution is the optimal interpretation under both constraints
Keywords
character recognition; computational linguistics; document image processing; electronic mail; grammars; multimedia communication; 1D grammatical constraints; 2D structural segmentation; e-mail messages; geometrical analysis; grammars; linguistic analysis; multimedia messaging; parsing; signature block; weighted finite state transducers; Algorithm design and analysis; Design methodology; Electronic mail; Information analysis; Information retrieval; Multimedia databases; Postal services; Speech synthesis; Text analysis; Transducers;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location
Brisbane, Qld.
ISSN
1051-4651
Print_ISBN
0-8186-8512-3
Type
conf
DOI
10.1109/ICPR.1998.711900
Filename
711900
Link To Document