DocumentCode
478636
Title
An Empirical Measure on the Set of Symbols Occurring in Engineering Mathematics Texts
Author
Watt, Stephen M.
fYear
2008
fDate
16-19 Sept. 2008
Firstpage
557
Lastpage
564
Abstract
Certain forms of mathematical expression are used more often than others in practice. A quantitative understanding of actual usage can provide additional information to improve the accuracy of software for the input of mathematical expressions from scanned documents or handwriting and more natural forms of presentation of mathematical expressions by computer algebra systems. Earlier work has examined this question for the diverse set of articles from the mathematics preprint archive arXiv.org. That analysis showed showed the variance between mathematical areas. The present work analyzes a particular mathematical domain more deeply. We have chosen to examine second year university engineering mathematics as taught in North America as the domain. We have analyzed the set of expressions occurring in the most popular textbooks, weighted by popularity. Assuming that early training influences later mathematical usage, we take this as a model of the set of mathematical expressions used by the population of North American engineers. We present an empirical analysis of the symbols and $n$-grams occurring in these expressions.
Keywords
Character recognition; Information analysis; Large-scale systems; Mathematics; Optical character recognition software; Pattern recognition; Performance analysis; Spatial databases; Text analysis; Text recognition; Mathematical document analysis; engineering mathematics; n-gram frequencies; symbol frequencies;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
Conference_Location
Nara, Japan
Print_ISBN
978-0-7695-3337-7
Type
conf
DOI
10.1109/DAS.2008.82
Filename
4670006
Link To Document