• DocumentCode
    478636
  • Title

    An Empirical Measure on the Set of Symbols Occurring in Engineering Mathematics Texts

  • Author

    Watt, Stephen M.

  • fYear
    2008
  • fDate
    16-19 Sept. 2008
  • Firstpage
    557
  • Lastpage
    564
  • Abstract
    Certain forms of mathematical expression are used more often than others in practice. A quantitative understanding of actual usage can provide additional information to improve the accuracy of software for the input of mathematical expressions from scanned documents or handwriting and more natural forms of presentation of mathematical expressions by computer algebra systems. Earlier work has examined this question for the diverse set of articles from the mathematics preprint archive arXiv.org. That analysis showed showed the variance between mathematical areas. The present work analyzes a particular mathematical domain more deeply. We have chosen to examine second year university engineering mathematics as taught in North America as the domain. We have analyzed the set of expressions occurring in the most popular textbooks, weighted by popularity. Assuming that early training influences later mathematical usage, we take this as a model of the set of mathematical expressions used by the population of North American engineers.  We present an empirical analysis of the symbols and $n$-grams occurring in these expressions.
  • Keywords
    Character recognition; Information analysis; Large-scale systems; Mathematics; Optical character recognition software; Pattern recognition; Performance analysis; Spatial databases; Text analysis; Text recognition; Mathematical document analysis; engineering mathematics; n-gram frequencies; symbol frequencies;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on
  • Conference_Location
    Nara, Japan
  • Print_ISBN
    978-0-7695-3337-7
  • Type

    conf

  • DOI
    10.1109/DAS.2008.82
  • Filename
    4670006