  • DocumentCode
    2022166
  • Title
    A Case-Based Reasoning Approach for Invoice Structure Extraction
  • Author
    Hamza, Hatem; Belaid, Yolande; Belaid, Abdel
  • Author_Institution
    ITESOFT, Aimargues
  • Volume
    1
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    327
  • Lastpage
    331
  • Abstract
    This paper presents the use of case-based reasoning (CBR) for invoice structure extraction and analysis. The method, called CBR-DIA (CBR for document invoice analysis), is adaptive and requires no prior training. It analyses a document by retrieving and analysing similar documents or document elements (cases) stored in a database. The retrieval step relies on graph comparison techniques such as graph probing and edit distance. The analysis step uses the information found in the nearest retrieved cases. Applied to 950 invoices, CBR-DIA reaches a recognition rate of 85.29% for documents of known classes and 76.33% for documents of unknown classes.
  • Keywords
    case-based reasoning; document image processing; feature extraction; graph theory; image retrieval; optical character recognition; OCR; case-based reasoning approach; document database; document invoice analysis; document retrieval; edit distance; graph comparison technique; graph probing; invoice structure extraction; Artificial intelligence; Data mining; Databases; Image analysis; Information analysis; Information retrieval; Problem-solving; Tagging; Terminology; Text analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.2007.4378726
  • Filename
    4378726
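
The abstract names graph probing as one of the retrieval techniques. The record does not say how CBR-DIA builds its graphs or which probes it uses, so the following is only a minimal sketch under stated assumptions: each document is a small labelled graph, a probe counts vertices by (label, degree), and the distance between two documents is the L1 difference of their probe counts. The function names, the graph encoding and the sample invoices are all hypothetical.

```python
from collections import Counter


def probe(labels, edges):
    """Summarise a graph as counts of (vertex label, vertex degree) pairs.

    labels: dict mapping vertex id -> label (assumed encoding)
    edges:  list of undirected (u, v) pairs
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # Vertices without edges get degree 0 (Counter returns 0 for missing keys).
    return Counter((label, degree[node]) for node, label in labels.items())


def probe_distance(g1, g2):
    """L1 distance between the probe counts of two graphs."""
    p1, p2 = probe(*g1), probe(*g2)
    return sum(abs(p1[key] - p2[key]) for key in p1.keys() | p2.keys())


def nearest_case(query, case_base):
    """Return the name of the stored case closest to the query graph."""
    return min(case_base, key=lambda name: probe_distance(query, case_base[name]))


# Hypothetical invoice graphs: vertices are detected fields, edges are
# adjacency relations between them. None of this data comes from the paper.
query = ({"a": "keyword", "b": "amount", "c": "date"}, [("a", "b")])
case_a = ({"x": "keyword", "y": "amount", "z": "date"}, [("x", "y")])
case_b = ({"x": "keyword", "y": "amount"}, [("x", "y")])
case_base = {"case_a": case_a, "case_b": case_b}

best = nearest_case(query, case_base)
```

A probe comparison of this kind is cheap because it never matches vertices one to one, which is why it suits a first retrieval pass over a case database; a costlier measure such as edit distance can then be applied to the few closest cases.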