• DocumentCode
    301441
  • Title

    Information retrieval using letter tuples with neural network and nearest neighbor classifiers

  • Author

    Kjell, Bradley ; Woods, W. Addison ; Frieder, Ophir

  • Author_Institution
    C.S. Dept, Central CT State Univ., New Britain, CT, USA
  • Volume
    2
  • fYear
    1995
  • fDate
    22-25 Oct 1995
  • Firstpage
    1222
  • Abstract
    Previous work has shown that statistics of letter tuples extracted from text samples can be effective in determining authorship. These statistics have been used to create displays that visually separate the works of different authors, and have been used as input to neural network classifiers which can accurately discriminate between authors. Similar applications are described by Bennett (1976), Clausing (1993), and Damashek (1995). The present paper extends this work by testing the effectiveness of letter tuples in information retrieval systems using neural network classifiers and nearest neighbor classifiers as the retrieval method. Testing was performed using 855 full-text Wall Street Journal articles and 50 narrative queries. Performance of neural and nearest neighbor methods was similar, with the product of recall and precision exceeding 0.1 on the given data
  • Keywords
    information retrieval; information retrieval systems; neural nets; pattern classification; Wall Street Journal articles; authorship; information retrieval; letter tuples; narrative queries; nearest neighbor classifiers; neural network classifiers; statistics; Face recognition; Frequency; Information retrieval; Intelligent networks; Nearest neighbor searches; Neural networks; Pattern recognition; Speech recognition; Statistics; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 1995. Intelligent Systems for the 21st Century., IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    0-7803-2559-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1995.537938
  • Filename
    537938