• DocumentCode
    3531298
  • Title

    Improving multi-lattice alignment based spoken keyword spotting

  • Author

    Lin, Hui ; Stupakov, Alex ; Bilmes, Jeff

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4877
  • Lastpage
    4880
  • Abstract
    In previous work, we showed that using a lattice instead of the 1-best path to represent both the query and the utterance being searched is beneficial for spoken keyword spotting. In this paper, we introduce several techniques that further improve our multi-lattice alignment approach, including edit operation modeling and supervised training of the conditional probability table, something which cannot be directly trained by traditional maximum likelihood estimation. Experiments on TIMIT show that the proposed methods significantly improve the performance of spoken keyword spotting.
  • Keywords
    audio databases; learning (artificial intelligence); maximum likelihood estimation; probability; query processing; speech processing; TIMIT; conditional probability table; edit operation modeling; maximum likelihood estimation; multilattice alignment-based spoken keyword spotting; search query; speech database; supervised training; Audio recording; Auditory displays; Cameras; Graphical models; Keyboards; Lattices; Legged locomotion; Maximum likelihood estimation; Microphones; Video recording; Spoken keyword spotting; auxiliary training; edit operation modeling; lattice alignment; negative training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960724
  • Filename
    4960724