Abstract :
This article has described LSM, a data-driven framework for modeling globally meaningful relationships implicit in large volumes of data. LSM generalizes a paradigm originally developed to capture hidden word patterns in a text document corpus. Over the past decade, this paradigm has proven effective in an increasing variety of fields, gradually spreading from query-based information retrieval to word clustering, document/topic clustering, large-vocabulary speech recognition language modeling, automated call routing, semantic inference for spoken interface control, and several other speech processing applications.
Keywords :
information retrieval systems; speech recognition; automated call routing; data-driven framework; document clustering; hidden word patterns; large-vocabulary speech recognition language modeling; latent semantic mapping; query-based information retrieval; semantic inference; speech processing; spoken interface control; text document corpus; topic clustering; word clustering; Automatic control; Content based retrieval; Context modeling; Indexing; Information analysis; Information retrieval; Natural language processing; Natural languages; Routing; Speech recognition;