• DocumentCode
    730813
  • Title

    Content-based recommender systems for spoken documents

  • Author

    Wintrode, Jonathan ; Sell, Gregory ; Jansen, Aren ; Fox, Michelle ; Garcia-Romero, Daniel ; McCree, Alan

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    5201
  • Lastpage
    5205
  • Abstract
    Content-based recommender systems use preference ratings and features that characterize media to model users´ interests or information needs for making future recommendations. While previously developed in the music and text domains, we present an initial exploration of content-based recommendation for spoken documents using a corpus of public domain internet audio. Unlike familiar speech technologies of topic identification and spoken document retrieval, our recommendation task requires a more comprehensive notion of document relevance than bags-of-words would supply. Inspired by music recommender systems, we automatically extract a wide variety of content-based features to characterize non-linguistic aspects of the audio such as speaker, language, gender, and environment. To combine these heterogeneous information sources into a single relevance judgement, we evaluate feature, score, and hybrid fusion techniques. Our study provides an essential first exploration of the task and clearly demonstrates the value of a multisource approach over a bag-of-words baseline.
  • Keywords
    document handling; information retrieval; recommender systems; speech recognition; bag-of-words baseline; content-based recommender systems; document relevance; heterogeneous information sources; preference ratings; public domain Internet audio; spoken document retrieval; topic identification; Acoustics; Feature extraction; Logistics; Recommender systems; Speech; Speech processing; Videos; Content-based recommendation; i-vectors; low resource; speech retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178963
  • Filename
    7178963