Content-based recommender systems for spoken documents

Author

Wintrode, Jonathan ; Sell, Gregory ; Jansen, Aren ; Fox, Michelle ; Garcia-Romero, Daniel ; McCree, Alan

Author_Institution

Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA

fYear

2015

fDate

19-24 April 2015

Firstpage

5201

Lastpage

5205

Abstract

Content-based recommender systems use preference ratings and features that characterize media to model users´ interests or information needs for making future recommendations. While previously developed in the music and text domains, we present an initial exploration of content-based recommendation for spoken documents using a corpus of public domain internet audio. Unlike familiar speech technologies of topic identification and spoken document retrieval, our recommendation task requires a more comprehensive notion of document relevance than bags-of-words would supply. Inspired by music recommender systems, we automatically extract a wide variety of content-based features to characterize non-linguistic aspects of the audio such as speaker, language, gender, and environment. To combine these heterogeneous information sources into a single relevance judgement, we evaluate feature, score, and hybrid fusion techniques. Our study provides an essential first exploration of the task and clearly demonstrates the value of a multisource approach over a bag-of-words baseline.

Keywords

document handling; information retrieval; recommender systems; speech recognition; bag-of-words baseline; content-based recommender systems; document relevance; heterogeneous information sources; preference ratings; public domain Internet audio; spoken document retrieval; topic identification; Acoustics; Feature extraction; Logistics; Recommender systems; Speech; Speech processing; Videos; Content-based recommendation; i-vectors; low resource; speech retrieval;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178963

Filename

7178963