• DocumentCode
    625011
  • Title

    Utterance Classification Using Linguistic and Non-linguistic Information for Network-Based Speech-to-Speech Translation Systems

  • Author

    Sugiura, Komei ; Ryong Lee ; Kashioka, Hideki ; Zettsu, Koji ; Kidawara, Yutaka

  • Author_Institution
    Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
  • Volume
    2
  • fYear
    2013
  • fDate
    3-6 June 2013
  • Firstpage
    212
  • Lastpage
    216
  • Abstract
    Network-based mobile services, such as speech-to-speech translation and voice search, enable the construction of large-scale log database including speech. We have developed a smartphone application called VoiceTra for speech-to-speech translation and have collected 10,000,000 utterances so far. This huge corpus is unique in size and spatio-temporal information; it contains information on anonymized user locations. This spatiotemporal corpus can be used for improving the accuracy of its speech recognition and machine translation, and it will open the door for the study of the location dependency of vocabulary and new applications for location-based services. This paper first analyzes the corpus and then presents a novel method for classifying utterances using linguistic and non-linguistic information. L2-regularized Logistic Regression is used for utterance classification. Our experiments performed on the VoiceTra log corpus revealed that our proposed method outperformed baseline methods in terms of F measure.
  • Keywords
    audio databases; language translation; linguistics; pattern classification; smart phones; spatiotemporal phenomena; speech recognition; vocabulary; L2-regularized logistic regression; VoiceTra log corpus; large-scale log database; linguistic information; location dependency; location-based services; machine translation accuracy improvement; network-based mobile services; network-based speech-to-speech translation system; nonlinguistic information; smartphone application; spatiotemporal corpus; spatiotemporal information; speech recognition accuracy improvement; utterance classification; vocabulary; Business; Knowledge discovery; Mobile communication; Pragmatics; Speech; Speech recognition; Vectors; GIS; smartphone; speech-to-speech translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mobile Data Management (MDM), 2013 IEEE 14th International Conference on
  • Conference_Location
    Milan
  • Print_ISBN
    978-1-4673-6068-5
  • Type

    conf

  • DOI
    10.1109/MDM.2013.96
  • Filename
    6569092