• DocumentCode
    1797993
  • Title

    Mining user tasks from print logs

  • Author

    Xin Li ; Lei Zhang ; Ping Luo ; Enhong Chen ; Guandong Xu ; Yu Zong ; Chu Guan

  • Author_Institution
    Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    1250
  • Lastpage
    1257
  • Abstract
    With lots of applications emerging in World Wide Web, many interaction data from users are collected and exploited to discover user behavior or interest patterns. In this paper, we attempt to exploit a new interaction data, namely print logs, where each record is printing URLs selected by a user using a popular web printing tool. Users usually print web contents based on an intention (subtask or task). Apparently, mining common print tasks from print logs is able to capture users´ intentions, which undoubtedly benefits many web applications, such as task oriented recommendation and behavior targeting. However, it is not an easy job to perform this due to the difficulty of URL topic representation and task formulation. To this end, we propose a general framework, named UPT (Users Print Tasks mining framework), for mining print tasks from print logs. Specifically, we attempt to leverage delicious (a social book marking web service) as an external thesaurus to expand the expression of each URL by selecting tags associated with the domain of each URL. Then, we construct a tag co-occurrence graph where similar tags can be clustered as subtasks. If we view each subtask as an item, then the print log is transformed to a transaction database, on which an efficient pattern mining algorithm is proposed to induce tasks. Finally, we evaluate the effectiveness of the proposed framework through experiments on a real print log.
  • Keywords
    Web services; data mining; database management systems; graph theory; pattern clustering; thesauri; Delicious; UPT; URL printing; URL topic representation; Web content printing; Web printing tool; World Wide Web; behavior targeting; external thesaurus; interaction data; interest pattern discovery; pattern mining algorithm; print logs; social book marking Web service; subtask clustering; tag co-occurrence graph; task formulation; task oriented recommendation; transaction database; user behavior discovery; user print task mining framework; user task mining; Data mining; Itemsets; Printing; Semantics; Thesauri; Web pages; Clustering; Frequent Pattern Mining; Print Logs; Print Tasks; the Wisdom of Crowds;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889721
  • Filename
    6889721