• DocumentCode
    1768109
  • Title

    Automatic textual aggregation approach of scientific articles in OLAP context

  • Author

    Mustapha, Bouakkaz ; Sabine, Loudcher ; Youcef, Ouinten

  • Author_Institution
    LIM Lab., Univ. of Laghouat, Laghouat, Algeria
  • fYear
    2014
  • fDate
    9-11 Nov. 2014
  • Firstpage
    30
  • Lastpage
    35
  • Abstract
    In the last decade, Online Analytical Processing (OLAP) has taken an increasingly important role in Business Intelligence. Approaches, solutions and tools have been provided for both databases and data warehouses, which focus mainly on numerical data. These solutions are not suitable for textual data. Because of the fast growing of this type of data, there is a need for new approaches that take into account the textual content of data. In the context of Text OLAP (OLAP on text or documents), the measure can be textual and need a suitable aggregation function for OLAP operations such as roll-up. We present in this paper a new aggregation function for textual data. Our approach is based on the affinity between keywords and uses the search of cycles in a graph to find the aggregated keywords. We also present performances and a comparison with three other methods. The experimental study shows good results for our approach.
  • Keywords
    competitive intelligence; data mining; graph theory; information retrieval; text analysis; OLAP context; aggregated keywords; aggregation function; automatic textual aggregation approach; business intelligence; data warehouses; databases; graph cycles; online analytical processing; roll-up; scientific articles; text OLAP; Benchmark testing; Clustering algorithms; Complexity theory; Context; Data models; Pragmatics; Runtime; OLAP; aggregation function; graph; textual data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Information Technology (INNOVATIONS), 2014 10th International Conference on
  • Conference_Location
    Al Ain
  • Print_ISBN
    978-1-4799-7210-4
  • Type

    conf

  • DOI
    10.1109/INNOVATIONS.2014.6987557
  • Filename
    6987557