• DocumentCode
    146438
  • Title

    Inference patterns from Big Data using aggregation, filtering and tagging- A survey

  • Author

    Prakashbhai, Pathak Anand ; Pandey, Hari Mohan

  • Author_Institution
    CSE, ASET, Noida, India
  • fYear
    2014
  • fDate
    25-26 Sept. 2014
  • Firstpage
    66
  • Lastpage
    71
  • Abstract
    This paper reviews various approaches to infer the patterns from Big Data using aggregation, filtering and tagging. Earlier research shows that data aggregation concerns about gathered data and how efficiently it can be utilized. It is understandable that at the time of data gathering one does not care much about whether the gathered data will be useful or not. Hence, filtering and tagging of the data are the crucial steps in collecting the relevant data to fulfill the need. Therefore the main goal of this paper is to present a detailed and comprehensive survey on different approaches. To make the concept clearer, we have provided a brief introduction of Big Data, how it works, working of two data aggregation tools (namely, flume and sqoop), data processing tools (hive and mahout) and various algorithms that can be useful to understand the topic. At last we have included comparisons between aggregation tools, processing tools as well as various algorithms through its pre-process, matching time, results and reviews.
  • Keywords
    Big Data; inference mechanisms; information filtering; pattern recognition; Big Data; data aggregation; data filtering; data gathering; data tagging; inference patterns; Algorithm design and analysis; Big data; Data warehouses; Databases; Facebook; Filtering; Organizations; Big Data; Flume; HDFS; Hadoop; Hive; Mahout; MapReduce; Sqoop; aggregation; filtering; tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-4237-4
  • Type

    conf

  • DOI
    10.1109/CONFLUENCE.2014.6949238
  • Filename
    6949238