• DocumentCode
    3173260
  • Title

    An efficient unstructured big data analysis method for enhancing performance using machine learning algorithm

  • Author

    Reshmy, A.K. ; Paulraj, D.

  • Author_Institution
    Anna Univ., Chennai, India
  • fYear
    2015
  • fDate
    19-20 March 2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In this modern world, data mining technology holds an essential position in all the major Engineering fields. Handling of Unstructured Big Data is an essential task of this era. At present, making the maximum advantage of parallel processing know-hows and the task of rapid examination of huge data steadily and continuously transmitted or received from various sources is becoming popular or conventional. The big data analytics job is fragmented into smaller jobs and ran over tens, hundreds or thousands of product servers by the parallel processing architecture. This helps in maintaining the data center cost efficient and facilitates easy handling of the enormous work in an efficient way. In this paper, proposed solution takes online consumer purchase. The online system has unrivalled bank of data on online consumer purchasing behavior that can be mined from its 100 million customers accounts. They use customer click-stream data and historical purchase data of all those 100 million customers and each user is shown personalized results on customized web pages. For improving Big Data performance the Machine Learning Method i.e. K-Nearest Neighbour algorithm used to support to take good analysis. Hadoop simulator is used to solve this kind of task.
  • Keywords
    Big Data; Web sites; data analysis; learning (artificial intelligence); parallel processing; Hadoop simulator; K-nearest neighbour algorithm; Web pages; customer click-stream data; historical purchase data; machine learning algorithm; unstructured big data analysis method; Big data; Business; Computers; Data mining; Databases; Industries; Machine learning algorithms; Hadoop Frame Work; Machine Learning Algorithm; Parallel Processing Technologies; Unstructured Big Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuit, Power and Computing Technologies (ICCPCT), 2015 International Conference on
  • Conference_Location
    Nagercoil
  • Type

    conf

  • DOI
    10.1109/ICCPCT.2015.7159492
  • Filename
    7159492