Hadoop has become the defacto standard of big data analysis, due to its scalability, fault-tolerance and availability. However, using Hadoop is still a non-trivial task; difficulties, like unfriendly interfaces, heterogeneous but isolated tools, make data analysis on Hadoop a tedious work. The goal of this topic is to design and implement a flexible web based tool facilitating big data analysis using Hadoop. The contesters are required to fulfill the following tasks: (1) Build web user interface for HDFS file management and Hive table management. (2) Integration open source data mining tools, like weka, Mahout etc. Develop a drag and drop based web user interface for developing big data analysis applications with these functions. (3) Implement web based report charts design interface.
This wiki is licensed under a Creative Commons 2.0 licenseXWiki Enterprise 6.4.4 - Documentation