OLFI 2012: Advanced Hadoop
This tutorial will explain how to leverage a Hadoop cluster to do data analysis using Java MapReduce, Apache Hive and Apache Pig. It is recommended that participants have experience with some programming language.
Topics include:
- Why are Hadoop and MapReduce needed?
- Writing a Java MapReduce program
- Common algorithms applied to Hadoop such as indexing, classification, joining data sets and graph processing
- Data analysis with Hive and Pig
- Overview of writing applications that use Apache HBase
NOTE: Some programming experience is strongly recommended for this session.
Instructor: Tom Hanlon, Cloudera
Schedule: 1 pm to 4 pm