Class meeting for 10-605 in Fall 2016 Hadoop Overview

From Cohen Courses
Jump to navigationJump to search

This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2016.

Slides

Map-reduce overview:

Other:

Quiz

  • To be posted

Readings for the Class

  • There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.

Things to Remember

  • Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...