Class meeting for 10-605 Hadoop Overview

From Cohen Courses
Revision as of 10:01, 7 September 2017 by Wcohen (talk | contribs) (→‎Quiz)
Jump to navigationJump to search

This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2017.

Slides

Map-reduce overview:

Other:

Quiz

  • There is no quiz today - you should instead spend the review time actually working with Hadoop on stoat.

Readings for the Class

  • There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.

Things to Remember

  • Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...