Difference between revisions of "Class meeting for 10-605 Hadoop Overview"

From Cohen Courses
 
Line 1: Line 1:
- #REDIRECT [[Class meeting for 10-605 in Fall 2016 Hadoop Overview]]
+ This is one of the class meetings on the [[Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2017|schedule]] for the course [[Machine Learning with Large Datasets 10-605 in Fall 2017]].
+
+ === Slides ===
+
+ Map-reduce overview:
+
+ * [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pptx Map-Reduce overview - ppt]
+ Other:
+
+ * [http://www.cs.cmu.edu/~wcohen/10-605/annotated-hadoop-log.txt  A log of me interacting with Hadoop] (streaming Hadoop only).
+
+ === Quiz ===
+
+ * To be posted
+
+ === Readings for the Class ===
+
+ * There are lots of on-line tutorials for Hadoop.  The [http://shop.oreilly.com/product/0636920010388.do O'Reilly Book] is also quite good.
+
+ === Things to Remember ===
+
+ * Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...

Revision as of 11:39, 11 August 2017

This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2017.

Slides

Map-reduce overview:

  • Map-Reduce overview - ppt

Other:

  • A log of me interacting with Hadoop (streaming Hadoop only).

Quiz

  • To be posted

Readings for the Class

  • There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.

Things to Remember

  • Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...
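
To make a few of the terms above concrete, here is a minimal in-memory sketch of a word-count job in plain Python. It is not Hadoop code and does not use any Hadoop API: the function names (`mapper`, `combiner`, `reducer`, `run_shard`, `run_job`) and the toy input are assumptions made up for illustration, each inner list stands in for one shard of the input, and HDFS and the job tracker are not modeled at all.

```python
from itertools import groupby
from operator import itemgetter


def mapper(line):
    """Emit a (word, 1) pair for every whitespace-separated token."""
    for word in line.lower().split():
        yield word, 1


def reducer(word, counts):
    """Sum all the counts seen for one key."""
    return word, sum(counts)


# For word count the combiner can reuse the reducer logic, because
# addition is associative and commutative. This is an assumption that
# holds for this example, not for every job.
combiner = reducer


def run_shard(lines):
    """Map one shard, then combine locally to shrink the intermediate output."""
    pairs = sorted(kv for line in lines for kv in mapper(line))
    return [
        combiner(key, (count for _, count in group))
        for key, group in groupby(pairs, key=itemgetter(0))
    ]


def run_job(shards):
    """Sort the per-shard output by key (the shuffle), then reduce each key."""
    shuffled = sorted(kv for shard in shards for kv in run_shard(shard))
    return dict(
        reducer(key, (count for _, count in group))
        for key, group in groupby(shuffled, key=itemgetter(0))
    )


# Hypothetical toy input: two shards, one line of text each.
shards = [["a rose is a rose"], ["is a rose"]]
result = run_job(shards)
print(result)
```

The combiner runs on each shard before the shuffle, so the first shard sends one `("a", 2)` pair forward rather than two `("a", 1)` pairs. The final counts are the same with or without it; only the amount of intermediate data changes.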