Difference between revisions of "Class meeting for 10-605 Hadoop Overview"

From Cohen Courses
Jump to navigationJump to search
Line 7: Line 7:
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pptx Map-Reduce overview - ppt]
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pptx Map-Reduce overview - ppt]
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pdf Map-Reduce overview - pdf]
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pdf Map-Reduce overview - pdf]
 
Other:
 
 
* [http://www.cs.cmu.edu/~wcohen/10-605/annotated-hadoop-log.txt  An annotated log of me interacting with streaming Hadoop].
 
  
 
=== Quiz ===
 
=== Quiz ===
  
* There is no quiz today - you should instead spend the review time actually working with Hadoop on stoat.
+
* There is no quiz today - you should instead spend the review time actually working with Hadoop on stoat.  You might also look at
 +
this [http://www.cs.cmu.edu/~wcohen/10-605/annotated-hadoop-log.txt  annotated log of me interacting with streaming Hadoop].
  
 
=== Readings for the Class ===
 
=== Readings for the Class ===

Revision as of 10:02, 7 September 2017

This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2017.

Slides

Map-reduce overview:

Quiz

  • There is no quiz today - you should instead spend the review time actually working with Hadoop on stoat. You might also look at

this annotated log of me interacting with streaming Hadoop.

Readings for the Class

  • There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.

Things to Remember

  • Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...