Difference between revisions of "Class meeting for 10-605 Hadoop Overview"

From Cohen Courses
Jump to navigationJump to search
 
(5 intermediate revisions by the same user not shown)
Line 6: Line 6:
  
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pptx Map-Reduce overview - ppt]
 
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pptx Map-Reduce overview - ppt]
Other:
+
* [http://www.cs.cmu.edu/~wcohen/10-605/map-reduce.pdf Map-Reduce overview - pdf]
 
 
* [http://www.cs.cmu.edu/~wcohen/10-605/annotated-hadoop-log.txt  A log of me interacting with Hadoop] (streaming Hadoop only).
 
  
 
=== Quiz ===
 
=== Quiz ===
  
* To be posted
+
* There is no quiz today - you should instead spend the review time actually working with Hadoop on stoat.  You might also look at this [http://www.cs.cmu.edu/~wcohen/10-605/annotated-hadoop-log.txt  annotated log of me interacting with streaming Hadoop].
  
 
=== Readings for the Class ===
 
=== Readings for the Class ===

Latest revision as of 10:03, 7 September 2017

This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2017.

Slides

Map-reduce overview:

Quiz

Readings for the Class

  • There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.

Things to Remember

  • Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...