Class meeting for 10-405 Hadoop Overview
From Cohen Courses
Jump to navigationJump to searchThis is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-405 in Spring 2018.
Slides
Map-reduce overview:
Quiz
- Today's quiz
- You might also look at this annotated log of me interacting with streaming Hadoop.
Readings for the Class
- There are lots of on-line tutorials for Hadoop. The O'Reilly Book is also quite good.
Things to Remember
- Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...