Class meeting for 10-605 Hadoop Overview
This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2017.
Slides
Map-reduce overview:
Other:
- A log of my interactions with Hadoop (Hadoop Streaming only).
Quiz
- To be posted
Readings for the Class
- There are many online tutorials for Hadoop. The O'Reilly book is also quite good.
Things to Remember
- Hadoop terminology: HDFS, shards, job tracker, combiner, mapper, reducer, ...
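To make the mapper, combiner, and reducer roles concrete, here is a minimal word-count sketch in plain Python. It does not use Hadoop: the function names (`mapper`, `combiner`, `reducer`, `run_job`) and the sample data are hypothetical, each inner list stands in for one shard of the input, and the in-memory sort stands in for the shuffle phase. HDFS and the job tracker are not modelled at all.

```python
from collections import defaultdict
from itertools import groupby


def mapper(line):
    """Emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def combiner(pairs):
    """Pre-aggregate counts on the mapper's side to shrink the shuffle."""
    partial = defaultdict(int)
    for word, count in pairs:
        partial[word] += count
    return list(partial.items())


def reducer(word, counts):
    """Sum all counts that were shuffled to this key."""
    return word, sum(counts)


def run_job(shards):
    """Simulate one job: map and combine per shard, sort by key, reduce."""
    combined = []
    for shard in shards:
        mapped = [pair for line in shard for pair in mapper(line)]
        combined.extend(combiner(mapped))
    # Sorting by key plays the role of the shuffle/sort phase.
    combined.sort(key=lambda kv: kv[0])
    return dict(
        reducer(word, [count for _, count in group])
        for word, group in groupby(combined, key=lambda kv: kv[0])
    )


# Hypothetical input: two shards, each a list of text lines.
shards = [
    ["the quick brown fox", "the lazy dog"],
    ["the fox jumps"],
]
word_counts = run_job(shards)
```

Because addition is associative and commutative, the combiner can reuse the reducer's logic here; in general a combiner is only safe when partial aggregation does not change the final result.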