Class meeting for 10-605 in Fall 2016 Overview
From Cohen Courses
Revision as of 14:27, 1 August 2016
This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2016.
=== Slides ===

=== Homework ===
- Before the next class: watch my overview lecture from 10-601 (lecture 1 and a little of lecture 2) if you need it.
=== Readings for the Class ===
- The Unreasonable Effectiveness of Data - Halevy, Pereira, Norvig
=== Also discussed ===
- William W. Cohen (1993): Efficient pruning methods for separate-and-conquer rule learning systems. IJCAI 1993: 988-994.
- William W. Cohen (1995): Fast effective rule induction. ICML 1995: 115-123.
- Banko & Brill (2001): Scaling to very very large corpora for natural language disambiguation. ACL 2001.
=== Things to remember ===
- Why use big data?
  - Simple learning methods with large datasets can outperform complex learners with smaller datasets.
  - The best-to-worst ordering of learning methods can differ between small and large datasets.
  - The best way to improve a learning system's performance is often to collect more data.
  - Large datasets often imply large classifiers.
- Asymptotic analysis
  - It measures the number of operations as a function of problem size.
  - Different operations (e.g., disk seeks, sequential scans, memory accesses) can have very different costs.
  - Disk access is cheapest when you scan sequentially.
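The operation-counting idea behind asymptotic analysis can be made concrete with a toy comparison, counting worst-case comparisons for a linear scan (O(n)) versus binary search (O(log n)) on a sorted list. This is an illustrative sketch, not part of the lecture materials:

```python
def linear_search(xs, target):
    """Return the number of comparisons a front-to-back scan makes."""
    comparisons = 0
    for x in xs:
        comparisons += 1
        if x == target:
            break
    return comparisons

def binary_search(xs, target):
    """Return the number of comparisons binary search makes on sorted xs."""
    comparisons = 0
    lo, hi = 0, len(xs) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        comparisons += 1
        if xs[mid] == target:
            break
        if xs[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return comparisons

if __name__ == "__main__":
    # Searching for the last element: linear scan touches every item,
    # binary search touches roughly log2(n) of them.
    for n in (1_000, 1_000_000):
        xs = list(range(n))
        print(n, linear_search(xs, n - 1), binary_search(xs, n - 1))
```

Growing the input by a factor of 1000 grows the linear-scan cost by the same factor, while binary search pays only about 10 extra comparisons.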
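The sequential-versus-seeking point can be illustrated with a minimal Python sketch that reads the same blocks of a file in order and in shuffled order. The block size and file size are hypothetical choices for illustration; on a cached or SSD-backed file the timing gap may be small, but on spinning disks the seek-heavy version is far slower:

```python
import os
import random
import tempfile
import time

BLOCK = 4096  # assumed block size, for illustration only

def sequential_read(path):
    """Scan the file front to back, one block at a time."""
    total = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(BLOCK)
            if not chunk:
                break
            total += len(chunk)
    return total

def random_read(path, n_blocks):
    """Read the same blocks, but seek to them in shuffled order."""
    order = list(range(n_blocks))
    random.shuffle(order)
    total = 0
    with open(path, "rb") as f:
        for i in order:
            f.seek(i * BLOCK)
            total += len(f.read(BLOCK))
    return total

if __name__ == "__main__":
    # Time both access patterns on a small scratch file (~8 MB).
    n_blocks = 2048
    fd, path = tempfile.mkstemp()
    os.close(fd)
    with open(path, "wb") as f:
        f.write(b"\0" * (BLOCK * n_blocks))
    t0 = time.perf_counter(); sequential_read(path); t_seq = time.perf_counter() - t0
    t0 = time.perf_counter(); random_read(path, n_blocks); t_rand = time.perf_counter() - t0
    print(f"sequential: {t_seq:.4f}s  random-order: {t_rand:.4f}s")
    os.remove(path)
```

Both functions read exactly the same bytes; only the access pattern differs, which is the point of the cost comparison above.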