Difference between revisions of "Class meeting for 10-605 in Fall 2016 Overview"

Revision as of 12:06, 30 August 2016

Why use big data?
- Simple learning methods with large datasets can outperform complex learners with smaller datasets
- The ordering of learning methods, best-to-worst, can be different for small datasets than for large datasets
- The best way to improve performance for a learning system is often to collect more data
- Large datasets often imply large classifiers

Asymptotic analysis
- It measures the number of operations as a function of problem size
- Different operations (e.g., disk seeking, scanning, memory access) can have very different costs
- Disk access is cheapest when you scan sequentially
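
The points above can be illustrated with a rough cost model. This is only a sketch: the seek time and bandwidth constants below are assumed order-of-magnitude values, not measurements from the lecture, and the function names are hypothetical. Both access patterns perform a number of operations linear in the number of records, yet their estimated running times differ by several orders of magnitude because a seek costs far more than streaming the next record.

```python
# Hypothetical, order-of-magnitude device parameters (assumptions, not measurements).
SEEK_TIME_S = 0.010          # assumed cost of one random disk seek: 10 ms
SEQ_BANDWIDTH_BPS = 100e6    # assumed sequential read rate: 100 MB/s


def sequential_scan_seconds(n_records, record_bytes):
    """Estimate: one seek to reach the start, then stream every record in order."""
    return SEEK_TIME_S + (n_records * record_bytes) / SEQ_BANDWIDTH_BPS


def random_access_seconds(n_records, record_bytes):
    """Estimate: one seek per record, plus the time to transfer that record."""
    return n_records * (SEEK_TIME_S + record_bytes / SEQ_BANDWIDTH_BPS)


if __name__ == "__main__":
    # Compare the two access patterns on a small and a large dataset.
    for n in (1_000, 1_000_000):
        seq = sequential_scan_seconds(n, 100)
        rnd = random_access_seconds(n, 100)
        print(f"n={n:>9,}: sequential {seq:.3f} s, random {rnd:.1f} s")
```

Under these assumed constants, counting operations alone would rate the two patterns as equivalent, which is why the per-operation cost has to be considered alongside the asymptotic count.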

@@ Line 9: / Line 9: @@
 * Before the next class: watch [https://mediatech-stream.andrew.cmu.edu/Mediasite/Catalog/Full/4e86c44694a14b9fbe1ea7653f553ac621 My overview lecture from 10-601 ] (lecture 1, and a little of lecture 2) if you need it.
+* Today's quiz: [https://qna-app.appspot.com/edit_new.html#/pages/view/aglzfnFuYS1hcHByGQsSDFF1ZXN0aW9uTGlzdBiAgIDQqdaqCQw]
 === Readings for the Class ===