Difference between revisions of "Machine Learning 10-601 in Fall 2014"

From Cohen Courses
(48 intermediate revisions by 10 users not shown)
Line 4: Line 4:
 
* Mid-term: there will be a block midterm, with one test shared between the two sections, scheduled for '''7-9pm October 29.'''
 
* Mid-term: there will be a block midterm, with one test shared between the two sections, scheduled for '''7-9pm October 29.'''
 
* '''Start of classes''': the first lecture for 10-601B (William's class, Tu/Th) will be '''Tuesday 9/2.'''  The first lecture for 10-601A (Ziv's class, M/W) will be '''Wed 8/27.'''
 
* '''Start of classes''': the first lecture for 10-601B (William's class, Tu/Th) will be '''Tuesday 9/2.'''  The first lecture for 10-601A (Ziv's class, M/W) will be '''Wed 8/27.'''
 +
* 9/10: Homework 1 has been announced and is available on the syllabus page.
 +
* 9/19: Homework 2 has been announced and is available on the syllabus page.
 +
* 9/26: Homework 3 has been announced and is available on the syllabus page.
  
 
== Important People and Places ==
 
== Important People and Places ==
Line 11: Line 14:
 
** Sandy Winkler, sandyw@cs.cmu.edu is the course secretary.
 
** Sandy Winkler, sandyw@cs.cmu.edu is the course secretary.
 
** [[Syllabus for Machine Learning 10-601 in Fall 2014]] with lectures slides and homeworks
 
** [[Syllabus for Machine Learning 10-601 in Fall 2014]] with lectures slides and homeworks
** We'll be using  BlackBoard and Autolab for most assignments, and Piazza for general Q/A.
+
** We'll be using  BlackBoard and Autolab for most assignments, and Piazza for general Q/A. The lectures are recorded by MediaTech.
 +
*** [https://autolab.cs.cmu.edu/10601-f14 Autolab] - submit programs
 +
*** [https://piazza.com/cmu/fall2014/10601/home Piazza] - announcements and discussion
 
*** [https://blackboard.andrew.cmu.edu/webapps/portal/frameset.jsp?url=%2Fwebapps%2Fblackboard%2Fexecute%2Flauncher%3Ftype%3DCourse%26id%3D_4739854_1%26url%3D Blackboard page] - grades and non-programming assignments
 
*** [https://blackboard.andrew.cmu.edu/webapps/portal/frameset.jsp?url=%2Fwebapps%2Fblackboard%2Fexecute%2Flauncher%3Ftype%3DCourse%26id%3D_4739854_1%26url%3D Blackboard page] - grades and non-programming assignments
*** [https://autolab.cs.cmu.edu/10601-f14 Autolab] - submit programs
+
*** [https://mediatech-stream.andrew.cmu.edu/Mediasite/Catalog/Full/0f99d98346a6442893503bd19928bebe21 Video Lectures for 10-601A] - MediaTech
*** [https://piazza.com/class/hyu9y7rrcx77o7 Piazza] - announcements and discussion
+
*** [https://mediatech-stream.andrew.cmu.edu/Mediasite/Catalog/Full/4e86c44694a14b9fbe1ea7653f553ac621 Video Lectures for 10-601B] - MediaTech
 
* For Section 10-601A
 
* For Section 10-601A
 
** Instructor: [http://www.cs.cmu.edu/~zivbj/ Ziv Bar-Joseph],  Lane Center and Machine Learning Dept
 
** Instructor: [http://www.cs.cmu.edu/~zivbj/ Ziv Bar-Joseph],  Lane Center and Machine Learning Dept
 
** When/where: Mon/Wed 1:30-2:50 WEH 7500
 
** When/where: Mon/Wed 1:30-2:50 WEH 7500
 
** TAs:  
 
** TAs:  
*** Abhinav Maurya ahmaurya@gmail.com - PhD
+
*** Abhinav Maurya ahmaurya+10601@gmail.com - PhD
 
*** Ying Yang yingyan1@andrew.cmu.edu - PhD
 
*** Ying Yang yingyan1@andrew.cmu.edu - PhD
*** Siddhartha Jain sj1@cs.cmu.edu - PhD
 
 
*** Qihui (Anna) Li qihuil@andrew.cmu.edu - MS
 
*** Qihui (Anna) Li qihuil@andrew.cmu.edu - MS
 
*** Jingwei Shen js1@andrew.cmu.edu - MS
 
*** Jingwei Shen js1@andrew.cmu.edu - MS
Line 30: Line 34:
 
** When/where: Tues/Thurs 1:30-2:50 WEH 7500
 
** When/where: Tues/Thurs 1:30-2:50 WEH 7500
 
** TAs:  
 
** TAs:  
 +
*** Siddhartha Jain sjX [at] cs.cmu.edu where X=1 - PhD
 
*** Debjani Biswas dbiswas@cmu.edu  - MS
 
*** Debjani Biswas dbiswas@cmu.edu  - MS
 
*** Kuo Liu kuo.liu.pku@gmail.com - MS
 
*** Kuo Liu kuo.liu.pku@gmail.com - MS
Line 35: Line 40:
 
***  Jin Sun jins@andrew.cmu.edu - MS
 
***  Jin Sun jins@andrew.cmu.edu - MS
 
*** Xu Zhuo zhuox@cmu.edu - MS
 
*** Xu Zhuo zhuox@cmu.edu - MS
* Recitations: note that assignments in Sep/Oct are due on Thursdays, and assignments in Nov/Dec are due on Mondays.
+
 
** Monday 8pm: PH 125C -- ''ending Nov 3''
+
== Recitations ==
** Tuesday 6pm: HH B131 (may be moved Sept 23)
+
 
** Tuesday 8pm: DH 1112
+
There are four recitations held by two TAs each week. Students can go to any section they like. Attendance is optional but highly recommended. In recitations, TAs will usually review important lecture content, cover (extra) material that students need for homework, answer students' questions, and help students prepare for the exam and project.
** Wed 7pm: GHC 4307 (may be moved Sept 17, Sept 24, Oct 1)
+
 
** Thursday 8pm: DH112 -- ''starting Nov 6''
+
Time slots and locations can be found below. Content (slides and code) can be found on the syllabus page (scroll down to the bottom). Please be aware that the recitation schedule changes in some weeks. Also, note that assignments in Sep/Oct are due on Thursdays, and assignments in Nov/Dec are due on Mondays.
* On-line lectures: TBA
+
 
* Office hours:
+
* Monday 8pm: PH 125C -- ''ending Nov 3''
** William: 11am Monday, GHC 8217
+
* Tuesday 6pm: HH B131
** Ziv: Monday after class (3pm), GHC 8006
+
* Tuesday 8pm: DH112
** Kuo Liu: 8:30 am Monday, TBD
+
* Wed 7pm: GHC 4307 (in HH B103 Sept 17, Sept 24, Oct 1)
** Jingwei Shen: 5:00 pm Monday, TBD
+
* Thursday 8pm: DH112 -- ''starting Nov 6''
** Abhinav Maurya: 5:30 pm Tuesday, Hamburg Hall 3030
+
 
** Ying Yang: 3:00 pm Friday, BH 434
+
== Office hours ==
** Sid Jain: 3:00 pm Sunday, GHC 7509
+
 
 +
{| border="1"
 +
!  Instructor
 +
!  Day
 +
!  Time
 +
!  Location
 +
|-
 +
| William || Monday || 11am || GHC 8217
 +
|-
 +
| Ziv || Monday || 3pm (after class) || GHC 8006
 +
|-
 +
| Kuo Liu || Monday || 8:30am || GHC 6418
 +
|-
 +
| Qihui Li || Tuesday || 11am || GHC 7404
 +
|-
 +
| Jingwei Shen || Tuesday ||  5pm-6pm || GHC 5th floor Citadel Public Area
 +
|-
 +
| Sid Jain || Tuesday || 5 pm || GHC 6505
 +
|-
 +
| Yipei Wang || Friday || 3pm || GHC 6405
 +
|-
 +
| Ying Yang || Friday || 4pm-5pm || Baker Hall 434, (follow the sign near Suite 332)
 +
|-
 +
| Abhinav Maurya || Saturday || 10:30am - 11:30am (email by 10am if you plan to come) || Hamburg Hall - 3rd Floor Faculty Lounge
 +
|-
 +
| Daniel Ribeiro Silva || Tuesdays and Thursdays || 3pm (after Cohen's lectures) || GHC 5411
 +
|-
 +
|}
  
 
== Description ==
 
== Description ==
Line 80: Line 112:
  
 
Formal prerequisites:
 
Formal prerequisites:
 +
 
* Prerequisites are 15-122, Principles of Imperative Computation AND 21-127: Concepts of Mathematics.
 
* Prerequisites are 15-122, Principles of Imperative Computation AND 21-127: Concepts of Mathematics.
 
* Additionally, a probability course is a co-requisite: 36-217: Probability Theory and Random Processes OR 36-225: Introduction to Probability and Statistics I
 
* Additionally, a probability course is a co-requisite: 36-217: Probability Theory and Random Processes OR 36-225: Introduction to Probability and Statistics I
Line 85: Line 118:
  
 
Self-assessment for students:
 
Self-assessment for students:
 +
 
* Students, especially graduate students, come to CMU with a variety of different backgrounds, so formal course prereqs are hard to establish.  There is a short  [http://www.cs.cmu.edu/~wcohen/10-601/self-assessment/Intro_ML_Self_Evaluation.pdf self-assessment test] to see if you have the necessary background for 10-601.  We recommend that all students take this before enrolling in 10-601 to see if they have the necessary background knowledge already, or if they need to review and/or take additional courses.
 
* Students, especially graduate students, come to CMU with a variety of different backgrounds, so formal course prereqs are hard to establish.  There is a short  [http://www.cs.cmu.edu/~wcohen/10-601/self-assessment/Intro_ML_Self_Evaluation.pdf self-assessment test] to see if you have the necessary background for 10-601.  We recommend that all students take this before enrolling in 10-601 to see if they have the necessary background knowledge already, or if they need to review and/or take additional courses.
 +
 +
Refresher material:
 +
 +
A few resources that can help you review the math required to do well in a machine learning course:
 +
 +
* Linear Algebra: [http://cs229.stanford.edu/section/cs229-linalg.pdf Review], [http://www.cs.nyu.edu/~roweis/notes/matrixid.pdf Cheatsheet]
 +
* Probability: [http://cs229.stanford.edu/section/cs229-prob.pdf Review], [https://bayesrule.files.wordpress.com/2014/07/probability_cheatsheet_140718.pdf Cheatsheet]
 +
 +
Some other reviews you might be interested in:
 +
* Zico Kolter, a prof in CSD, has put up a set of [http://www.cs.cmu.edu/~zkolter/course/linalg/index.html  video lectures that review linear algebra].
 +
* Very recently, Aaditya Ramdas, a grad student in MLD, has put up some [http://www.cs.cmu.edu/~aramdas/videos.html video reviews] of multivariate calculus and multivariate probabilities and stats.
 +
 +
To assess whether you need to watch these, you should do the self-assessment test, which is linked to on the wiki.
  
 
== Grading Policy ==
 
== Grading Policy ==
  
* Semi-final exam: 20%
+
* Semi-final exam: 30%
** Instead of a final exam, we have an exam in class
+
** Instead of a final exam, we have an exam in class, the evening of 10/29.
* Weekly homeworks (out Wed, due Wed): 60%
+
* Weekly homeworks (out Wed, due Wed): 50%
 
** Late assignment policy: We will grant up to 50% credit if an assignment is less than 48 hrs late.  Also, you can drop your lowest assignment grade entirely.
 
** Late assignment policy: We will grant up to 50% credit if an assignment is less than 48 hrs late.  Also, you can drop your lowest assignment grade entirely.
 
* Project: 20% (see below)
 
* Project: 20% (see below)
Line 97: Line 144:
 
== Projects ==
 
== Projects ==
  
More details will be posted later
+
* [http://www.cs.cmu.edu/~wcohen/10-601/project-2014/project-draft.pdf Instructions (draft)] - also listed on the syllabus for the week ending 11/14.
  
 
== Policy on Collaboration among Students  ==
 
== Policy on Collaboration among Students  ==

Latest revision as of 20:45, 18 November 2014

Announcements

  • Important announcements will be made here as well as on Piazza.
  • Mid-term: there will be a block midterm, with one test shared between the two sections, scheduled for 7-9pm October 29.
  • Start of classes: the first lecture for 10-601B (William's class, Tu/Th) will be Tuesday 9/2. The first lecture for 10-601A (Ziv's class, M/W) will be Wed 8/27.
  • 9/10: Homework 1 has been announced and is available on the syllabus page.
  • 9/19: Homework 2 has been announced and is available on the syllabus page.
  • 9/26: Homework 3 has been announced and is available on the syllabus page.

Important People and Places

There are two sections for the course, which will be closely coordinated.

  • For both sections:
    • Sandy Winkler, sandyw@cs.cmu.edu, is the course secretary.
    • Syllabus for Machine Learning 10-601 in Fall 2014, with lecture slides and homeworks
    • We'll be using Blackboard and Autolab for most assignments, and Piazza for general Q/A. The lectures are recorded by MediaTech.
      • Autolab - submit programs
      • Piazza - announcements and discussion
      • Blackboard page - grades and non-programming assignments
      • Video Lectures for 10-601A - MediaTech
      • Video Lectures for 10-601B - MediaTech
  • For Section 10-601A
    • Instructor: Ziv Bar-Joseph, Lane Center and Machine Learning Dept
    • When/where: Mon/Wed 1:30-2:50 WEH 7500
    • TAs:
      • Abhinav Maurya ahmaurya+10601@gmail.com - PhD
      • Ying Yang yingyan1@andrew.cmu.edu - PhD
      • Qihui (Anna) Li qihuil@andrew.cmu.edu - MS
      • Jingwei Shen js1@andrew.cmu.edu - MS
      • Yipei Wang yipeiw@andrew.cmu.edu - MS
      • Henry (Harry) Gifford hgifford@andrew.cmu.edu - senior
  • For Section 10-601B
    • Instructor: William Cohen, Machine Learning Dept and LTI
    • When/where: Tues/Thurs 1:30-2:50 WEH 7500
    • TAs:
      • Siddhartha Jain sjX [at] cs.cmu.edu where X=1 - PhD
      • Debjani Biswas dbiswas@cmu.edu - MS
      • Kuo Liu kuo.liu.pku@gmail.com - MS
      • Daniel Ribeiro Silva drsilva@andrew.cmu.edu - MS
      • Jin Sun jins@andrew.cmu.edu - MS
      • Xu Zhuo zhuox@cmu.edu - MS

Recitations

There are four recitations held by two TAs each week. Students can go to any section they like. Attendance is optional but highly recommended. In recitations, TAs will usually review important lecture content, cover (extra) material that students need for homework, answer students' questions, and help students prepare for the exam and project.

Time slots and locations can be found below. Content (slides and code) can be found on the syllabus page (scroll down to the bottom). Please be aware that the recitation schedule changes in some weeks. Also, note that assignments in Sep/Oct are due on Thursdays, and assignments in Nov/Dec are due on Mondays.

  • Monday 8pm: PH 125C -- ending Nov 3
  • Tuesday 6pm: HH B131
  • Tuesday 8pm: DH112
  • Wed 7pm: GHC 4307 (in HH B103 Sept 17, Sept 24, Oct 1)
  • Thursday 8pm: DH112 -- starting Nov 6

Office hours

Instructor | Day | Time | Location
William | Monday | 11am | GHC 8217
Ziv | Monday | 3pm (after class) | GHC 8006
Kuo Liu | Monday | 8:30am | GHC 6418
Qihui Li | Tuesday | 11am | GHC 7404
Jingwei Shen | Tuesday | 5pm-6pm | GHC 5th floor Citadel Public Area
Sid Jain | Tuesday | 5pm | GHC 6505
Yipei Wang | Friday | 3pm | GHC 6405
Ying Yang | Friday | 4pm-5pm | Baker Hall 434 (follow the sign near Suite 332)
Abhinav Maurya | Saturday | 10:30am-11:30am (email by 10am if you plan to come) | Hamburg Hall, 3rd Floor Faculty Lounge
Daniel Ribeiro Silva | Tuesdays and Thursdays | 3pm (after Cohen's lectures) | GHC 5411

Description

Machine Learning (ML) asks "how can we design programs that automatically improve their performance through experience?" This includes learning to perform many types of tasks based on many types of experience, e.g. spotting high-risk medical patients, recognizing speech, classifying text documents, detecting credit card fraud, or driving autonomous robots.

Topics covered in 10-601 include concept learning, version spaces, decision trees, neural networks, computational learning theory, active learning, estimation & the bias-variance tradeoff, hypothesis testing, Bayesian learning, Naïve Bayes classifier, Bayes Nets & Graphical Models, the EM algorithm, Hidden Markov Models, K-Nearest-Neighbors and nonparametric learning, reinforcement learning, bagging and boosting, and other topics.

10-601 focuses on the mathematical, statistical and computational foundations of the field. It emphasizes the role of assumptions in machine learning. As we introduce different ML techniques, we work out together what assumptions are implicit in them. Grading is based on written assignments, programming assignments, and a final exam.

10-601 focuses on understanding what makes machine learning work. If your interest is primarily in learning the process of applying ML effectively, and in the practical side of ML for applications, you should consider Machine Learning in Practice (11-344/05-834).

10-601 is open to all but is recommended for CS Seniors & Juniors, Quantitative Masters students, and non-SCS PhD students.

Syllabus and Text

Syllabus for Machine Learning 10-601, including lecture slides and HWs

Previous syllabi, for the historically-minded:

The text is Tom Mitchell's textbook, Machine Learning. It is recommended but not required.

Prerequisites

Formal prerequisites:

  • Prerequisites are 15-122, Principles of Imperative Computation AND 21-127: Concepts of Mathematics.
  • Additionally, a probability course is a co-requisite: 36-217: Probability Theory and Random Processes OR 36-225: Introduction to Probability and Statistics I
  • A minimum grade of 'C' is required in all these courses.

Self-assessment for students:

  • Students, especially graduate students, come to CMU with a variety of different backgrounds, so formal course prereqs are hard to establish. There is a short self-assessment test to see if you have the necessary background for 10-601. We recommend that all students take this before enrolling in 10-601 to see if they have the necessary background knowledge already, or if they need to review and/or take additional courses.

Refresher material:

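A few resources that can help you review the math required to do well in a machine learning course:

  • Linear Algebra: Review, Cheatsheet
  • Probability: Review, Cheatsheet

Some other reviews you might be interested in:

  • Zico Kolter, a prof in CSD, has put up a set of video lectures that review linear algebra.
  • Very recently, Aaditya Ramdas, a grad student in MLD, has put up some video reviews of multivariate calculus and multivariate probabilities and stats.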

To assess whether you need to watch these, you should do the self-assessment test, which is linked to on the wiki.

Grading Policy

  • Semi-final exam: 30%
    • Instead of a final exam, we have an exam in class, the evening of 10/29.
  • Weekly homeworks (out Wed, due Wed): 50%
    • Late assignment policy: We will grant up to 50% credit if an assignment is less than 48 hrs late. Also, you can drop your lowest assignment grade entirely.
  • Project: 20% (see below)
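
As an illustration only, the weights above can be combined as in the sketch below. It is not an official grade calculator. The function names, the 0-100 score scale, and equal weighting of homeworks are assumptions. So are two readings of the late policy: "up to 50% credit" is taken to mean half the earned score, and an assignment 48 hours late or more is taken to earn no credit.

```python
def homework_average(scores, hours_late):
    """Average homework score (0-100) after late penalties, dropping the lowest."""
    adjusted = []
    for score, late in zip(scores, hours_late):
        if late <= 0:
            adjusted.append(float(score))   # on time: full credit
        elif late < 48:
            adjusted.append(0.5 * score)    # assumption: half of the earned score
        else:
            adjusted.append(0.0)            # assumption: no credit at 48 hours or later
    if not adjusted:
        return 0.0                          # no homeworks recorded yet
    if len(adjusted) > 1:
        adjusted.remove(min(adjusted))      # drop the single lowest grade
    return sum(adjusted) / len(adjusted)


def course_grade(exam, scores, hours_late, project):
    """Weighted total: 30% exam, 50% homework, 20% project."""
    return 0.30 * exam + 0.50 * homework_average(scores, hours_late) + 0.20 * project
```

For example, under these assumptions a student with homework scores of 90, 80 (10 hours late), and 100 would have the late homework reduced to 40 and then dropped, leaving a homework average of 95.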

Projects

  • Instructions (draft) - also listed on the syllabus for the week ending 11/14.

Policy on Collaboration among Students

These policies are the same as those used in Dr. Rosenfeld's 2013 version of the course.

The purpose of student collaboration is to facilitate learning, not to circumvent it. Studying the material in groups is strongly encouraged. You may also seek help from other students in understanding the material needed to solve a particular homework problem, provided that no written notes are shared or taken at that time, and that learning is facilitated, not circumvented. Each student must work out the actual solution alone and should be ready to reproduce it upon request.

The presence or absence of any form of help or collaboration, whether given or received, must be explicitly stated and disclosed in full by all involved, on the first page of their assignment. Specifically, each assignment solution must start by answering the following questions:

(1) Did you receive any help whatsoever from anyone in solving this assignment? Yes / No.
If you answered 'yes', give full details: _______________ (e.g. "Jane explained to me what is asked in Question 3.4")
(2) Did you give any help whatsoever to anyone in solving this assignment? Yes / No.
If you answered 'yes', give full details: _______________ (e.g. "I pointed Joe to section 2.3 to help him with Question 2")

Collaboration without full disclosure will be handled severely, in compliance with CMU's Policy on Cheating and Plagiarism.

As a related point, some of the homework assignments used in this class may have been used in prior versions of this class, or in classes at other institutions. Avoiding the use of heavily tested assignments will detract from the main purpose of these assignments, which is to reinforce the material and stimulate thinking. Because some of these assignments may have been used before, solutions to them may be (or may have been) available online, or from other people. It is explicitly forbidden to use any such sources, or to consult people who have solved these problems before. You must solve the homework assignments completely on your own. I will mostly rely on your wisdom and honor to follow this rule, but if a violation is detected it will be dealt with harshly. Collaboration with other students who are currently taking the class is allowed, but only under the conditions stated above.