Difference between revisions of "Social Media Analysis 10-802 in Fall 2012"

From Cohen Courses
Jump to navigationJump to search
Line 112: Line 112:
 
=== Assignment 3: Due 10:30 AM Tues, Oct 2 ===
 
=== Assignment 3: Due 10:30 AM Tues, Oct 2 ===
  
Send in your the next two summaries.
+
Send in the next two summaries.
  
 
== Grading ==
 
== Grading ==

Revision as of 17:31, 25 September 2012

Announcements

  • The first class is Tuesday 9/11.
  • William's office hours will be 1-2pm Friday (except when otherwise noted).
    • William will be out of town Friday 9/21.

Instructor and Venue

  • Instructor: William Cohen, Machine Learning Dept and LTI
  • Course secretary: Sharon Cavlovich, sharonw+@cs.cmu.edu, 412-268-5196
  • When/where: T/R 10:30-11:50, 4303 GHC, starting after the IC has finished. The first class will be Tuesday 9/11.
    • Note: the room has been moved!
  • Course Number: MLD 10-802, cross-listed as LTI 11-772
  • Prerequisites: a machine learning course (e.g., 10-701 or 10-601) or consent of the instructor.
  • TAs: Bhavana Dalvi and Aasish Pappu
  • Syllabus: To best posted, but will approximately follow the syllabus I used last time
  • Office hours:
    • William: 1-2pm Fri
    • Aasish: 2-3pm Thu [GHC 6223]
    • Bhavana: 12-1pm Tue [GHC 5509]
  • Course mailing list for announcements: 10802-announce@cs.cmu.edu

Clarification/announcement: This will be a regular 12-credit course (despite the some listings of it as a 6-credit course).

Description

The most actively growing part of the web is "social media" - e.g.. wikis, blogs, bboards, and collaboratively-developed community sites like Flikr and YouTube. This course will review selected papers from the recent research literature that address the problem of analyzing and understanding social media. Topics that will be covered include:

  • Text analysis techniques for sentiment analysis, analysis of figurative language, authorship attribution, and inference of demographic information about authors (e.g., age or sex).
  • Community analysis techniques for detecting communities, predicting authority, assessing influence (e.g. in viral marketing), or detecting spam.
  • Visualization techniques for understanding the interactions within and between communities.
  • Learning techniques for modeling and predicting trends in social media, or predicting other properties of media (e.g., user-provided content tags.)

Students should have a machine learning course (e.g., 10-601 or similar) or consent of the instructor. Readings will be based on research papers. Grades will be based on class participation, paper presentations, and a project. More specifically, students will be expected to:

  • Prepare summaries of the papers discussed in class. Summaries will be posted on this wiki.
  • Present and summarize one or more "optional" papers from the syllabus (or some other mutually agreeable paper) to the class.
  • Do a course project in a group of 2-3 people. The end result of the project will be a written report, with format and length appropriate for a conference publication.

Sample Projects

Here are some projects from previous years, so you can get some ideas of the scope.

.

Syllabus and Readings

Older syllabi:

The first half of the course, roughly, will be presentations of background material. Luckily there are some very good recent surveys on this.

The second half of the course will be presentations of recent research papers.

Other Resources

Assignments

(See my suggestion in discussion page for what should go here. Gmontane 23:18, 24 September 2012 (UTC))

Assignment 1: Due 10:30 AM Tuesday, Sep 25

Send to me (William Cohen), Aasish, and Bhavana an email listing 3 papers you plan to summarize with title, authors, link to on-line version

Any not-yet-summarized papers listing as "papers to review" on this years or last year’s syllabus (I guess last year they were "papers to present") are pre-approved - otherwise, it's fine to pick something new, and we will give you feedback if a paper is a problem. Also, don't propose papers I’ve presented in class, or papers that already have entries in the wiki.

Assignment 2: Due 10:30 AM Thus, Sep 27

Send in your first summary - by sending a link to the new wikipage to the instructor and TAs. Here are some example summaries:

Some examples without the new 'study guide' section are here:

Note that you are required to link the summary back into concepts on the wiki via the links AddressesProblem, Category, RelatedPaper, UsesMethod, and UsesDataset. It's ok to link to a problem/paper/method/dataset that's not yet on the wiki, BUT please search first to check if its there.

Assignment 3: Due 10:30 AM Tues, Oct 2

Send in the next two summaries.

Grading

Grades are based on:

  • The paper presentation
  • The project (writeup and presentation).
  • Class participation.

I use some discretion in assigning grades but my guidelines for grading are announced in the overview talk.