The Million Song dataset

From Cohen Courses
Revision as of 10:57, 28 August 2012 by Wcohen (talk | contribs) (Created page with 'This is one of the [[Category::Dataset|datasets]] discussed as a possible project dataset in Social Media Analysis 10-802 in Fall 2012. The core of the dataset is the featur…')
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This is one of the datasets discussed as a possible project dataset in Social Media Analysis 10-802 in Fall 2012.

The core of the dataset is the feature analysis and metadata for one million songs, provided by The Echo Nest. The dataset does not include any audio, only the derived features. (But sample audio can be fetched from services). For social media analysis purposes, there are some related datasets that are quite interesting - include user-based playcount data, song-level tags and similarity, and lyrics.


  • External link: [1]

Relevant Papers