Stack Overflow

From Cohen Courses
Jump to navigationJump to search

Stack Overflow is a language-independent collaboratively edited question and answering website for programmers. Stack Overflow dataset is publicly available from StackOverflow under a Creative Commons license. One can download the latest version from here.

Here are some of the statistics about the data used in the paper Anderson_et_al_KDD2012

  • Users 440K (198K questioners, 71K answerers)
  • Questions 1M (69% with accepted answer)
  • Answers 2.8M (26% marked as accepted)
  • Votes 7.6M (93% positive)
  • Favorites 775K actions on 318K questions