Stack Overflow
From Cohen Courses
Jump to navigationJump to search
Stack Overflow is a language-independent collaboratively edited question and answering website for programmers. Stack Overflow dataset is publicly available from StackOverflow under a Creative Commons license. One can download the latest version from here.
Here are some of the statistics about the data used in the paper Anderson_et_al_KDD2012
- Users 440K (198K questioners, 71K answerers)
- Questions 1M (69% with accepted answer)
- Answers 2.8M (26% marked as accepted)
- Votes 7.6M (93% positive)
- Favorites 775K actions on 318K questions