Analyzing Community driven Question Answering Sites
From Cohen Courses
Jump to navigationJump to searchContents
Team Members
Abstract
Question answering communities such as Yahoo! Answers and StackOverflow have emerged as popular as well as effective means of information resource on the web. One interesting analysis is to keep track of the lifetime of a question. We also plan to solve the problem of identifying sufficiently answered questions. Given a question, identifying the expertise in a domain is also an interesting question whose answer we ll try to find.
Datasets
The Stack Overflow Data used in this paper is publicly available from StackOverflow under a Creative Commons license. One can download the latest version from here.
Here are some of the statistics about the data used by the authors:
- Users 440K (198K questioners, 71K answerers)
- Questions 1M (69% with accepted answer)
- Answers 2.8M (26% marked as accepted)
- Votes 7.6M (93% positive)
- Favorites 775K actions on 318K questions