Analyzing Community driven Question Answering Sites

From Cohen Courses
Revision as of 07:59, 9 October 2012 by Anikag (talk | contribs)
Jump to navigationJump to search

Team Members

Anika Gupta

Shourabh Rawat

Abstract

Question answering communities such as Yahoo! Answers and StackOverflow have emerged as popular as well as effective means of information resource on the web. One interesting analysis is to keep track of the lifetime of a question. We also plan to solve the problem of identifying sufficiently answered questions. Given a question, identifying the expertise in a domain is also an interesting question whose answer we ll try to find.

Datasets

The Stack Overflow Data used in this paper is publicly available from StackOverflow under a Creative Commons license. One can download the latest version from here.

Here are some of the statistics about the data used by the authors:

  • Users 440K (198K questioners, 71K answerers)
  • Questions 1M (69% with accepted answer)
  • Answers 2.8M (26% marked as accepted)
  • Votes 7.6M (93% positive)
  • Favorites 775K actions on 318K questions

Baseline

Techniques Used

Challenges

Relevant Literature