Difference between revisions of "Analyzing Community driven Question Answering Sites"

From Cohen Courses
Jump to navigationJump to search
(Created page with '== Team Members == Anika Gupta Shourabh Rawat == Abstract == Question answering communities such as [http://answers.yahoo.com/ Yahoo! Answers] a…')
 
Line 6: Line 6:
 
== Abstract ==
 
== Abstract ==
 
Question answering communities such as [http://answers.yahoo.com/ Yahoo! Answers] and [http://stackoverflow.com/ StackOverflow] have emerged as popular as well as effective means of information resource on the web.  
 
Question answering communities such as [http://answers.yahoo.com/ Yahoo! Answers] and [http://stackoverflow.com/ StackOverflow] have emerged as popular as well as effective means of information resource on the web.  
 
+
One interesting analysis is to keep track of the lifetime of a question. We also plan to solve the problem of identifying sufficiently answered questions. Given a question, identifying the expertise in a domain is also an interesting question whose answer we
 +
ll try to find.
 
== Datasets ==
 
== Datasets ==
 
The [[UsesDataset::Stack Overflow|Stack Overflow Data ]] used in this paper is publicly available from StackOverflow under a Creative Commons license.
 
The [[UsesDataset::Stack Overflow|Stack Overflow Data ]] used in this paper is publicly available from StackOverflow under a Creative Commons license.
Line 18: Line 19:
 
* Votes 7.6M (93% positive)
 
* Votes 7.6M (93% positive)
 
* Favorites 775K actions on 318K questions
 
* Favorites 775K actions on 318K questions
 +
 +
== Baseline==
 +
 +
==Techniques Used ==
 +
 +
== Challenges==
 +
*
 +
== Relevant Literature ==

Revision as of 08:59, 9 October 2012

Team Members

Anika Gupta

Shourabh Rawat

Abstract

Question answering communities such as Yahoo! Answers and StackOverflow have emerged as popular as well as effective means of information resource on the web. One interesting analysis is to keep track of the lifetime of a question. We also plan to solve the problem of identifying sufficiently answered questions. Given a question, identifying the expertise in a domain is also an interesting question whose answer we ll try to find.

Datasets

The Stack Overflow Data used in this paper is publicly available from StackOverflow under a Creative Commons license. One can download the latest version from here.

Here are some of the statistics about the data used by the authors:

  • Users 440K (198K questioners, 71K answerers)
  • Questions 1M (69% with accepted answer)
  • Answers 2.8M (26% marked as accepted)
  • Votes 7.6M (93% positive)
  • Favorites 775K actions on 318K questions

Baseline

Techniques Used

Challenges

Relevant Literature