TY - GEN
T1 - Facts or friends? distinguishing informational and conversational questions in social Q and A sites
AU - Harper, Max
AU - Moy, Daniel
AU - Konstan, Joseph A
PY - 2009
Y1 - 2009
N2 - Tens of thousands of questions are asked and answered every day on social question and answer (Q&A) Web sites such as Yahoo Answers. While these sites generate an enormous volume of searchable data, the problem of determining which questions and answers are archival quality has grown. One major component of this problem is the prevalence of conversational questions, identified both by Q&A sites and academic literature as questions that are intended simply to start discussion. For example, a conversational question such as ikdo you believe in evolution?" might successfully engage users in discussion, but probably will not yield a useful web page for users searching for information about evolution. Using data from three popular Q&A sites, we confirm that humans can reliably distinguish between these conversational questions and other informational questions, and present evidence that conversational questions typically have much lower potential archival value than informational questions. Further, we explore the use of machine learning techniques to automatically classify questions as conversational or informational, learning in the process about categorical, linguistic, and social differences between different question types. Our algorithms approach human performance, attaining 89.7% classification accuracy in our experiments.
AB - Tens of thousands of questions are asked and answered every day on social question and answer (Q&A) Web sites such as Yahoo Answers. While these sites generate an enormous volume of searchable data, the problem of determining which questions and answers are archival quality has grown. One major component of this problem is the prevalence of conversational questions, identified both by Q&A sites and academic literature as questions that are intended simply to start discussion. For example, a conversational question such as ikdo you believe in evolution?" might successfully engage users in discussion, but probably will not yield a useful web page for users searching for information about evolution. Using data from three popular Q&A sites, we confirm that humans can reliably distinguish between these conversational questions and other informational questions, and present evidence that conversational questions typically have much lower potential archival value than informational questions. Further, we explore the use of machine learning techniques to automatically classify questions as conversational or informational, learning in the process about categorical, linguistic, and social differences between different question types. Our algorithms approach human performance, attaining 89.7% classification accuracy in our experiments.
KW - Machine learning
KW - Online community
KW - Q and A
UR - http://www.scopus.com/inward/record.url?scp=84892459016&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84892459016&partnerID=8YFLogxK
U2 - 10.1145/1518701.1518819
DO - 10.1145/1518701.1518819
M3 - Conference contribution
AN - SCOPUS:84892459016
SN - 9781605582474
T3 - Conference on Human Factors in Computing Systems - Proceedings
SP - 759
EP - 768
BT - CHI 2009
T2 - 27th International Conference Extended Abstracts on Human Factors in Computing Systems, CHI 2009
Y2 - 4 April 2009 through 9 April 2009
ER -