Abstract
Community Question Answering (CQA) sites, such as Stack Overflow and Yahoo! Answers, have become very popular in recent years. These sites contain rich crowdsourcing knowledge contributed by the site users in the form of questions and answers, and these questions and answers can satisfy the information needs of more users. In this article, we aim at predicting the voting scores of questions/answers shortly after they are posted in the CQA sites. To accomplish this task, we identify three key aspects that matter with the voting of a post, i.e., the non-linear relationships between features and output, the question and answer coupling, and the dynamic fashion of data arrivals. A family of algorithms are proposed to model the above three key aspects. Some approximations and extensions are also proposed to scale up the computation. We analyze the proposed algorithms in terms of optimality, correctness, and complexity. Extensive experimental evaluations conducted on two real data sets demonstrate the effectiveness and efficiency of our algorithms.
Original language | English (US) |
---|---|
Article number | 7906587 |
Pages (from-to) | 1723-1736 |
Number of pages | 14 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 29 |
Issue number | 8 |
DOIs | |
State | Published - Aug 2017 |
Externally published | Yes |
Keywords
- Question answering
- coupling
- dynamics
- non-linearity
- voting prediction
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics