TY - GEN
T1 - Resolving healthcare forum posts via similar thread retrieval
AU - Cho, Jason H.D.
AU - Sondhi, Parikshit
AU - Zhai, Chengxiang
AU - Schatz, Bruce
N1 - Publisher Copyright:
Copyright © 2014 ACM.
PY - 2014/9/20
Y1 - 2014/9/20
N2 - Web communities such as healthcare web forums serve as popular platforms for users to get their complex medical queries resolved. A typical forum thread contains a query in its first post, and a discussion around it in subsequent posts. However many users do not receive satisfactory re- sponses from other members in the community, leaving them dissatisfied. We propose to help these users by exploiting an existing collection of discussion threads. Often many users suffer from the same medical condition and start multiple discussion threads on very similar queries. In this paper we develop and evaluate a plethora of special- ized search methods that treat an entire unresolved forum post as a query, and retrieve forum threads discussing simi- lar problems to help resolve it. The task is more challenging than a traditional document retrieval problem, since forum posts can contain a lot of irrelevant background information. The discussion threads to be retrieved are also quite different from traditional unstructured text documents. We evaluate our results on a dataset comprising over 350K discussion threads and show that our proposed methods outperform state of the art retrieval methods for the task. In particular, method based on non-uniform weighting of thread posts and semantic analysis of the query text perform quite well.
AB - Web communities such as healthcare web forums serve as popular platforms for users to get their complex medical queries resolved. A typical forum thread contains a query in its first post, and a discussion around it in subsequent posts. However many users do not receive satisfactory re- sponses from other members in the community, leaving them dissatisfied. We propose to help these users by exploiting an existing collection of discussion threads. Often many users suffer from the same medical condition and start multiple discussion threads on very similar queries. In this paper we develop and evaluate a plethora of special- ized search methods that treat an entire unresolved forum post as a query, and retrieve forum threads discussing simi- lar problems to help resolve it. The task is more challenging than a traditional document retrieval problem, since forum posts can contain a lot of irrelevant background information. The discussion threads to be retrieved are also quite different from traditional unstructured text documents. We evaluate our results on a dataset comprising over 350K discussion threads and show that our proposed methods outperform state of the art retrieval methods for the task. In particular, method based on non-uniform weighting of thread posts and semantic analysis of the query text perform quite well.
KW - Forum thread retrieval
KW - Medical case retrieval
KW - Recommender system
KW - Shallow information extraction
KW - Web forums
UR - http://www.scopus.com/inward/record.url?scp=84920716114&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84920716114&partnerID=8YFLogxK
U2 - 10.1145/2649387.2649399
DO - 10.1145/2649387.2649399
M3 - Conference contribution
AN - SCOPUS:84920716114
T3 - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
SP - 33
EP - 42
BT - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery
T2 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014
Y2 - 20 September 2014 through 23 September 2014
ER -