Shallow information extraction from medical forum data

Parikshit Sondhi, Manish Gupta, Cheng Xiang Zhai, Julia Hockenmaier

Research output: Contribution to conferencePaper

Abstract

We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical forum documents, our goal is to extract two related types of sentences that describe a biomedical case (i.e., medical problem descriptions and medical treatment descriptions). Such an extraction task directly generates medical case descriptions that can be useful in many applications. We solve the problem using two popular machine learning methods Support VectorMachines (SVM) and Conditional Random Fields (CRF). We propose novel features to improve the accuracy of extraction. Experiment results show that we can obtain an accuracy of up to 75%.

Original languageEnglish (US)
Pages1158-1166
Number of pages9
StatePublished - Dec 1 2010
Event23rd International Conference on Computational Linguistics, Coling 2010 - Beijing, China
Duration: Aug 23 2010Aug 27 2010

Other

Other23rd International Conference on Computational Linguistics, Coling 2010
CountryChina
CityBeijing
Period8/23/108/27/10

Fingerprint

learning method
Medical problems
physician's care
Learning systems
experiment
Information Extraction
Experiments
Medical Treatment
Experiment
Machine Learning

ASJC Scopus subject areas

  • Language and Linguistics
  • Computational Theory and Mathematics
  • Linguistics and Language

Cite this

Sondhi, P., Gupta, M., Zhai, C. X., & Hockenmaier, J. (2010). Shallow information extraction from medical forum data. 1158-1166. Paper presented at 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China.

Shallow information extraction from medical forum data. / Sondhi, Parikshit; Gupta, Manish; Zhai, Cheng Xiang; Hockenmaier, Julia.

2010. 1158-1166 Paper presented at 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China.

Research output: Contribution to conferencePaper

Sondhi, P, Gupta, M, Zhai, CX & Hockenmaier, J 2010, 'Shallow information extraction from medical forum data', Paper presented at 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China, 8/23/10 - 8/27/10 pp. 1158-1166.
Sondhi P, Gupta M, Zhai CX, Hockenmaier J. Shallow information extraction from medical forum data. 2010. Paper presented at 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China.
Sondhi, Parikshit ; Gupta, Manish ; Zhai, Cheng Xiang ; Hockenmaier, Julia. / Shallow information extraction from medical forum data. Paper presented at 23rd International Conference on Computational Linguistics, Coling 2010, Beijing, China.9 p.
@conference{1752eaafd2da4af4ab9558983ef36fe9,
title = "Shallow information extraction from medical forum data",
abstract = "We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical forum documents, our goal is to extract two related types of sentences that describe a biomedical case (i.e., medical problem descriptions and medical treatment descriptions). Such an extraction task directly generates medical case descriptions that can be useful in many applications. We solve the problem using two popular machine learning methods Support VectorMachines (SVM) and Conditional Random Fields (CRF). We propose novel features to improve the accuracy of extraction. Experiment results show that we can obtain an accuracy of up to 75{\%}.",
author = "Parikshit Sondhi and Manish Gupta and Zhai, {Cheng Xiang} and Julia Hockenmaier",
year = "2010",
month = "12",
day = "1",
language = "English (US)",
pages = "1158--1166",
note = "23rd International Conference on Computational Linguistics, Coling 2010 ; Conference date: 23-08-2010 Through 27-08-2010",

}

TY - CONF

T1 - Shallow information extraction from medical forum data

AU - Sondhi, Parikshit

AU - Gupta, Manish

AU - Zhai, Cheng Xiang

AU - Hockenmaier, Julia

PY - 2010/12/1

Y1 - 2010/12/1

N2 - We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical forum documents, our goal is to extract two related types of sentences that describe a biomedical case (i.e., medical problem descriptions and medical treatment descriptions). Such an extraction task directly generates medical case descriptions that can be useful in many applications. We solve the problem using two popular machine learning methods Support VectorMachines (SVM) and Conditional Random Fields (CRF). We propose novel features to improve the accuracy of extraction. Experiment results show that we can obtain an accuracy of up to 75%.

AB - We study a novel shallow information extraction problem that involves extracting sentences of a given set of topic categories from medical forum data. Given a corpus of medical forum documents, our goal is to extract two related types of sentences that describe a biomedical case (i.e., medical problem descriptions and medical treatment descriptions). Such an extraction task directly generates medical case descriptions that can be useful in many applications. We solve the problem using two popular machine learning methods Support VectorMachines (SVM) and Conditional Random Fields (CRF). We propose novel features to improve the accuracy of extraction. Experiment results show that we can obtain an accuracy of up to 75%.

UR - http://www.scopus.com/inward/record.url?scp=80053268950&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053268950&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:80053268950

SP - 1158

EP - 1166

ER -