TY - JOUR
T1 - Summarization of an online medical encyclopedia
AU - Fiszman, Marcelo
AU - Rindflesch, Thomas C.
AU - Kilicoglu, Halil
N1 - Funding Information:
The first author was supported by an appointment to the National Library of Medicine Research Participation Program administered by the Oak Ridge Institute for Science and Education through an inter-agency agreement between the U.S. Department of Energy and the National Library of Medicine. The authors would like to acknowledge A.D.A.M. for allowing us to process content information from the Health Illustrated Encyclopedia.
PY - 2004
Y1 - 2004
N2 - We explore a knowledge-rich (abstraction) approach to summarization and apply it to multiple documents from an online medical encyclopedia. A semantic processor functions as the source interpreter and produces a list of predications. A transformation stage then generalizes and condenses this list, ultimately generating a conceptual condensate for a given disorder topic. We provide a preliminary evaluation of the quality of the condensates produced for a sample of four disorders. The overall precision of the disorder conceptual condensates was 87%, and the compression ratio from the base list of predications to the final condensate was 98%. The conceptual condensate could be used as input to a text generator to produce a natural language summary for a given disorder topic.
AB - We explore a knowledge-rich (abstraction) approach to summarization and apply it to multiple documents from an online medical encyclopedia. A semantic processor functions as the source interpreter and produces a list of predications. A transformation stage then generalizes and condenses this list, ultimately generating a conceptual condensate for a given disorder topic. We provide a preliminary evaluation of the quality of the condensates produced for a sample of four disorders. The overall precision of the disorder conceptual condensates was 87%, and the compression ratio from the base list of predications to the final condensate was 98%. The conceptual condensate could be used as input to a text generator to produce a natural language summary for a given disorder topic.
KW - Automatic Summarization
KW - Knowledge Representation
KW - Natural Language Processing
UR - http://www.scopus.com/inward/record.url?scp=70349454395&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349454395&partnerID=8YFLogxK
U2 - 10.3233/978-1-60750-949-3-506
DO - 10.3233/978-1-60750-949-3-506
M3 - Article
C2 - 15360864
AN - SCOPUS:70349454395
SN - 0926-9630
VL - 107
SP - 506
EP - 510
JO - Studies in health technology and informatics
JF - Studies in health technology and informatics
ER -