TY - GEN
T1 - Learning from negative examples in set-expansion
AU - Jindal, Prateek
AU - Roth, Dan
PY - 2011
Y1 - 2011
N2 - This paper addresses the task of set-expansion on free text. Set-expansion has been viewed as a problem of generating an extensive list of instances of a concept of interest, given a few examples of the concept as input. Our key contribution is that we show that the concept definition can be significantly improved by specifying some negative examples in the input, along with the positive examples. The state-of-the art centroid-based approach to set-expansion doesn't readily admit the negative examples. We develop an inference-based approach to set-expansion which naturally allows for negative examples and show that it performs significantly better than a strong baseline.
AB - This paper addresses the task of set-expansion on free text. Set-expansion has been viewed as a problem of generating an extensive list of instances of a concept of interest, given a few examples of the concept as input. Our key contribution is that we show that the concept definition can be significantly improved by specifying some negative examples in the input, along with the positive examples. The state-of-the art centroid-based approach to set-expansion doesn't readily admit the negative examples. We develop an inference-based approach to set-expansion which naturally allows for negative examples and show that it performs significantly better than a strong baseline.
UR - http://www.scopus.com/inward/record.url?scp=84857174697&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84857174697&partnerID=8YFLogxK
U2 - 10.1109/ICDM.2011.86
DO - 10.1109/ICDM.2011.86
M3 - Conference contribution
AN - SCOPUS:84857174697
SN - 9780769544083
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 1110
EP - 1115
BT - Proceedings - 11th IEEE International Conference on Data Mining, ICDM 2011
T2 - 11th IEEE International Conference on Data Mining, ICDM 2011
Y2 - 11 December 2011 through 14 December 2011
ER -