Reading to learn: Constructing features from semantic abstracts

Jacob Eisenstein, James Clarke, Dan Goldwasser, Dan Roth

Research output: Contribution to conferencePaper

Abstract

Machine learning offers a range of tools for training systems from data, but these methods are only as good as the underlying representation. This paper proposes to acquire representations for machine learning by reading text written to accommodate human learning. We propose a novel form of semantic analysis called reading to learn, where the goal is to obtain a high-level semantic abstract of multiple documents in a representation that facilitates learning. We obtain this abstract through a generative model that requires no labeled data, instead leveraging repetition across multiple documents. The semantic abstract is converted into a transformed feature space for learning, resulting in improved generalization on a relational learning task.

Original languageEnglish (US)
Pages958-967
Number of pages10
StatePublished - Dec 1 2009
Event2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009 - Singapore, Singapore
Duration: Aug 6 2009Aug 7 2009

Other

Other2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009
CountrySingapore
CitySingapore
Period8/6/098/7/09

Fingerprint

Semantics
Learning systems

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems

Cite this

Eisenstein, J., Clarke, J., Goldwasser, D., & Roth, D. (2009). Reading to learn: Constructing features from semantic abstracts. 958-967. Paper presented at 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009, Singapore, Singapore.

Reading to learn : Constructing features from semantic abstracts. / Eisenstein, Jacob; Clarke, James; Goldwasser, Dan; Roth, Dan.

2009. 958-967 Paper presented at 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009, Singapore, Singapore.

Research output: Contribution to conferencePaper

Eisenstein, J, Clarke, J, Goldwasser, D & Roth, D 2009, 'Reading to learn: Constructing features from semantic abstracts', Paper presented at 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009, Singapore, Singapore, 8/6/09 - 8/7/09 pp. 958-967.
Eisenstein J, Clarke J, Goldwasser D, Roth D. Reading to learn: Constructing features from semantic abstracts. 2009. Paper presented at 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009, Singapore, Singapore.
Eisenstein, Jacob ; Clarke, James ; Goldwasser, Dan ; Roth, Dan. / Reading to learn : Constructing features from semantic abstracts. Paper presented at 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009, Singapore, Singapore.10 p.
@conference{a9b8440bc6264839996fd6ceaf0896ba,
title = "Reading to learn: Constructing features from semantic abstracts",
abstract = "Machine learning offers a range of tools for training systems from data, but these methods are only as good as the underlying representation. This paper proposes to acquire representations for machine learning by reading text written to accommodate human learning. We propose a novel form of semantic analysis called reading to learn, where the goal is to obtain a high-level semantic abstract of multiple documents in a representation that facilitates learning. We obtain this abstract through a generative model that requires no labeled data, instead leveraging repetition across multiple documents. The semantic abstract is converted into a transformed feature space for learning, resulting in improved generalization on a relational learning task.",
author = "Jacob Eisenstein and James Clarke and Dan Goldwasser and Dan Roth",
year = "2009",
month = "12",
day = "1",
language = "English (US)",
pages = "958--967",
note = "2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009 ; Conference date: 06-08-2009 Through 07-08-2009",

}

TY - CONF

T1 - Reading to learn

T2 - Constructing features from semantic abstracts

AU - Eisenstein, Jacob

AU - Clarke, James

AU - Goldwasser, Dan

AU - Roth, Dan

PY - 2009/12/1

Y1 - 2009/12/1

N2 - Machine learning offers a range of tools for training systems from data, but these methods are only as good as the underlying representation. This paper proposes to acquire representations for machine learning by reading text written to accommodate human learning. We propose a novel form of semantic analysis called reading to learn, where the goal is to obtain a high-level semantic abstract of multiple documents in a representation that facilitates learning. We obtain this abstract through a generative model that requires no labeled data, instead leveraging repetition across multiple documents. The semantic abstract is converted into a transformed feature space for learning, resulting in improved generalization on a relational learning task.

AB - Machine learning offers a range of tools for training systems from data, but these methods are only as good as the underlying representation. This paper proposes to acquire representations for machine learning by reading text written to accommodate human learning. We propose a novel form of semantic analysis called reading to learn, where the goal is to obtain a high-level semantic abstract of multiple documents in a representation that facilitates learning. We obtain this abstract through a generative model that requires no labeled data, instead leveraging repetition across multiple documents. The semantic abstract is converted into a transformed feature space for learning, resulting in improved generalization on a relational learning task.

UR - http://www.scopus.com/inward/record.url?scp=80053432060&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=80053432060&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:80053432060

SP - 958

EP - 967

ER -