Topic modeling of research fields: An interdisciplinary perspective

Michael Paul, Roxana Girju

Research output: Contribution to journalConference article

Abstract

This paper addresses the problem of scientific research analysis. We use the topic model Latent Dirichlet Allocation [2] and a novel classifier to classify research papers based on topic and language. Moreover, we show various insightful statistics and correlations within and across three research fields: Linguistics, Computational Linguistics, and Education. In particular, we show how topics change over time within each field, what relations and influences exist between topics within and across fields, as well as what trends can be established for some of the world's natural languages. Finally, we talk about trend prediction and topic suggestion as future extensions of this research.

Original languageEnglish (US)
Pages (from-to)337-342
Number of pages6
JournalInternational Conference Recent Advances in Natural Language Processing, RANLP
StatePublished - Dec 1 2009
EventInternational Conference on Recent Advances in Natural Language Processing, RANLP-2009 - Borovets, Bulgaria
Duration: Sep 14 2009Sep 16 2009

Fingerprint

Computational linguistics
Linguistics
Classifiers
Education
Statistics

Keywords

  • Scientific research analysis
  • Statistical approaches
  • Topic models

ASJC Scopus subject areas

  • Software
  • Computer Science Applications
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Cite this

Topic modeling of research fields : An interdisciplinary perspective. / Paul, Michael; Girju, Roxana.

In: International Conference Recent Advances in Natural Language Processing, RANLP, 01.12.2009, p. 337-342.

Research output: Contribution to journalConference article

@article{a4b205c836e94325bf7b9a4fa39773f0,
title = "Topic modeling of research fields: An interdisciplinary perspective",
abstract = "This paper addresses the problem of scientific research analysis. We use the topic model Latent Dirichlet Allocation [2] and a novel classifier to classify research papers based on topic and language. Moreover, we show various insightful statistics and correlations within and across three research fields: Linguistics, Computational Linguistics, and Education. In particular, we show how topics change over time within each field, what relations and influences exist between topics within and across fields, as well as what trends can be established for some of the world's natural languages. Finally, we talk about trend prediction and topic suggestion as future extensions of this research.",
keywords = "Scientific research analysis, Statistical approaches, Topic models",
author = "Michael Paul and Roxana Girju",
year = "2009",
month = "12",
day = "1",
language = "English (US)",
pages = "337--342",
journal = "International Conference Recent Advances in Natural Language Processing, RANLP",
issn = "1313-8502",
publisher = "Association for Computational Linguistics (ACL)",

}

TY - JOUR

T1 - Topic modeling of research fields

T2 - An interdisciplinary perspective

AU - Paul, Michael

AU - Girju, Roxana

PY - 2009/12/1

Y1 - 2009/12/1

N2 - This paper addresses the problem of scientific research analysis. We use the topic model Latent Dirichlet Allocation [2] and a novel classifier to classify research papers based on topic and language. Moreover, we show various insightful statistics and correlations within and across three research fields: Linguistics, Computational Linguistics, and Education. In particular, we show how topics change over time within each field, what relations and influences exist between topics within and across fields, as well as what trends can be established for some of the world's natural languages. Finally, we talk about trend prediction and topic suggestion as future extensions of this research.

AB - This paper addresses the problem of scientific research analysis. We use the topic model Latent Dirichlet Allocation [2] and a novel classifier to classify research papers based on topic and language. Moreover, we show various insightful statistics and correlations within and across three research fields: Linguistics, Computational Linguistics, and Education. In particular, we show how topics change over time within each field, what relations and influences exist between topics within and across fields, as well as what trends can be established for some of the world's natural languages. Finally, we talk about trend prediction and topic suggestion as future extensions of this research.

KW - Scientific research analysis

KW - Statistical approaches

KW - Topic models

UR - http://www.scopus.com/inward/record.url?scp=84866849120&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84866849120&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84866849120

SP - 337

EP - 342

JO - International Conference Recent Advances in Natural Language Processing, RANLP

JF - International Conference Recent Advances in Natural Language Processing, RANLP

SN - 1313-8502

ER -