Cross-cultural analysis of blogs and forums with mixed-collection topic models

Michael Paul, Roxana Girju

Research output: Contribution to conference; Paper; peer-review

Abstract

This paper presents preliminary results on detecting cultural differences from people's experiences in various countries, seen from two perspectives: tourists and locals. Our approach is to develop probabilistic models that provide a suitable framework for such studies. We propose a new model, ccLDA, which extends the Latent Dirichlet Allocation (LDA) (Blei et al., 2003) and cross-collection mixture (ccMix) (Zhai et al., 2004) models, and apply it to blogs and forums. We also provide a qualitative and quantitative analysis of the model on the cross-cultural data.

Original language: English (US)
Pages: 1408-1417
Number of pages: 10
State: Published - 2009
Event: 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009 - Singapore, Singapore
Duration: Aug 6 2009 to Aug 7 2009

Other

Other: 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Held in Conjunction with ACL-IJCNLP 2009
Country/Territory: Singapore
City: Singapore
Period: 8/6/09 to 8/7/09

ASJC Scopus subject areas

  • Information Systems
  • Computational Theory and Mathematics
  • Computer Science Applications
