On the role of poetic versus nonpoetic features in "kindred" and diachronic poetry attribution

Brent D. Fegley, Vetle I. Torvik

Research output: Contribution to journalArticlepeer-review

Abstract

Author attribution studies have demonstrated remarkable success in applying orthographic and lexicographic features of text in a variety of discrimination problems. What might poetic features, such as syllabic stress and mood, contribute? We address this question in the context of two different attribution problems: (a) kindred: differentiate Langston Hughes' early poems from those of kindred poets and (b) diachronic: differentiate Hughes' early from his later poems. Using a diverse set of 535 generic text features, each categorized as poetic or nonpoetic, correlation-based greedy forward search ranked the features and a support vector machine classified the poems. A small subset of features (∼10) achieved cross-validated precision and recall as high as 87%. Poetic features (rhyme patterns particularly) were nearly as effective as nonpoetic in kindred discrimination, but less effective diachronically. In other words, Hughes used both poetic and nonpoetic features in distinctive ways and his use of nonpoetic features evolved systematically while he continued to experiment with poetic features. These findings affirm qualitative studies attesting to structural elements from Black oral tradition and Black folk music (blues) and to the internal consistency of Hughes' early poetry.

Original languageEnglish (US)
Pages (from-to)2165-2181
Number of pages17
JournalJournal of the American Society for Information Science and Technology
Volume63
Issue number11
DOIs
StatePublished - Nov 2012

Keywords

  • computational linguistics
  • machine learning
  • natural language processing

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Human-Computer Interaction
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'On the role of poetic versus nonpoetic features in "kindred" and diachronic poetry attribution'. Together they form a unique fingerprint.

Cite this