Finding intonational boundaries using acoustic cues related to the voice source

Jeung Yoon Choi, Mark Hasegawa-Johnson, Jennifer Cole

Research output: Contribution to journalArticlepeer-review

Abstract

Acoustic cues related to the voice source, including harmonic structure and spectral tilt, were examined for relevance to prosodic boundary detection. The measurements considered here comprise five categories: duration, pitch, harmonic structure, spectral tilt, and amplitude. Distributions of the measurements and statistical analysis show that the measurements may be used to differentiate between prosodic categories. Detection experiments on the Boston University Radio Speech Corpus show equal error detection rates around 70% for accent and boundary detection, using only the acoustic measurements described, without any lexical or syntactic information. Further investigation of the detection results shows that duration and amplitude measurements, and, to a lesser degree, pitch measurements, are useful for detecting accents, while all voice source measurements except pitch measurements are useful for boundary detection.

Original languageEnglish (US)
Pages (from-to)2579-2587
Number of pages9
JournalJournal of the Acoustical Society of America
Volume118
Issue number4
DOIs
StatePublished - Oct 2005

ASJC Scopus subject areas

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Finding intonational boundaries using acoustic cues related to the voice source'. Together they form a unique fingerprint.

Cite this