Application of a Landmark-Based Method for Acoustic Analysis of Dysphonic Speech

Keiko Ishikawa, Marepalli B. Rao, Joel MacAuslan, Suzanne Boyce

Research output: Contribution to journal › Article

Abstract

Aim: Speakers with dysphonia often report difficulty maintaining intelligibility in noisy environments; however, there is no objective method for characterizing this difficulty. Landmark-based analysis is a linguistically motivated, knowledge-based speech analysis technique that may serve as the basis of an acoustic tool for describing the intelligibility deficit. As the first step toward development of such a tool, this study examined whether Landmark-based analysis could describe acoustic differences between normal and dysphonic speech. Method: The recordings subjected to the Landmark-based analysis were the first sentence of the Rainbow Passage from 33 speakers with normal voice and 36 speakers with dysphonia. These recordings were selected from the Kay Elemetrics Database of Disordered Voice. The between-group difference was evaluated based on counts of certain Landmarks (LM). Results: The average counts of all LMs were significantly greater in normal speech, t(66.85) = 2.36, P = 0.02. When the group difference was examined for each LM, dysphonic speech had more [g] and [b] LMs and fewer [s] LMs than normal speech (P < 0.01 for all cases). A classification tree model identified [+s] and [+b] LMs as the primary predictors of dysphonic speech. The model's misclassification rate was 7.24%. Conclusions: This preliminary investigation demonstrates that LM-based analysis is capable of differentiating dysphonic speech from normal speech. This encouraging result justifies future examinations of LM analysis in other areas of interest. For example, LM-based measures could conceivably be used to quantify general intelligibility and/or provide insight into underlying mechanisms of intelligibility deficits.
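
Note on the reported statistic: the fractional degrees of freedom in t(66.85) suggest an unequal-variance (Welch) t-test, although the abstract does not name the test, so this is an assumption. The following minimal sketch shows how such a non-integer value can arise from the Welch-Satterthwaite formula. The function name and the summary statistics are hypothetical and are not the study's data; only the group sizes (33 and 36) come from the abstract.

```python
import math


def welch_t_and_df(mean1, var1, n1, mean2, var2, n2):
    """Return Welch's t statistic and the Welch-Satterthwaite degrees of freedom.

    var1 and var2 are sample variances; n1 and n2 are group sizes.
    """
    se1 = var1 / n1  # squared standard error contribution of group 1
    se2 = var2 / n2  # squared standard error contribution of group 2
    t = (mean1 - mean2) / math.sqrt(se1 + se2)
    df = (se1 + se2) ** 2 / (se1 ** 2 / (n1 - 1) + se2 ** 2 / (n2 - 1))
    return t, df


# Hypothetical summary statistics (NOT taken from the study); only the
# group sizes 33 (normal voice) and 36 (dysphonia) match the abstract.
t, df = welch_t_and_df(mean1=50.0, var1=36.0, n1=33,
                       mean2=46.0, var2=49.0, n2=36)
print(f"t({df:.2f}) = {t:.2f}")
```

With group sizes of 33 and 36, the Welch-Satterthwaite value can never exceed 33 + 36 - 2 = 67, so a reported value of 66.85 would be consistent with two groups of nearly equal variance.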

Original language: English (US)
Journal: Journal of Voice
DOIs: 10.1016/j.jvoice.2018.12.017
State: Accepted/In press - Jan 1 2019

Fingerprint

Acoustics
Dysphonia
Databases

Keywords

  • Acoustic speech analysis
  • Dysphonia
  • Intelligibility
  • Landmark analysis

ASJC Scopus subject areas

  • Otorhinolaryngology
  • Speech and Hearing
  • LPN and LVN

Cite this

Application of a Landmark-Based Method for Acoustic Analysis of Dysphonic Speech. / Ishikawa, Keiko; Rao, Marepalli B.; MacAuslan, Joel; Boyce, Suzanne.

In: Journal of Voice, 01.01.2019.

@article{dcec60cfa7314950a5a18050917125bc,
title = "Application of a Landmark-Based Method for Acoustic Analysis of Dysphonic Speech",
abstract = "Aim: Speakers with dysphonia often report difficulty maintaining intelligibility in noisy environments; however, there is no objective method for characterizing this difficulty. Landmark-based analysis is a linguistically motivated, knowledge-based speech analysis technique that may serve as the basis of an acoustic tool for describing the intelligibility deficit. As the first step toward development of such a tool, this study examined whether Landmark-based analysis could describe acoustic differences between normal and dysphonic speech. Method: The recordings subjected to the Landmark-based analysis were the first sentence of the Rainbow Passage from 33 speakers with normal voice and 36 speakers with dysphonia. These recordings were selected from the Kay Elemetrics Database of Disordered Voice. The between-group difference was evaluated based on counts of certain Landmarks (LM). Results: The average counts of all LMs were significantly greater in normal speech, t(66.85) = 2.36, P = 0.02. When the group difference was examined for each LM, dysphonic speech had more [g] and [b] LMs and fewer [s] LMs than normal speech (P < 0.01 for all cases). A classification tree model identified [+s] and [+b] LMs as the primary predictors of dysphonic speech. The model's misclassification rate was 7.24{\%}. Conclusions: This preliminary investigation demonstrates that LM-based analysis is capable of differentiating dysphonic speech from normal speech. This encouraging result justifies future examinations of LM analysis in other areas of interest. For example, LM-based measures could conceivably be used to quantify general intelligibility and/or provide insight into underlying mechanisms of intelligibility deficits.",
keywords = "Acoustic speech analysis, Dysphonia, Intelligibility, Landmark analysis",
author = "Keiko Ishikawa and Rao, {Marepalli B.} and Joel MacAuslan and Suzanne Boyce",
year = "2019",
month = "1",
day = "1",
doi = "10.1016/j.jvoice.2018.12.017",
language = "English (US)",
journal = "Journal of Voice",
issn = "0892-1997",
publisher = "Mosby Inc.",

}

TY - JOUR

T1 - Application of a Landmark-Based Method for Acoustic Analysis of Dysphonic Speech

AU - Ishikawa, Keiko

AU - Rao, Marepalli B.

AU - MacAuslan, Joel

AU - Boyce, Suzanne

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Aim: Speakers with dysphonia often report difficulty maintaining intelligibility in noisy environments; however, there is no objective method for characterizing this difficulty. Landmark-based analysis is a linguistically motivated, knowledge-based speech analysis technique that may serve as the basis of an acoustic tool for describing the intelligibility deficit. As the first step toward development of such a tool, this study examined whether Landmark-based analysis could describe acoustic differences between normal and dysphonic speech. Method: The recordings subjected to the Landmark-based analysis were the first sentence of the Rainbow Passage from 33 speakers with normal voice and 36 speakers with dysphonia. These recordings were selected from the Kay Elemetrics Database of Disordered Voice. The between-group difference was evaluated based on counts of certain Landmarks (LM). Results: The average counts of all LMs were significantly greater in normal speech, t(66.85) = 2.36, P = 0.02. When the group difference was examined for each LM, dysphonic speech had more [g] and [b] LMs and fewer [s] LMs than normal speech (P < 0.01 for all cases). A classification tree model identified [+s] and [+b] LMs as the primary predictors of dysphonic speech. The model's misclassification rate was 7.24%. Conclusions: This preliminary investigation demonstrates that LM-based analysis is capable of differentiating dysphonic speech from normal speech. This encouraging result justifies future examinations of LM analysis in other areas of interest. For example, LM-based measures could conceivably be used to quantify general intelligibility and/or provide insight into underlying mechanisms of intelligibility deficits.

AB - Aim: Speakers with dysphonia often report difficulty maintaining intelligibility in noisy environments; however, there is no objective method for characterizing this difficulty. Landmark-based analysis is a linguistically motivated, knowledge-based speech analysis technique that may serve as the basis of an acoustic tool for describing the intelligibility deficit. As the first step toward development of such a tool, this study examined whether Landmark-based analysis could describe acoustic differences between normal and dysphonic speech. Method: The recordings subjected to the Landmark-based analysis were the first sentence of the Rainbow Passage from 33 speakers with normal voice and 36 speakers with dysphonia. These recordings were selected from the Kay Elemetrics Database of Disordered Voice. The between-group difference was evaluated based on counts of certain Landmarks (LM). Results: The average counts of all LMs were significantly greater in normal speech, t(66.85) = 2.36, P = 0.02. When the group difference was examined for each LM, dysphonic speech had more [g] and [b] LMs and fewer [s] LMs than normal speech (P < 0.01 for all cases). A classification tree model identified [+s] and [+b] LMs as the primary predictors of dysphonic speech. The model's misclassification rate was 7.24%. Conclusions: This preliminary investigation demonstrates that LM-based analysis is capable of differentiating dysphonic speech from normal speech. This encouraging result justifies future examinations of LM analysis in other areas of interest. For example, LM-based measures could conceivably be used to quantify general intelligibility and/or provide insight into underlying mechanisms of intelligibility deficits.

KW - Acoustic speech analysis

KW - Dysphonia

KW - Intelligibility

KW - Landmark analysis

UR - http://www.scopus.com/inward/record.url?scp=85059800986&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85059800986&partnerID=8YFLogxK

U2 - 10.1016/j.jvoice.2018.12.017

DO - 10.1016/j.jvoice.2018.12.017

M3 - Article

JO - Journal of Voice

JF - Journal of Voice

SN - 0892-1997

ER -