Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora

Jennifer Cole, Mark Allan Hasegawa-Johnson, Dan Loehr, Linda Van Guilder, Henning Reetz, Stefan A. Frisch

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This article introduces a wide range of approaches to using large bodies of data for linguistic research. Corpus analysis for phonological research involves the investigation of the phonetic, phonological, and lexical properties of speech for the purpose of understanding the patterns of variation in the phonetic expression of words, and the distributional patterns of sound elements in relation to the linguistic context. A speech corpus provides a basis for investigating variability in phonetic form and also provides a rich resource for studying the relationship between phonological form and other levels of linguistic structure. Linguistic metadata provides information about the speakers, such as sex, age, ethnicity, and region of residence. Metadata may also provide information about speaker recruitment and recording procedures. Forced alignment is done using algorithms from automatic speech recognition (ASR), and is most successful when each phone associated with the word in its dictionary form is actually fully pronounced. One of the easiest methods of manipulating natural speech is the splicing technique, where parts of a speech signal are cut out, repeated, or cross-spliced with another piece of the signal. The gating technique is another form of natural speech signal manipulation often applied in psycholinguistic experiments, where parts of a speech signal are cut off, and incrementally more of the signal is presented to a listener. Another speech signal manipulation is the mixing of two signals.

Original languageEnglish (US)
Title of host publicationThe Oxford Handbook of Laboratory Phonology
PublisherOxford University Press
ISBN (Electronic)9780191744068
ISBN (Print)9780199575039
DOIs
StatePublished - Sep 18 2012

Fingerprint

phonology
manipulation
Internet
resources
phonetics
linguistics
Resources
Phonology
Data Base
Manipulation
Phonotactics
World Wide Web
psycholinguistics
listener
dictionary
recording
ethnicity
experiment

Keywords

  • Automatic speech recognition
  • Corpus analysis
  • Gating technique
  • Lexical properties
  • Linguistic metadata
  • Phonology
  • Speech signal manipulation
  • Usage frequency

ASJC Scopus subject areas

  • Arts and Humanities(all)
  • Social Sciences(all)

Cite this

Corpora, Databases, and Internet Resources : Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora. / Cole, Jennifer; Hasegawa-Johnson, Mark Allan; Loehr, Dan; Guilder, Linda Van; Reetz, Henning; Frisch, Stefan A.

The Oxford Handbook of Laboratory Phonology. Oxford University Press, 2012.

Research output: Chapter in Book/Report/Conference proceedingChapter

@inbook{89c167aea5e14797bb76e8c42cc7e025,
title = "Corpora, Databases, and Internet Resources: Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora",
abstract = "This article introduces a wide range of approaches to using large bodies of data for linguistic research. Corpus analysis for phonological research involves the investigation of the phonetic, phonological, and lexical properties of speech for the purpose of understanding the patterns of variation in the phonetic expression of words, and the distributional patterns of sound elements in relation to the linguistic context. A speech corpus provides a basis for investigating variability in phonetic form and also provides a rich resource for studying the relationship between phonological form and other levels of linguistic structure. Linguistic metadata provides information about the speakers, such as sex, age, ethnicity, and region of residence. Metadata may also provide information about speaker recruitment and recording procedures. Forced alignment is done using algorithms from automatic speech recognition (ASR), and is most successful when each phone associated with the word in its dictionary form is actually fully pronounced. One of the easiest methods of manipulating natural speech is the splicing technique, where parts of a speech signal are cut out, repeated, or cross-spliced with another piece of the signal. The gating technique is another form of natural speech signal manipulation often applied in psycholinguistic experiments, where parts of a speech signal are cut off, and incrementally more of the signal is presented to a listener. Another speech signal manipulation is the mixing of two signals.",
keywords = "Automatic speech recognition, Corpus analysis, Gating technique, Lexical properties, Linguistic metadata, Phonology, Speech signal manipulation, Usage frequency",
author = "Jennifer Cole and Hasegawa-Johnson, {Mark Allan} and Dan Loehr and Guilder, {Linda Van} and Henning Reetz and Frisch, {Stefan A.}",
year = "2012",
month = "9",
day = "18",
doi = "10.1093/oxfordhb/9780199575039.013.0017",
language = "English (US)",
isbn = "9780199575039",
booktitle = "The Oxford Handbook of Laboratory Phonology",
publisher = "Oxford University Press",
address = "United States",

}

TY - CHAP

T1 - Corpora, Databases, and Internet Resources

T2 - Corpus Phonology with Speech Resources Using The Internet For Collecting Phonological Data Speech Manipulation, Synthesis, and Automatic Recognition in Laboratory Phonology Phonotactic Patterns in Lexical Corpora

AU - Cole, Jennifer

AU - Hasegawa-Johnson, Mark Allan

AU - Loehr, Dan

AU - Guilder, Linda Van

AU - Reetz, Henning

AU - Frisch, Stefan A.

PY - 2012/9/18

Y1 - 2012/9/18

N2 - This article introduces a wide range of approaches to using large bodies of data for linguistic research. Corpus analysis for phonological research involves the investigation of the phonetic, phonological, and lexical properties of speech for the purpose of understanding the patterns of variation in the phonetic expression of words, and the distributional patterns of sound elements in relation to the linguistic context. A speech corpus provides a basis for investigating variability in phonetic form and also provides a rich resource for studying the relationship between phonological form and other levels of linguistic structure. Linguistic metadata provides information about the speakers, such as sex, age, ethnicity, and region of residence. Metadata may also provide information about speaker recruitment and recording procedures. Forced alignment is done using algorithms from automatic speech recognition (ASR), and is most successful when each phone associated with the word in its dictionary form is actually fully pronounced. One of the easiest methods of manipulating natural speech is the splicing technique, where parts of a speech signal are cut out, repeated, or cross-spliced with another piece of the signal. The gating technique is another form of natural speech signal manipulation often applied in psycholinguistic experiments, where parts of a speech signal are cut off, and incrementally more of the signal is presented to a listener. Another speech signal manipulation is the mixing of two signals.

AB - This article introduces a wide range of approaches to using large bodies of data for linguistic research. Corpus analysis for phonological research involves the investigation of the phonetic, phonological, and lexical properties of speech for the purpose of understanding the patterns of variation in the phonetic expression of words, and the distributional patterns of sound elements in relation to the linguistic context. A speech corpus provides a basis for investigating variability in phonetic form and also provides a rich resource for studying the relationship between phonological form and other levels of linguistic structure. Linguistic metadata provides information about the speakers, such as sex, age, ethnicity, and region of residence. Metadata may also provide information about speaker recruitment and recording procedures. Forced alignment is done using algorithms from automatic speech recognition (ASR), and is most successful when each phone associated with the word in its dictionary form is actually fully pronounced. One of the easiest methods of manipulating natural speech is the splicing technique, where parts of a speech signal are cut out, repeated, or cross-spliced with another piece of the signal. The gating technique is another form of natural speech signal manipulation often applied in psycholinguistic experiments, where parts of a speech signal are cut off, and incrementally more of the signal is presented to a listener. Another speech signal manipulation is the mixing of two signals.

KW - Automatic speech recognition

KW - Corpus analysis

KW - Gating technique

KW - Lexical properties

KW - Linguistic metadata

KW - Phonology

KW - Speech signal manipulation

KW - Usage frequency

UR - http://www.scopus.com/inward/record.url?scp=84925064436&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84925064436&partnerID=8YFLogxK

U2 - 10.1093/oxfordhb/9780199575039.013.0017

DO - 10.1093/oxfordhb/9780199575039.013.0017

M3 - Chapter

SN - 9780199575039

BT - The Oxford Handbook of Laboratory Phonology

PB - Oxford University Press

ER -