Abstract
This article describes a method for reducing the error rate of probabilistic phone-based transcriptions resulting from mismatched crowdsourcing by using language-specific constraints to post-process the phone sequence. In the scenario under consideration, there are no native-language transcriptions or pronunciation dictionary available in the test language; instead, available resources include non-native transcriptions, a rudimentary rule-based G2P, and a list of orthographic word forms mined from the internet. The proposed solution post-processes non-native transcriptions by converting them to test-language orthography, composing with testlanguage word forms, then converting back to a phone string. Experiments demonstrate that the phone error rate of the transcription is reduced, using this method, by 22% on an independent evaluation-test dataset.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 30-36 |
| Number of pages | 7 |
| Journal | Procedia Computer Science |
| Volume | 81 |
| DOIs | |
| State | Published - 2016 |
| Event | 5th Workshop on Spoken Language Technologies for Under-resourced languages, SLTU 2016 - Yogyakarta, Indonesia Duration: May 9 2016 → May 12 2016 |
Keywords
- G2P
- automatic speech recognition resources
- mismatched crowdsourcing
- probabilistic transcription
ASJC Scopus subject areas
- General Computer Science
Fingerprint
Dive into the research topics of 'Performance Improvement of Probabilistic Transcriptions with Language-specific Constraints'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS