TY - JOUR
T1 - Multidirectional leveraging for computational morphology and language documentation and revitalization
AU - Schreiner, Sylvia L.R.
AU - Schwartz, Lane
AU - Hunt, Benjamin
AU - Chen, Emily
N1 - Publisher Copyright:
© 2020
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2020
Y1 - 2020
N2 - St. Lawrence Island Yupik is an endangered language of the Bering Strait region. In this paper, we describe our work on Yupik jointly leveraging computational morphology and linguistic fieldwork, outlining the multilayer virtuous cycle that we continue to refine in our work to document and build tools for the language. After developing a preliminary morphological analyzer from an existing pedagogical grammar of Yupik, we used it to help analyze new word forms gathered through fieldwork. While in the field, we augmented the analyzer to include insights into the lexicon, phonology, and morphology of the language as they were gained during elicitation sessions and subsequent data analysis. The analyzer and other tools we have developed are improved by a corpus that continues to grow through our digitization and documentation efforts, and the computational tools in turn allow us to improve and speed those same efforts. Through this process, we have successfully identified previously undescribed lexical, morphological, and phonological processes in Yupik while simultaneously increasing the coverage of the morphological analyzer. Given the polysynthetic nature of Yupik, a high-coverage morphological analyzer is a necessary prerequisite for the development of other high-level computational tools that have been requested by the Yupik community.
AB - St. Lawrence Island Yupik is an endangered language of the Bering Strait region. In this paper, we describe our work on Yupik jointly leveraging computational morphology and linguistic fieldwork, outlining the multilayer virtuous cycle that we continue to refine in our work to document and build tools for the language. After developing a preliminary morphological analyzer from an existing pedagogical grammar of Yupik, we used it to help analyze new word forms gathered through fieldwork. While in the field, we augmented the analyzer to include insights into the lexicon, phonology, and morphology of the language as they were gained during elicitation sessions and subsequent data analysis. The analyzer and other tools we have developed are improved by a corpus that continues to grow through our digitization and documentation efforts, and the computational tools in turn allow us to improve and speed those same efforts. Through this process, we have successfully identified previously undescribed lexical, morphological, and phonological processes in Yupik while simultaneously increasing the coverage of the morphological analyzer. Given the polysynthetic nature of Yupik, a high-coverage morphological analyzer is a necessary prerequisite for the development of other high-level computational tools that have been requested by the Yupik community.
UR - http://www.scopus.com/inward/record.url?scp=85102349426&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85102349426&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85102349426
SN - 1934-5275
VL - 14
SP - 69
EP - 86
JO - Language Documentation and Conservation
JF - Language Documentation and Conservation
ER -