TY - JOUR
T1 - On Improving Dynamic State Space Approaches to Articulatory Inversion with MAP-Based Parameter Estimation
AU - özbek, I. Yücel
AU - özbek, I. Yücel
AU - 2-Johnson, Mark
AU - Demirekler, Mübeccel
N1 - Funding Information:
Manuscript received July 15, 2010; revised February 07, 2011; accepted April 28, 2011. Date of publication June 07, 2011; date of current version November 04, 2011. This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK). The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Brian Mak. ˙. Y. Özbek was with the Department of Electrical and Electronics Engineering, Middle East Technical University, 06531 Ankara, Turkey. He is now with the Electrical and Electronics Engineering Department, Atatürk University, 25240 Erzurum, Turkey.
PY - 2012/1
Y1 - 2012/1
N2 - This paper presents a complete framework for articulatory inversion based on jump Markov linear systems (JMLS). In the model, the acoustic measurements and the position of each articulator are considered as observable measurement and continuous-valued hidden state of the system, respectively, and discrete regimes of the system are represented by the use of a discrete-valued hidden modal state. Articulatory inversion based on JMLS involves learning the model parameter set of the system and making inference about the state (position of each articulator) of the system using acoustic measurements. Iterative learning algorithms based on maximum-likelihood (ML) and maximum a posteriori (MAP) criteria are proposed to learn the model parameter set of the JMLS. It is shown that the learning procedure of the JMLS is a generalized version of hidden Markov model (HMM) training when both acoustic and articulatory data are given. In this paper, it is shown that the MAP-based learning algorithm improves modeling performance of the system and gives significantly better results compared to ML. The inference stage of the proposed algorithm is based on an interacting multiple models (IMM) approach, and done online (filtering), and/or offline (smoothing). Formulas are provided for IMM-based JMLS smoothing. It is shown that smoothing significantly improves the performance of articulatory inversion compared to filtering. Several experiments are conducted with the MOCHA database to show the performance of the proposed method. Comparison of the performance of the proposed method with the ones given in the literature shows that the proposed method improves the performance of state space approaches, making state space approaches comparable to the best published results.
AB - This paper presents a complete framework for articulatory inversion based on jump Markov linear systems (JMLS). In the model, the acoustic measurements and the position of each articulator are considered as observable measurement and continuous-valued hidden state of the system, respectively, and discrete regimes of the system are represented by the use of a discrete-valued hidden modal state. Articulatory inversion based on JMLS involves learning the model parameter set of the system and making inference about the state (position of each articulator) of the system using acoustic measurements. Iterative learning algorithms based on maximum-likelihood (ML) and maximum a posteriori (MAP) criteria are proposed to learn the model parameter set of the JMLS. It is shown that the learning procedure of the JMLS is a generalized version of hidden Markov model (HMM) training when both acoustic and articulatory data are given. In this paper, it is shown that the MAP-based learning algorithm improves modeling performance of the system and gives significantly better results compared to ML. The inference stage of the proposed algorithm is based on an interacting multiple models (IMM) approach, and done online (filtering), and/or offline (smoothing). Formulas are provided for IMM-based JMLS smoothing. It is shown that smoothing significantly improves the performance of articulatory inversion compared to filtering. Several experiments are conducted with the MOCHA database to show the performance of the proposed method. Comparison of the performance of the proposed method with the ones given in the literature shows that the proposed method improves the performance of state space approaches, making state space approaches comparable to the best published results.
KW - Acoustic-to-articulatory inversion
KW - interactive mualtiple model (IMM) smoothing
KW - jump Markov linear system (JMLS)
KW - maximum-likelihood (ML) and maximum a posteriori (MAP) learning
UR - http://www.scopus.com/inward/record.url?scp=85008544995&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85008544995&partnerID=8YFLogxK
U2 - 10.1109/TASL.2011.2157496
DO - 10.1109/TASL.2011.2157496
M3 - Article
AN - SCOPUS:85008544995
SN - 1558-7916
VL - 20
SP - 67
EP - 81
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
IS - 1
ER -