TY - GEN
T1 - The Markov selection model for concurrent speech recognition
AU - Smaragdis, Paris
AU - Raj, Bhiksha
N1 - Copyright:
Copyright 2010 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - In this paper we introduce a new Markov model that is capable of recognizing speech from recordings of simultaneously speaking a priori known speakers. This work is based on recent work on non-negative representations of spectrograms, which has been shown to be very effective in source separation problems. In this paper we extend these approaches to design a Markov selection model that is able to recognize sequences even when they are presented mixed together. We do so without the need to perform separation on the signals. Unlike factorial Markov models which have been used similarly in the past, this approach features a low computational complexity in the number of sources and Markov states, which makes it a highly efficient alternative. We demonstrate the use of this framework in recognizing speech from mixtures of known speakers.
AB - In this paper we introduce a new Markov model that is capable of recognizing speech from recordings of simultaneously speaking a priori known speakers. This work is based on recent work on non-negative representations of spectrograms, which has been shown to be very effective in source separation problems. In this paper we extend these approaches to design a Markov selection model that is able to recognize sequences even when they are presented mixed together. We do so without the need to perform separation on the signals. Unlike factorial Markov models which have been used similarly in the past, this approach features a low computational complexity in the number of sources and Markov states, which makes it a highly efficient alternative. We demonstrate the use of this framework in recognizing speech from mixtures of known speakers.
UR - http://www.scopus.com/inward/record.url?scp=78449279663&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78449279663&partnerID=8YFLogxK
U2 - 10.1109/MLSP.2010.5588124
DO - 10.1109/MLSP.2010.5588124
M3 - Conference contribution
AN - SCOPUS:78449279663
SN - 9781424478774
T3 - Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2010
SP - 214
EP - 219
BT - Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2010
T2 - 2010 IEEE 20th International Workshop on Machine Learning for Signal Processing, MLSP 2010
Y2 - 29 August 2010 through 1 September 2010
ER -