TY - GEN
T1 - Development of a visual speech synthesizer via second-order isomorphism
AU - Jiang, Jintao
AU - Aronoff, Justin M.
AU - Bernstein, Lynne E.
PY - 2008
Y1 - 2008
N2 - The goals of this study were to evaluate the synthesis of visible speech that was based on 3-D motion data using second-order isomorphism. To do this, word stimuli were generated for perceptual discrimination and identification tasks. Discrimination trials were based on word-pairs that were predicted to be at four levels of perceptual dissimilarity. Results from the discrimination tasks indicated that visual synthetic speech perception maintained the dissimilarity structure of visual natural speech perception. This study demonstrated that the relatively sparse 3-D representations of face motion could be used to synthesize visual speech that perceptually approximates visual natural speech, suggesting that synthesizer development and psychophysics can benefit mutually when the goals are aligned.
AB - The goals of this study were to evaluate the synthesis of visible speech that was based on 3-D motion data using second-order isomorphism. To do this, word stimuli were generated for perceptual discrimination and identification tasks. Discrimination trials were based on word-pairs that were predicted to be at four levels of perceptual dissimilarity. Results from the discrimination tasks indicated that visual synthetic speech perception maintained the dissimilarity structure of visual natural speech perception. This study demonstrated that the relatively sparse 3-D representations of face motion could be used to synthesize visual speech that perceptually approximates visual natural speech, suggesting that synthesizer development and psychophysics can benefit mutually when the goals are aligned.
KW - Dissimilarity
KW - Second-order isomorphism
KW - Visual speech synthesis
KW - Visual speech perception
UR - http://www.scopus.com/inward/record.url?scp=51449101539&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51449101539&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2008.4518700
DO - 10.1109/ICASSP.2008.4518700
M3 - Conference contribution
AN - SCOPUS:51449101539
SN - 1424414849
SN - 9781424414840
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4677
EP - 4680
BT - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
T2 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
Y2 - 31 March 2008 through 4 April 2008
ER -