Children's emotion recognition in an intelligent tutoring scenario

Research output: Contribution to conferencePaper

Abstract

This paper presents an approach to automatically recognize emotion which children exhibit in an intelligent tutoring system. Emotion recognition can assist the computer agent to adapt its tutorial strategies to improve the efficiency of knowledge transmission. In this study, we detect three emotional classes: confidence, puzzle, and hesitation. Emotion is detected by means of lexical, prosodic, spectral, and syntactic analyses of users' speech. An automatic speech recognition system serves as the fundamental constituent of the system. A robust classification and regression tree (CART) integrates the various information sources together for final decision. The effectiveness of the proposed approach has been tested on data collected by Wizard-of-Oz (WoZ) experiments. Our emotion recognition was speaker-independent, and yielded 91.3% accuracy. The test results showed that the spectral and duration-related prosodic features played very important roles in emotion recognition.

Original languageEnglish (US)
Pages1441-1444
Number of pages4
StatePublished - Jan 1 2004
Event8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: Oct 4 2004Oct 8 2004

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
CountryKorea, Republic of
CityJeju, Jeju Island
Period10/4/0410/8/04

Fingerprint

emotion
scenario
confidence
Emotion
Emotion Recognition
Scenarios
Tutoring
regression
efficiency
experiment
Spectrality
Experiment
Automatic Speech Recognition
Constituent
Transmission of Knowledge
Fundamental
Confidence
Wizard
Hesitation
Syntax

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Zhang, T., Hasegawa-Johnson, M., & Levinson, S. E. (2004). Children's emotion recognition in an intelligent tutoring scenario. 1441-1444. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.

Children's emotion recognition in an intelligent tutoring scenario. / Zhang, Tong; Hasegawa-Johnson, Mark; Levinson, Stephen E.

2004. 1441-1444 Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.

Research output: Contribution to conferencePaper

Zhang, T, Hasegawa-Johnson, M & Levinson, SE 2004, 'Children's emotion recognition in an intelligent tutoring scenario', Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of, 10/4/04 - 10/8/04 pp. 1441-1444.
Zhang T, Hasegawa-Johnson M, Levinson SE. Children's emotion recognition in an intelligent tutoring scenario. 2004. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.
Zhang, Tong ; Hasegawa-Johnson, Mark ; Levinson, Stephen E. / Children's emotion recognition in an intelligent tutoring scenario. Paper presented at 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, Korea, Republic of.4 p.
@conference{efede80ee83649ddb40014e77068b8e6,
title = "Children's emotion recognition in an intelligent tutoring scenario",
abstract = "This paper presents an approach to automatically recognize emotion which children exhibit in an intelligent tutoring system. Emotion recognition can assist the computer agent to adapt its tutorial strategies to improve the efficiency of knowledge transmission. In this study, we detect three emotional classes: confidence, puzzle, and hesitation. Emotion is detected by means of lexical, prosodic, spectral, and syntactic analyses of users' speech. An automatic speech recognition system serves as the fundamental constituent of the system. A robust classification and regression tree (CART) integrates the various information sources together for final decision. The effectiveness of the proposed approach has been tested on data collected by Wizard-of-Oz (WoZ) experiments. Our emotion recognition was speaker-independent, and yielded 91.3{\%} accuracy. The test results showed that the spectral and duration-related prosodic features played very important roles in emotion recognition.",
author = "Tong Zhang and Mark Hasegawa-Johnson and Levinson, {Stephen E.}",
year = "2004",
month = "1",
day = "1",
language = "English (US)",
pages = "1441--1444",
note = "8th International Conference on Spoken Language Processing, ICSLP 2004 ; Conference date: 04-10-2004 Through 08-10-2004",

}

TY - CONF

T1 - Children's emotion recognition in an intelligent tutoring scenario

AU - Zhang, Tong

AU - Hasegawa-Johnson, Mark

AU - Levinson, Stephen E.

PY - 2004/1/1

Y1 - 2004/1/1

N2 - This paper presents an approach to automatically recognize emotion which children exhibit in an intelligent tutoring system. Emotion recognition can assist the computer agent to adapt its tutorial strategies to improve the efficiency of knowledge transmission. In this study, we detect three emotional classes: confidence, puzzle, and hesitation. Emotion is detected by means of lexical, prosodic, spectral, and syntactic analyses of users' speech. An automatic speech recognition system serves as the fundamental constituent of the system. A robust classification and regression tree (CART) integrates the various information sources together for final decision. The effectiveness of the proposed approach has been tested on data collected by Wizard-of-Oz (WoZ) experiments. Our emotion recognition was speaker-independent, and yielded 91.3% accuracy. The test results showed that the spectral and duration-related prosodic features played very important roles in emotion recognition.

AB - This paper presents an approach to automatically recognize emotion which children exhibit in an intelligent tutoring system. Emotion recognition can assist the computer agent to adapt its tutorial strategies to improve the efficiency of knowledge transmission. In this study, we detect three emotional classes: confidence, puzzle, and hesitation. Emotion is detected by means of lexical, prosodic, spectral, and syntactic analyses of users' speech. An automatic speech recognition system serves as the fundamental constituent of the system. A robust classification and regression tree (CART) integrates the various information sources together for final decision. The effectiveness of the proposed approach has been tested on data collected by Wizard-of-Oz (WoZ) experiments. Our emotion recognition was speaker-independent, and yielded 91.3% accuracy. The test results showed that the spectral and duration-related prosodic features played very important roles in emotion recognition.

UR - http://www.scopus.com/inward/record.url?scp=85009096989&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009096989&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85009096989

SP - 1441

EP - 1444

ER -