Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles

Published in The 14th Joint Conference on Lexical and Computational Semantics

Conferences

Speech Emotion Recognition (SER) is essential for improving human-computer interaction, yet its accuracy remains constrained by the complexity of emotional nuances in speech. In this study, we distinguish between descriptive semantics, which represents the contextual content of speech, and expressive semantics, which reflects the speaker's emotional state. After watching emotionally charged movie segments, we recorded audio clips of participants describing their experiences, along with the intended emotion tags for each clip, participants' self-rated emotional responses, and their valence/arousal scores. Through experiments, we show that descriptive semantics align with intended emotions, while expressive semantics correlate with evoked emotions.

Cited as R. Guo, et al., "Semantic Differentiation in Speech Emotion Recognition: Insights from Descriptive and Expressive Speech Roles," accepted at SEM 2025.

Download Paper