Acoustic emotion recognition: A benchmark comparison of performances

Björn Schuller; Bogdan Vlasenko; Florian Eyben; Gerhard Rigoll; Andreas Wendemuth

Conference Proceedings

Acoustic emotion recognition: A benchmark comparison of performances

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009 (2009) 552-557

DOI: 10.1109/ASRU.2009.5372886

231Citations

138Readers

Get full text

Abstract

In the light of the first challenge on emotion recognition from speech we provide the largest-to-date benchmark comparison under equal conditions on nine standard corpora in the field using the two pre-dominant paradigms: modeling on a frame-level by means of Hidden Markov Models and supra-segmental modeling by systematic feature brute-forcing. Investigated corpora are the ABC, AVIC, DES, EMO-DB, eNTERFACE, SAL, SmartKom, SUSAS, and VAM databases. To provide better comparability among sets, we additionally cluster each database's emotions into binary valence and arousal discrimination tasks. In the result large differences are found among corpora that mostly stem from naturalistic emotions and spontaneous speech vs. more prototypical events. Further, supra-segmental modeling proves significantly beneficial on average when several classes are addressed at a time. © 2009 IEEE.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Schuller, B., Vlasenko, B., Eyben, F., Rigoll, G., & Wendemuth, A. (2009). Acoustic emotion recognition: A benchmark comparison of performances. In Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009 (pp. 552–557). https://doi.org/10.1109/ASRU.2009.5372886

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 78

80%

Researcher 12

12%

Professor / Associate Prof. 6

Lecturer / Post doc 2

Readers' Discipline

Computer Science 58

62%

Engineering 27

29%

Psychology 6

Medicine and Dentistry 3

Acoustic emotion recognition: A benchmark comparison of performances

Abstract

References Powered by Scopus

The eNTERFACE'05 Audio-Visual emotion database

Emotional speech: Towards a new generation of databases

The Vera am Mittag German audio-visual emotional speech database

Cited by Powered by Scopus

The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing

Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge

Evaluating deep learning architectures for Speech Emotion Recognition

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline