Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

Fedotov, Dmitrii; Kaya, Heysem; Karpov, Alexey

Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

dc.contributor.author	Fedotov, Dmitrii
dc.contributor.author	Kaya, Heysem
dc.contributor.author	Karpov, Alexey
dc.date.accessioned	2024-10-29T17:43:23Z
dc.date.available	2024-10-29T17:43:23Z
dc.date.issued	2018
dc.department	Tekirdağ Namık Kemal Üniversitesi
dc.description	20th International Conference on Speech and Computer, SPECOM 2018 -- 18 September 2018 through 22 September 2018 -- Leipzig -- 218179
dc.description.abstract	Recently, focus of research in the field of affective computing was shifted to spontaneous interactions and time-continuous annotations. Such data enlarge the possibility for real-world emotion recognition in the wild, but also introduce new challenges. Affective computing is a research area, where data collection is not a trivial and cheap task; therefore it would be rational to use all the data available. However, due to the subjective nature of emotions, differences in cultural and linguistic features as well as environmental conditions, combining affective speech data is not a straightforward process. In this paper, we analyze difficulties of automatic emotion recognition in time-continuous, dimensional scenario using data from RECOLA, SEMAINE and CreativeIT databases. We propose to employ a simple but effective strategy called “mixup” to overcome the gap in feature-target and target-target covariance structures across corpora. We showcase the performance of our system in three different cross-corpus experimental setups: single-corpus training, two-corpora training and training on augmented (mixed up) data. Findings show that the prediction behavior of trained models heavily depends on the covariance structure of the training corpus, and mixup is very effective in improving cross-corpus acoustic emotion recognition performance of context dependent LSTM models. © 2018, Springer Nature Switzerland AG.
dc.identifier.doi	10.1007/978-3-319-99579-3_17
dc.identifier.endpage	165
dc.identifier.isbn	978-331999578-6
dc.identifier.issn	0302-9743
dc.identifier.scopus	2-s2.0-85053808792
dc.identifier.scopusquality	Q3
dc.identifier.startpage	155
dc.identifier.uri	https://doi.org/10.1007/978-3-319-99579-3_17
dc.identifier.uri	https://hdl.handle.net/20.500.11776/12309
dc.identifier.volume	11096 LNAI
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Springer Verlag
dc.relation.ispartof	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Cross-corpus emotion recognition
dc.subject	Data augmentation
dc.subject	Time-continuous emotion recognition
dc.title	Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup
dc.type	Conference Object

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

Context Modeling for Cross-Corpus Dimensional Acoustic Emotion Recognition: Challenges and Mixup

Dosyalar

Koleksiyon