LSTM based Cross-corpus and Cross-task Acoustic Emotion Recognition
dc.authorid | 0000-0002-5583-0410 | |
dc.authorid | 0000-0003-3424-652X | |
dc.authorid | 0000-0003-4039-1221 | |
dc.authorid | 0000-0001-7947-5508 | |
dc.authorscopusid | 36241785000 | |
dc.authorscopusid | 57195680712 | |
dc.authorscopusid | 57204209887 | |
dc.authorscopusid | 57199057349 | |
dc.authorscopusid | 57204218917 | |
dc.authorscopusid | 57219469958 | |
dc.authorwosid | Fedotov, Dmitrii/AAE-1738-2019 | |
dc.authorwosid | ????????, ??????/L-5818-2016 | |
dc.authorwosid | Karpov, Alexey A/A-8905-2012 | |
dc.contributor.author | Kaya, Heysem | |
dc.contributor.author | Fedotov, Dmitrii | |
dc.contributor.author | Yesilkanat, Ali | |
dc.contributor.author | Verkholyak, Oxana Vladimirovna | |
dc.contributor.author | Zhang, Yang | |
dc.contributor.author | Karpov, Alexey A. | |
dc.date.accessioned | 2022-05-11T14:15:53Z | |
dc.date.available | 2022-05-11T14:15:53Z | |
dc.date.issued | 2018 | |
dc.department | Fakülteler, Çorlu Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | |
dc.description | 19th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2018) -- AUG 02-SEP 06, 2018 -- Hyderabad, INDIA | |
dc.description.abstract | Acoustic emotion recognition is a popular and central research direction in paralinguistic analysis, due its relation to a wide range of affective states/traits and manifold applications. Developing highly generalizable models still remains as a challenge for researchers and engineers, because of multitude of nuisance factors. To assert generalization, deployed models need to handle spontaneous speech recorded under different acoustic conditions compared to the training set. This requires that the models are tested for cross-corpus robustness. In this work, we first investigate the suitability of Long-Short-Term-Memory (LSTM) models trained with time- and space-continuously annotated affective primitives for cross-corpus acoustic emotion recognition. We next employ an effective approach to use the frame level valence and arousal predictions of LSTM models for utterance level affect classification and apply this approach on the ComParE 2018 challenge corpora. The proposed method alone gives motivating results both on development and test set of the Self-Assessed Affect Sub-Challenge. On the development set, the cross-corpus prediction based method gives a boost to performance when fused with top components of the baseline system. Results indicate the suitability of the proposed method for both time-continuous and utterance level cross-corpus acoustic emotion recognition tasks. | |
dc.description.sponsorship | Int Speech Commun Assoc | |
dc.description.sponsorship | Russian Science FoundationRussian Science Foundation (RSF) [18-11-00145]; Huawei Innovation Research ProgramHuawei Technologies [HO2017050001BM] | |
dc.description.sponsorship | The participation in the ComParE 2018 challenge with experiments on USoMS corpus (Section 4) was supported exclusively by the Russian Science Foundation (Project No. 18-11-00145). The rest research was supported by the Huawei Innovation Research Program (Agreement No. HO2017050001BM). | |
dc.identifier.endpage | 525 | |
dc.identifier.isbn | 978-1-5108-7221-9 | |
dc.identifier.issn | 2308-457X | |
dc.identifier.scopus | 2-s2.0-85053749426 | |
dc.identifier.scopusquality | N/A | |
dc.identifier.startpage | 521 | |
dc.identifier.uri | https://hdl.handle.net/20.500.11776/6109 | |
dc.identifier.wos | WOS:000465363900108 | |
dc.identifier.wosquality | N/A | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.institutionauthor | Kaya, Heysem | |
dc.language.iso | en | |
dc.publisher | Isca-Int Speech Communication Assoc | |
dc.relation.ispartof | 19th Annual Conference of the International Speech Communication Association (Interspeech 2018), Vols 1-6: Speech Research For Emerging Markets in Multilingual Societies | |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.subject | speech emotion recognition | |
dc.subject | cross-corpus emotion recognition | |
dc.subject | context modeling | |
dc.subject | LSTM | |
dc.subject | computational paralinguistics | |
dc.title | LSTM based Cross-corpus and Cross-task Acoustic Emotion Recognition | |
dc.type | Conference Object |
Dosyalar
Orijinal paket
1 - 1 / 1
Küçük Resim Yok
- İsim:
- 6109.pdf
- Boyut:
- 283.34 KB
- Biçim:
- Adobe Portable Document Format
- Açıklama:
- Tam Metin / Full Text