Hierarchical Two-Level Modelling of Emotional States in Spoken Dialog Systems

Verkholyak, Oxana Vladimirovna; Fedotov, Dmitrii; Kaya, Heysem; Zhang, Yang; Karpov, Alexey A.

Hierarchical Two-Level Modelling of Emotional States in Spoken Dialog Systems

dc.authorid	0000-0003-3424-652X
dc.authorid	0000-0002-5583-0410
dc.authorid	0000-0001-7947-5508
dc.authorwosid	Karpov, Alexey A/A-8905-2012
dc.authorwosid	Fedotov, Dmitrii/AAE-1738-2019
dc.authorwosid	????????, ??????/L-5818-2016
dc.contributor.author	Verkholyak, Oxana Vladimirovna
dc.contributor.author	Fedotov, Dmitrii
dc.contributor.author	Kaya, Heysem
dc.contributor.author	Zhang, Yang
dc.contributor.author	Karpov, Alexey A.
dc.date.accessioned	2022-05-11T14:15:55Z
dc.date.available	2022-05-11T14:15:55Z
dc.date.issued	2019
dc.department	Fakülteler, Çorlu Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description	44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) -- MAY 12-17, 2019 -- Brighton, ENGLAND
dc.description.abstract	Emotions occur in complex social interactions, and thus processing of isolated utterances may not be sufficient to grasp the nature of underlying emotional states. Dialog speech provides useful information about context that explains nuances of emotions and their transitions. Context can be defined on different levels; this paper proposes a hierarchical context modelling approach based on RNN-LSTM architecture, which models acoustical context on the frame level and partner's emotional context on the dialog level. The method is proved effective together with cross-corpus training setup and domain adaptation technique in a set of speaker independent cross-validation experiments on IEMOCAP corpus for three levels of activation and valence classification. As a result, the state-of-the-art on this corpus is advanced for both dimensions using only acoustic modality.
dc.description.sponsorship	Inst Elect & Elect Engineers, Inst Elect & Elect Engineers Signal Proc Soc
dc.description.sponsorship	Russian Science FoundationRussian Science Foundation (RSF) [18-11-00145]; Huawei Innovation Research ProgramHuawei Technologies
dc.description.sponsorship	The study is supported by the Russian Science Foundation (project No. 18-11-00145) and Huawei Innovation Research Program.
dc.identifier.endpage	6704
dc.identifier.isbn	978-1-4799-8131-1
dc.identifier.issn	1520-6149
dc.identifier.scopus	2-s2.0-85068970731
dc.identifier.scopusquality	N/A
dc.identifier.startpage	6700
dc.identifier.uri	https://hdl.handle.net/20.500.11776/6119
dc.identifier.wos	WOS:000482554006186
dc.identifier.wosquality	N/A
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.institutionauthor	Kaya, Heysem
dc.language.iso	en
dc.publisher	IEEE
dc.relation.ispartof	2019 Ieee International Conference on Acoustics, Speech and Signal Processing (Icassp)
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Emotion recognition
dc.subject	cross-corpus
dc.subject	context modelling
dc.subject	dialog systems
dc.subject	LSTM
dc.subject	Cross-Corpus
dc.subject	Recognition
dc.title	Hierarchical Two-Level Modelling of Emotional States in Spoken Dialog Systems
dc.type	Conference Object

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: 6119,.pdf
Boyut:: 195.6 KB
Biçim:: Adobe Portable Document Format
Açıklama:: Tam Metin / Full Text

İndir

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu
Çorlu Mühendislik Fakültesi Koleksiyonu