Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorKaya, Heysem
dc.contributor.authorKarpov, Alexey A.
dc.date.accessioned2022-05-11T14:15:49Z
dc.date.available2022-05-11T14:15:49Z
dc.date.issued2016
dc.identifier.isbn978-1-5108-3313-5
dc.identifier.issn2308-457X
dc.identifier.urihttps://doi.org/10.21437/Interspeech.2016-995
dc.identifier.urihttps://hdl.handle.net/20.500.11776/6081
dc.description17th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2016) -- SEP 08-12, 2016 -- San Francisco, CAen_US
dc.description.abstractThe field of Computational Paralinguistics is rapidly growing and is of interest in various application domains ranging from biomedical engineering to forensics. The INTERSPEECH ComParE challenge series has a field-leading role, introducing novel problems with a common benchmark protocol for comparability. In this work, we tackle all three ComParE 2016 Challenge corpora (Native Language, Sincerity and Deception) benefiting from multi-level normalization on features followed by fast and robust kernel learning methods. Moreover, we employ computer vision inspired low level descriptor representation methods such as the Fisher vector encoding. After nonlinear preprocessing, obtained Fisher vectors are kernelized and mapped to target variables by classifiers based on Kernel Extreme Learning Machines and Partial Least Squares regression. We finally combine predictions of models trained on popularly used functional based descriptor encoding (openSMILE features) with those obtained from the Fisher vector encoding. In the preliminary experiments, our approach has significantly outperformed the baseline systems for Native Language and Sincerity sub-challenges both in the development and test sets.en_US
dc.description.sponsorshipapple, amazon alexa, Google, Microsoft, ebay, facebook, YAHOO JAPAN, Baidu Res, IBM Res, CIRRUS LOGIC, DATATANG, NUANCE, Speechocean Ltd, Yandex, Raytheon Technolen_US
dc.description.sponsorshipRussian Foundation for Basic ResearchRussian Foundation for Basic Research (RFBR) [16-37-60100]en_US
dc.description.sponsorshipThis research is financially supported by the Russian Foundation for Basic Research (project Ns 16-37-60100).en_US
dc.language.isoengen_US
dc.publisherIsca-Int Speech Communication Assocen_US
dc.identifier.doi10.21437/Interspeech.2016-995
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectComParEen_US
dc.subjectcomputational paralinguisticsen_US
dc.subjectNative Languageen_US
dc.subjectSincerityen_US
dc.subjectFisher vectoren_US
dc.subjectPLSen_US
dc.subjectELMen_US
dc.subjectExtreme Learning-Machineen_US
dc.subjectEmotionen_US
dc.titleFusing Acoustic Feature Representations for Computational Paralinguistics Tasksen_US
dc.typeproceedingPaperen_US
dc.relation.ispartof17th Annual Conference of the International Speech Communication Association (Interspeech 2016), Vols 1-5: Understanding Speech Processing in Humans and Machinesen_US
dc.departmentFakülteler, Çorlu Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.authorid0000-0003-3424-652X
dc.authorid0000-0001-7947-5508
dc.identifier.startpage2046en_US
dc.identifier.endpage2050en_US
dc.institutionauthorKaya, Heysem
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.authorscopusid36241785000
dc.authorscopusid57219469958
dc.authorwosidKarpov, Alexey A/A-8905-2012
dc.authorwosidKAYA, Heysem/V-4493-2019
dc.identifier.wosWOS:000409394401111en_US
dc.identifier.scopus2-s2.0-84994384423en_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster