Multimodal Fusion of Audio, Scene, and Face Features for First Impression Estimation

dc.authorid: 0000-0001-6342-428X
dc.authorid: 0000-0001-7947-5508
dc.authorwosid: Salah, Albert Ali/ABH-5561-2020
dc.authorwosid: Salah, Albert Ali/E-5820-2013
dc.contributor.author: Gürpınar, Furkan
dc.contributor.author: Kaya, Heysem
dc.contributor.author: Salah, Albert Ali
dc.date.accessioned: 2022-05-11T14:15:49Z
dc.date.available: 2022-05-11T14:15:49Z
dc.date.issued: 2016
dc.department: Faculties, Çorlu Faculty of Engineering, Department of Computer Engineering
dc.description: 23rd International Conference on Pattern Recognition (ICPR) -- DEC 04-08, 2016 -- Mexican Assoc Comp Vis Robot & Neural Comp, Cancun, MEXICO
dc.description.abstract: Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay of emotion and personality shows itself in the first impression left on other people. Moreover, ambient information, e.g. the environment and objects surrounding the subject, also affects these impressions. In this work, we employ pre-trained Deep Convolutional Neural Networks to extract facial emotion and ambient information from images for predicting apparent personality. We also investigate the Local Gabor Binary Patterns from Three Orthogonal Planes video descriptor and acoustic features extracted via the widely used openSMILE tool. We subsequently propose classifying features using a Kernel Extreme Learning Machine and fusing their predictions. The proposed system is applied to the ChaLearn Challenge on First Impression Recognition, achieving the winning test set accuracy of 0.913, averaged over the Big Five personality traits.
dc.description.sponsorship: Int Assoc Pattern Recognit, Int Conf Pattern Recognit, Org Comm, Elsevier, IBM Res, INTEL, CONACYT
dc.identifier.endpage: 48
dc.identifier.isbn: 978-1-5090-4847-2
dc.identifier.issn: 1051-4651
dc.identifier.scopus: 2-s2.0-85019096151
dc.identifier.scopusquality: N/A
dc.identifier.startpage: 43
dc.identifier.uri: https://hdl.handle.net/20.500.11776/6083
dc.identifier.wos: WOS:000406771300008
dc.identifier.wosquality: N/A
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: Scopus
dc.institutionauthor: Kaya, Heysem
dc.language.iso: en
dc.publisher: IEEE Computer Soc
dc.relation.ispartof: 2016 23rd International Conference on Pattern Recognition (ICPR)
dc.relation.publicationcategory: Conference Item - International - Institutional Faculty Member
dc.rights: info:eu-repo/semantics/closedAccess
dc.subject: Extreme Learning-Machine
dc.title: Multimodal Fusion of Audio, Scene, and Face Features for First Impression Estimation
dc.type: Conference Object
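
The abstract describes training a Kernel Extreme Learning Machine on each feature set and fusing the resulting predictions. The sketch below illustrates that general idea only; it is not the authors' implementation. The RBF kernel, the values of `C` and `gamma`, the equal fusion weights, the treatment of trait scores as continuous targets, and the random stand-in features are all assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A ** 2, axis=1)[:, None]
        + np.sum(B ** 2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


class KernelELM:
    """Kernel ELM: solve (I / C + K) beta = T, then predict k(x, X_train) @ beta."""

    def __init__(self, C=10.0, gamma=1.0):
        self.C = C          # regularization strength (assumed value)
        self.gamma = gamma  # RBF width (assumed value)

    def fit(self, X, T):
        self.X_train = np.asarray(X, dtype=float)
        K = rbf_kernel(self.X_train, self.X_train, self.gamma)
        n = K.shape[0]
        self.beta = np.linalg.solve(np.eye(n) / self.C + K, np.asarray(T, dtype=float))
        return self

    def predict(self, X):
        K_test = rbf_kernel(np.asarray(X, dtype=float), self.X_train, self.gamma)
        return K_test @ self.beta


def fuse_predictions(predictions, weights):
    """Score-level fusion: weighted average of per-modality prediction arrays."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wi * p for wi, p in zip(w, predictions))


# Toy data: random stand-ins for face, scene, and audio features (hypothetical).
rng = np.random.default_rng(0)
n_train, n_test, n_traits = 40, 10, 5
targets = rng.uniform(0.0, 1.0, size=(n_train + n_test, n_traits))
modalities = [
    targets @ rng.normal(size=(n_traits, d))
    + 0.1 * rng.normal(size=(n_train + n_test, d))
    for d in (8, 6, 4)
]

# One Kernel ELM per modality, then fuse the test-set predictions.
per_modality = [
    KernelELM(C=10.0, gamma=0.05).fit(F[:n_train], targets[:n_train]).predict(F[n_train:])
    for F in modalities
]
fused = fuse_predictions(per_modality, weights=[1.0, 1.0, 1.0])
print(fused.shape)
```

In practice the fusion weights and the kernel hyperparameters would be chosen on a validation set; the fixed values here only keep the sketch short.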

Files

Original bundle
Name: 6083.pdf
Size: 595.85 KB
Format: Adobe Portable Document Format
Description: Full Text