Emotion, age, and gender classification in children's speech by humans and machines

Kaya, Heysem; Salah, Albert Ali; Karpov, Alexey A.; Frolova, Olga; Grigorev, Aleksey; Lyakso, Elena

dc.authorid	0000-0002-6073-0393
dc.authorid	0000-0001-6342-428X
dc.authorid	0000-0003-3424-652X
dc.authorid	0000-0001-6342-428X
dc.authorid	0000-0002-1565-6921
dc.authorid	0000-0001-7947-5508
dc.authorscopusid	36241785000
dc.authorscopusid	7006556254
dc.authorscopusid	57219469958
dc.authorscopusid	8521676200
dc.authorscopusid	55873665200
dc.authorscopusid	24468656100
dc.authorwosid	Lyakso, Elena/H-9904-2013
dc.authorwosid	Salah, Albert Ali/ABH-5561-2020
dc.authorwosid	Salah, Albert Ali/E-5820-2013
dc.authorwosid	Karpov, Alexey A/A-8905-2012
dc.authorwosid	Grigorev, Aleksey/Q-4953-2018
dc.contributor.author	Kaya, Heysem
dc.contributor.author	Salah, Albert Ali
dc.contributor.author	Karpov, Alexey A.
dc.contributor.author	Frolova, Olga
dc.contributor.author	Grigorev, Aleksey
dc.contributor.author	Lyakso, Elena
dc.date.accessioned	2022-05-11T14:15:50Z
dc.date.available	2022-05-11T14:15:50Z
dc.date.issued	2017
dc.department	Faculties, Çorlu Faculty of Engineering, Department of Computer Engineering
dc.description.abstract	In this article, we present the first child emotional speech corpus in Russian, called EmoChildRu, collected from children aged 3 to 7 years. The base corpus includes over 20 K recordings (approx. 30 h), collected from 120 children. Audio recordings were made in three controlled settings designed to elicit different emotional states in the children: playing with a standard set of toys; repeating words after a toy parrot in a game store setting; and watching a cartoon and retelling the story. The corpus is designed to study how emotional state is reflected in the characteristics of voice and speech, and to study the formation of emotional states in ontogenesis. A portion of the corpus is annotated for three emotional states (comfort, discomfort, neutral). Additional data include the results of adult listeners' analysis of child speech, questionnaires, and annotation for gender and age in months. We also provide several baselines, comparing human and machine estimation on this corpus for prediction of age, gender and comfort state. While acoustics-based automatic systems show higher performance in age estimation, they do not reach human perception levels in comfort state and gender classification. The comparative results indicate the importance and necessity of developing further linguistic models for discrimination. © 2017 Elsevier Ltd. All rights reserved.
dc.description.sponsorship	Russian Foundation for Basic Research (RFBR) [10-00-000.24, 15-06-07852, 16-37-60100]; Russian Foundation for Basic Research DHSS [17-06-00503]; Government of Russia [074-U01]; Bogazici University [BAP 16A01P4]; BAGEP Award of the Science Academy; President of Russia [MD-254.2017.8]
dc.description.sponsorship	The work was supported by the Russian Foundation for Basic Research (grant nos. 10-00-000.24, 15-06-07852, and 16-37-60100), Russian Foundation for Basic Research DHSS (grant No 17-06-00503), by the grant of the President of Russia (project No MD-254.2017.8), by the Government of Russia (grant No 074-U01), by Bogazici University (project BAP 16A01P4) and by the BAGEP Award of the Science Academy.
dc.identifier.doi	10.1016/j.csl.2017.06.002
dc.identifier.endpage	283
dc.identifier.issn	0885-2308
dc.identifier.issn	1095-8363
dc.identifier.scopus	2-s2.0-85021761471
dc.identifier.scopusquality	Q2
dc.identifier.startpage	268
dc.identifier.uri	https://doi.org/10.1016/j.csl.2017.06.002
dc.identifier.uri	https://hdl.handle.net/20.500.11776/6092
dc.identifier.volume	46
dc.identifier.wos	WOS:000407609600016
dc.identifier.wosquality	Q2
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.institutionauthor	Kaya, Heysem
dc.language.iso	en
dc.publisher	Academic Press Ltd- Elsevier Science Ltd
dc.relation.ispartof	Computer Speech and Language
dc.relation.publicationcategory	Article - International Peer-Reviewed Journal - Institutional Faculty Member	en_US
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Emotional child speech
dc.subject	Perception experiments
dc.subject	Spectrographic analysis
dc.subject	Emotional states
dc.subject	Age recognition
dc.subject	Gender recognition
dc.subject	Computational paralinguistics
dc.subject	Recognition
dc.title	Emotion, age, and gender classification in children's speech by humans and machines
dc.type	Article

Files

Original bundle

Name: 6092.pdf
Size: 989.92 KB
Format: Adobe Portable Document Format
Description: Full Text

Collection

WoS Indexed Publications Collection
Scopus Indexed Publications Collection
Çorlu Faculty of Engineering Collection