Emotion, age, and gender classification in children's speech by humans and machines
dc.authorid | 0000-0002-6073-0393 | |
dc.authorid | 0000-0001-6342-428X | |
dc.authorid | 0000-0003-3424-652X | |
dc.authorid | 0000-0002-1565-6921 | |
dc.authorid | 0000-0001-7947-5508 | |
dc.authorscopusid | 36241785000 | |
dc.authorscopusid | 7006556254 | |
dc.authorscopusid | 57219469958 | |
dc.authorscopusid | 8521676200 | |
dc.authorscopusid | 55873665200 | |
dc.authorscopusid | 24468656100 | |
dc.authorwosid | Lyakso, Elena/H-9904-2013 | |
dc.authorwosid | Salah, Albert Ali/ABH-5561-2020 | |
dc.authorwosid | Salah, Albert Ali/E-5820-2013 | |
dc.authorwosid | Karpov, Alexey A/A-8905-2012 | |
dc.authorwosid | Grigorev, Aleksey/Q-4953-2018 | |
dc.contributor.author | Kaya, Heysem | |
dc.contributor.author | Salah, Albert Ali | |
dc.contributor.author | Karpov, Alexey A. | |
dc.contributor.author | Frolova, Olga | |
dc.contributor.author | Grigorev, Aleksey | |
dc.contributor.author | Lyakso, Elena | |
dc.date.accessioned | 2022-05-11T14:15:50Z | |
dc.date.available | 2022-05-11T14:15:50Z | |
dc.date.issued | 2017 | |
dc.department | Faculties, Çorlu Faculty of Engineering, Department of Computer Engineering | |
dc.description.abstract | In this article, we present the first child emotional speech corpus in Russian, called EmoChildRu, collected from children aged 3 to 7 years. The base corpus includes over 20 K recordings (approx. 30 h), collected from 120 children. Audio recordings were carried out in three controlled settings designed to induce different emotional states in the children: playing with a standard set of toys; repeating words after a toy parrot in a game store setting; and watching a cartoon and retelling the story. The corpus is designed for studying how emotional state is reflected in the characteristics of voice and speech, and for studying the formation of emotional states in ontogenesis. A portion of the corpus is annotated for three emotional states (comfort, discomfort, neutral). Additional data include the results of adult listeners' analysis of child speech, questionnaires, and annotations for gender and age in months. We also provide several baselines, comparing human and machine estimation on this corpus for prediction of age, gender and comfort state. While the acoustics-based automatic systems show higher performance in age estimation, they do not reach human perception levels in comfort state and gender classification. The comparative results indicate the importance and necessity of developing further linguistic models for discrimination. © 2017 Elsevier Ltd. All rights reserved. | |
dc.description.sponsorship | Russian Foundation for Basic Research (RFBR) [10-00-000.24, 15-06-07852, 16-37-60100]; Russian Foundation for Basic Research DHSS [17-06-00503]; Government of Russia [074-U01]; Bogazici University [BAP 16A01P4]; BAGEP Award of the Science Academy; President of Russia [MD-254.2017.8] | |
dc.description.sponsorship | The work was supported by the Russian Foundation for Basic Research (grant nos. 10-00-000.24, 15-06-07852, and 16-37-60100), the Russian Foundation for Basic Research DHSS (grant no. 17-06-00503), a grant of the President of Russia (project no. MD-254.2017.8), the Government of Russia (grant no. 074-U01), Bogazici University (project BAP 16A01P4), and the BAGEP Award of the Science Academy. | |
dc.identifier.doi | 10.1016/j.csl.2017.06.002 | |
dc.identifier.endpage | 283 | |
dc.identifier.issn | 0885-2308 | |
dc.identifier.issn | 1095-8363 | |
dc.identifier.scopus | 2-s2.0-85021761471 | |
dc.identifier.scopusquality | Q2 | |
dc.identifier.startpage | 268 | |
dc.identifier.uri | https://doi.org/10.1016/j.csl.2017.06.002 | |
dc.identifier.uri | https://hdl.handle.net/20.500.11776/6092 | |
dc.identifier.volume | 46 | |
dc.identifier.wos | WOS:000407609600016 | |
dc.identifier.wosquality | Q2 | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.institutionauthor | Kaya, Heysem | |
dc.language.iso | en | |
dc.publisher | Academic Press Ltd- Elsevier Science Ltd | |
dc.relation.ispartof | Computer Speech and Language | |
dc.relation.publicationcategory | Article - International Refereed Journal - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.subject | Emotional child speech | |
dc.subject | Perception experiments | |
dc.subject | Spectrographic analysis | |
dc.subject | Emotional states | |
dc.subject | Age recognition | |
dc.subject | Gender recognition | |
dc.subject | Computational paralinguistics | |
dc.subject | Recognition | |
dc.title | Emotion, age, and gender classification in children's speech by humans and machines | |
dc.type | Article |
Files
Original bundle
- Name: 6092.pdf
- Size: 989.92 KB
- Format: Adobe Portable Document Format
- Description: Full Text