Noise robust voice activity detection based on multi-layer feed-forward neural network

Arslan, Özkan; Engin, Erkan Zeki

dc.contributor.author	Arslan, Özkan
dc.contributor.author	Engin, Erkan Zeki
dc.date.accessioned	2022-05-11T14:03:04Z
dc.date.available	2022-05-11T14:03:04Z
dc.date.issued	2019
dc.identifier.issn	2619-9831
dc.identifier.uri	https://doi.org/10.26650/electrica.2019.18042
dc.identifier.uri	https://hdl.handle.net/20.500.11776/4593
dc.description.abstract	This paper proposes a voice activity detection (VAD) method that feeds time-domain and spectral features to a multi-layer feed-forward neural network (MLF-NN) and targets a range of noisy conditions. The time-domain features are short-time energy and zero-crossing rate; the spectral features are the entropy, centroid, roll-off, and flux of the speech signal. The MLF-NN was trained on clean speech and tested on noisy speech. The method was evaluated with six noise types (white, car, babble, airport, street, and train) at four signal-to-noise ratio (SNR) levels. It was tested on the core TIMIT database and compared with the SOHN, G.729B, and Long-Term Spectral Flatness Measure (LSFM) VAD methods in terms of correct speech rate, false alarm rate, and overall accuracy rate. Extensive simulation results show that the proposed method achieves the best average correct speech rate, false alarm rate, and overall accuracy rate in most low- and high-SNR conditions across the noise environments. © 2019 Istanbul University. All rights reserved.	en_US
dc.language.iso	eng	en_US
dc.publisher	Istanbul University	en_US
dc.identifier.doi	10.26650/electrica.2019.18042
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Multi-layer feed-forward neural network	en_US
dc.subject	Time and spectral features	en_US
dc.subject	Voice activity detection	en_US
dc.subject	Errors	en_US
dc.subject	Feature extraction	en_US
dc.subject	Feedforward neural networks	en_US
dc.subject	Image resolution	en_US
dc.subject	Signal to noise ratio	en_US
dc.subject	Speech	en_US
dc.subject	Speech communication	en_US
dc.subject	Speech recognition	en_US
dc.subject	Extensive simulations	en_US
dc.subject	Multilayer feedforward neural networks	en_US
dc.subject	Noise environments	en_US
dc.subject	Overall accuracies	en_US
dc.subject	Short-time energy	en_US
dc.subject	Spectral feature	en_US
dc.subject	Voice activity detection	en_US
dc.subject	Zero crossing rate	en_US
dc.subject	Multilayer neural networks	en_US
dc.title	Noise robust voice activity detection based on multi-layer feed-forward neural network	en_US
dc.type	article	en_US
dc.relation.ispartof	Electrica	en_US
dc.department	Faculties, Çorlu Faculty of Engineering, Department of Electronics and Communication Engineering	en_US
dc.identifier.volume	19	en_US
dc.identifier.issue	2	en_US
dc.identifier.startpage	91	en_US
dc.identifier.endpage	100	en_US
dc.institutionauthor	Engin, Erkan Zeki
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Faculty Member	en_US
dc.authorscopusid	57203165669
dc.authorscopusid	7801396079
dc.identifier.wos	WOS:000474421400001	en_US
dc.identifier.scopus	2-s2.0-85072693867	en_US
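
The abstract names two time-domain features (short-time energy, zero-crossing rate), four spectral features (entropy, centroid, roll-off, flux), and three evaluation measures. The Python sketch below shows one common way to compute them per frame with NumPy. It is not the authors' implementation: the frame length, hop size, Hamming window, 85% roll-off threshold, the function names `frame_features` and `vad_rates`, and the exact definitions of the three rates are all assumptions made for illustration, since the record does not specify them.

```python
import numpy as np


def frame_features(x, sr, frame_len=400, hop=160, rolloff_pct=0.85):
    """Return an (n_frames, 6) array with columns:
    energy, zero-crossing rate, spectral entropy, centroid, roll-off, flux.
    All parameter defaults are illustrative assumptions."""
    x = np.asarray(x, dtype=float)
    if len(x) < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_frames = 1 + (len(x) - frame_len) // hop
    window = np.hamming(frame_len)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sr)
    eps = 1e-12  # guards against division by zero and log(0)
    feats = np.zeros((n_frames, 6))
    prev_norm = None
    for i in range(n_frames):
        frame = x[i * hop : i * hop + frame_len]
        # Time-domain features.
        energy = np.sum(frame ** 2) / frame_len
        zcr = np.mean(np.abs(np.diff(np.sign(frame))) > 0)
        # Spectral features from the windowed magnitude spectrum.
        mag = np.abs(np.fft.rfft(frame * window))
        power = mag ** 2
        p = power / (power.sum() + eps)
        entropy = -np.sum(p * np.log2(p + eps))
        centroid = np.sum(freqs * mag) / (mag.sum() + eps)
        cum = np.cumsum(power)
        idx = min(np.searchsorted(cum, rolloff_pct * cum[-1]), len(freqs) - 1)
        rolloff = freqs[idx]
        # Flux: squared change of the normalized spectrum between frames.
        norm = mag / (mag.sum() + eps)
        flux = 0.0 if prev_norm is None else np.sum((norm - prev_norm) ** 2)
        prev_norm = norm
        feats[i] = (energy, zcr, entropy, centroid, rolloff, flux)
    return feats


def vad_rates(reference, decision):
    """Frame-level rates under assumed definitions:
    correct speech rate = detected speech frames / all speech frames,
    false alarm rate    = non-speech frames flagged as speech / all non-speech,
    overall accuracy    = frames labelled correctly / all frames."""
    ref = np.asarray(reference, dtype=bool)
    dec = np.asarray(decision, dtype=bool)
    correct_speech = float(np.mean(dec[ref]))
    false_alarm = float(np.mean(dec[~ref]))
    accuracy = float(np.mean(dec == ref))
    return correct_speech, false_alarm, accuracy
```

In a pipeline of the kind the abstract describes, the six-column feature matrix would be the input to the neural network, and the network's frame decisions would be scored against reference labels with a function like `vad_rates`.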

Files in this item:

Name: 4593.pdf
Size: 940.2Kb
Format: PDF
Description: Full Text

This item appears in the following collection(s).

Scopus Indexed Publications Collection [4328]
TR-Dizin Indexed Publications Collection [2795]
WoS Indexed Publications Collection [4789]
Çorlu Faculty of Engineering Collection [990]

Related Items

Gender Determination from Pictures with CNN Models

Estimation of power output and thermodynamic analysis of standard and finned photovoltaic panels

An approach for the prioritization of wind energy production farms