Gelişmiş Arama

Basit öğe kaydını göster

dc.contributor.authorArslan, Özkan
dc.contributor.authorEngin, Erkan Zeki
dc.date.accessioned2022-05-11T14:03:04Z
dc.date.available2022-05-11T14:03:04Z
dc.date.issued2019
dc.identifier.issn2619-9831
dc.identifier.urihttps://doi.org/10.26650/electrica.2019.18042
dc.identifier.urihttps://hdl.handle.net/20.500.11776/4593
dc.description.abstractThis paper proposes a voice activity detection (VAD) method based on time and spectral domain features using multi-layer feed-forward neural network (MLF-NN) for various noisy conditions. In the proposed method, time features that were short-time energy and zero-crossing rate and spectral features that were entropy, centroid, roll-off, and flux of speech signals were extracted. Clean speech signals were used in training MLF-NN and the network was tested for noisy speech at various noisy conditions. The proposed VAD method was evaluated for six kinds of noises which are white, car, babble, airport, street, and train at four different signal-to-noise ratio (SNR) levels. The proposed method was tested on core TIMIT database and its performance was compared with SOHN, G.729B and Long-Term Spectral Flatness (LSFM) VAD methods in point of correct speech rate, false alarm rate, and overall accuracy rate. Extensive simulation results show that the proposed method gives the most successful average correct speech rate, false alarm rate, and overall accuracy rate in most low and high SNR level conditions for different noise environments. © 2019 Istanbul University. All rights reserved.en_US
dc.language.isoengen_US
dc.publisherIstanbul Universityen_US
dc.identifier.doi10.26650/electrica.2019.18042
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectMulti-layer feed-forward neural networken_US
dc.subjectTime and spectral featuresen_US
dc.subjectVoice activity detectionen_US
dc.subjectErrorsen_US
dc.subjectFeature extractionen_US
dc.subjectFeedforward neural networksen_US
dc.subjectImage resolutionen_US
dc.subjectSignal to noise ratioen_US
dc.subjectSpeechen_US
dc.subjectSpeech communicationen_US
dc.subjectSpeech recognitionen_US
dc.subjectExtensive simulationsen_US
dc.subjectMultilayer feedforward neural networksen_US
dc.subjectNoise environmentsen_US
dc.subjectOverall accuraciesen_US
dc.subjectShort-time energyen_US
dc.subjectSpectral featureen_US
dc.subjectVoice activity detectionen_US
dc.subjectZero crossing rateen_US
dc.subjectMultilayer neural networksen_US
dc.titleNoise robust voice activity detection based on multi-layer feed-forward neural networken_US
dc.typearticleen_US
dc.relation.ispartofElectricaen_US
dc.departmentFakülteler, Çorlu Mühendislik Fakültesi, Elektronik ve Haberleşme Mühendisliği Bölümüen_US
dc.identifier.volume19en_US
dc.identifier.issue2en_US
dc.identifier.startpage91en_US
dc.identifier.endpage100en_US
dc.institutionauthorEngin, Erkan Zeki
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.authorscopusid57203165669
dc.authorscopusid7801396079
dc.identifier.wosWOS:000474421400001en_US
dc.identifier.scopus2-s2.0-85072693867en_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster