Speech enhancement using adaptive thresholding based on gamma distribution of Teager energy operated intrinsic mode functions
Özet
This paper introduces a new speech enhancement algorithm based on the adaptive threshold of intrinsic mode functions (IMFs) of noisy signal frames extracted by empirical mode decomposition. Adaptive threshold values are estimated by using the gamma statistical model of Teager energy operated IMFs of noisy speech and estimated noise based on symmetric Kullback–Leibler divergence. The enhanced speech signal is obtained by a semisoft thresholding function, which is utilized by threshold IMF coefficients of noisy speech. The method is tested on the NOIZEUS speech database and the proposed method is compared with wavelet-shrinkage and EMD-shrinkage methods in terms of segmental SNR improvement (SegSNR), weighted spectral slope (WSS), and perceptual evaluation of speech quality (PESQ). Experimental results show that the proposed method provides a higher SegSNR improvement in dB, lower WSS distance, and higher PESQ scores than wavelet-shrinkage and EMD-shrinkage methods. The proposed method shows better performance than traditional threshold-based speech enhancement approaches from high to low SNR levels. © TÜBİTAK
Cilt
27Sayı
2Koleksiyonlar
İlgili Öğeler
Başlık, yazar, küratör ve konuya göre gösterilen ilgili öğeler.
-
Evaluation of single-channel speech enhancement algorithms by using objective quality and intelligibility measures
Arslan, Özkan; Engin, Erkan Zeki (Institute of Electrical and Electronics Engineers Inc., 2018)In this study, single-channel speech enhancement algorithms were evaluated with objective quality and objective intelligibility measures using Turkish speech database. The clean 30 sentences from the METU database are ... -
Noise robust voice activity detection based on multi-layer feed-forward neural network
Arslan, Özkan; Engin, Erkan Zeki (Istanbul University, 2019)This paper proposes a voice activity detection (VAD) method based on time and spectral domain features using multi-layer feed-forward neural network (MLF-NN) for various noisy conditions. In the proposed method, time ... -
A Novel Voice Activity Detection for Multi-Channel Noise Reduction
Çolak, Ramazan; Akdeniz, Rafet (Ieee-Inst Electrical Electronics Engineers Inc, 2021)In this study, a voice activity detection technique is designed using features such as short-term energy, periodicity and spectral flatness. The desired results are obtained by using these three features, even at low signal ...