[PDF] Single Channel Speech Enhancement Based On Deep Neural Networks Download Online

Download Single Channel Speech Enhancement Based On Deep Neural Networks full books in PDF, epub, and Kindle. Read online free Single Channel Speech Enhancement Based On Deep Neural Networks ebook anywhere anytime directly on your device. Fast Download speed and no annoying ads. We cannot guarantee that every ebooks is available!

Single-Channel Speech Enhancement Based on Deep Neural Networks

Author	: Zhiheng Ouyang
Publisher	:
Total Pages	: 0
Release	: 2020
ISBN-10	: OCLC:1337590414
ISBN-13	:
Rating	: 4/5 ( Downloads)

DOWNLOAD EBOOK

Book Synopsis Single-Channel Speech Enhancement Based on Deep Neural Networks by : Zhiheng Ouyang

Download or read book Single-Channel Speech Enhancement Based on Deep Neural Networks written by Zhiheng Ouyang and published by . This book was released on 2020 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Speech enhancement (SE) aims to improve the speech quality of the degraded speech. Recently, researchers have resorted to deep-learning as a primary tool for speech enhancement, which often features deterministic models adopting supervised training. Typically, a neural network is trained as a mapping function to convert some features of noisy speech to certain targets that can be used to reconstruct clean speech. These methods of speech enhancement using neural networks have been focused on the estimation of spectral magnitude of clean speech considering that estimating spectral phase with neural networks is difficult due to the wrapping effect. As an alternative, complex spectrum estimation implicitly resolves the phase estimation problem and has been proven to outperform spectral magnitude estimation. In the first contribution of this thesis, a fully convolutional neural network (FCN) is proposed for complex spectrogram estimation. Stacked frequency-dilated convolution is employed to obtain an exponential growth of the receptive field in frequency domain. The proposed network also features an efficient implementation that requires much fewer parameters as compared with conventional deep neural network (DNN) and convolutional neural network (CNN) while still yielding a comparable performance. Consider that speech enhancement is only useful in noisy conditions, yet conventional SE methods often do not adapt to different noisy conditions. In the second contribution, we proposed a model that provides an automatic "on/off" switch for speech enhancement. It is capable of scaling its computational complexity under different signal-to-noise ratio (SNR) levels by detecting clean or near-clean speech which requires no processing. By adopting information maximizing generative adversarial network (InfoGAN) in a deterministic, supervised manner, we incorporate the functionality of SNR-indicator into the model that adds little additional cost to the system. We evaluate the proposed SE methods with two objectives: speech intelligibility and application to automatic speech recognition (ASR). Experimental results have shown that the CNN-based model is applicable for both objectives while the InfoGAN-based model is more useful in terms of speech intelligibility. The experiments also show that SE for ASR may be more challenging than improving the speech intelligibility, where a series of factors, including training dataset and neural network models, would impact the ASR performance.

Single Channel Speech Enhancement Based On Deep Neural Networks

Single-Channel Speech Enhancement Based on Deep Neural Networks

Single-Channel Speech Enhancement Based on Deep Neural Networks Related Books

Single-Channel Speech Enhancement Based on Deep Neural Networks

Deep Neural Network Approach for Single Channel Speech Enhancement Processing

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments

New Era for Robust Speech Recognition

Audio Source Separation