Single Channel Speech Enhancement Based On Deep Neural Networks

Download Single Channel Speech Enhancement Based On Deep Neural Networks full books in PDF, epub, and Kindle. Read online free Single Channel Speech Enhancement Based On Deep Neural Networks ebook anywhere anytime directly on your device. Fast Download speed and no annoying ads. We cannot guarantee that every ebooks is available!

Single-Channel Speech Enhancement Based on Deep Neural Networks

Single-Channel Speech Enhancement Based on Deep Neural Networks
Author :
Publisher :
Total Pages : 0
Release :
ISBN-10 : OCLC:1337590414
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Single-Channel Speech Enhancement Based on Deep Neural Networks by : Zhiheng Ouyang

Download or read book Single-Channel Speech Enhancement Based on Deep Neural Networks written by Zhiheng Ouyang and published by . This book was released on 2020 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Speech enhancement (SE) aims to improve the speech quality of the degraded speech. Recently, researchers have resorted to deep-learning as a primary tool for speech enhancement, which often features deterministic models adopting supervised training. Typically, a neural network is trained as a mapping function to convert some features of noisy speech to certain targets that can be used to reconstruct clean speech. These methods of speech enhancement using neural networks have been focused on the estimation of spectral magnitude of clean speech considering that estimating spectral phase with neural networks is difficult due to the wrapping effect. As an alternative, complex spectrum estimation implicitly resolves the phase estimation problem and has been proven to outperform spectral magnitude estimation. In the first contribution of this thesis, a fully convolutional neural network (FCN) is proposed for complex spectrogram estimation. Stacked frequency-dilated convolution is employed to obtain an exponential growth of the receptive field in frequency domain. The proposed network also features an efficient implementation that requires much fewer parameters as compared with conventional deep neural network (DNN) and convolutional neural network (CNN) while still yielding a comparable performance. Consider that speech enhancement is only useful in noisy conditions, yet conventional SE methods often do not adapt to different noisy conditions. In the second contribution, we proposed a model that provides an automatic "on/off" switch for speech enhancement. It is capable of scaling its computational complexity under different signal-to-noise ratio (SNR) levels by detecting clean or near-clean speech which requires no processing. By adopting information maximizing generative adversarial network (InfoGAN) in a deterministic, supervised manner, we incorporate the functionality of SNR-indicator into the model that adds little additional cost to the system. We evaluate the proposed SE methods with two objectives: speech intelligibility and application to automatic speech recognition (ASR). Experimental results have shown that the CNN-based model is applicable for both objectives while the InfoGAN-based model is more useful in terms of speech intelligibility. The experiments also show that SE for ASR may be more challenging than improving the speech intelligibility, where a series of factors, including training dataset and neural network models, would impact the ASR performance.


Single-Channel Speech Enhancement Based on Deep Neural Networks Related Books

Single-Channel Speech Enhancement Based on Deep Neural Networks
Language: en
Pages: 0
Authors: Zhiheng Ouyang
Categories:
Type: BOOK - Published: 2020 - Publisher:

DOWNLOAD EBOOK

Speech enhancement (SE) aims to improve the speech quality of the degraded speech. Recently, researchers have resorted to deep-learning as a primary tool for sp
Deep Neural Network Approach for Single Channel Speech Enhancement Processing
Language: en
Pages:
Authors: Dongfu Li
Categories: University of Ottawa theses
Type: BOOK - Published: 2016 - Publisher:

DOWNLOAD EBOOK

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments
Language: en
Pages: 282
Authors: Xiao-Lei Zhang
Categories: Computers
Type: BOOK - Published: 2024-09-04 - Publisher: Elsevier

DOWNLOAD EBOOK

Speech Signal Processing Based on Deep Learning in Complex Acoustic Environments provides a detailed discussion of deep learning-based robust speech processing
New Era for Robust Speech Recognition
Language: en
Pages: 433
Authors: Shinji Watanabe
Categories: Computers
Type: BOOK - Published: 2017-10-30 - Publisher: Springer

DOWNLOAD EBOOK

This book covers the state-of-the-art in deep neural-network-based methods for noise robustness in distant speech recognition applications. It provides insights
Audio Source Separation
Language: en
Pages: 389
Authors: Shoji Makino
Categories: Technology & Engineering
Type: BOOK - Published: 2018-03-01 - Publisher: Springer

DOWNLOAD EBOOK

This book provides the first comprehensive overview of the fascinating topic of audio source separation based on non-negative matrix factorization, deep neural