Speech Enhancement using K-Sparse Autoencoder Techniques

机译：使用K-Sparse AutoEncoder技术进行语音增强

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech signals are almost invariably corrupted with either background noise or mixed with other coherent speech. Various techniques are used for speech enhancement like Nonnegative matrix factorization (NMF), Independent component analysis (ICA) etc. One of the techniques is sparse coding and dictionary learning. For this standard algorithmic approaches use iterative techniques like KSVD and Orthogonal Matching Pursuit (OMP) which require significant memory and computation time to process successfully. We, however, use a novel approach of using k-sparse autoencoders which has not been previously used in speech processing. The proposed approach extends k-sparse autoencoders as a denoising autoencoder which allows us to achieve significantly better performance. This research work demonstrate that the use of k-sparse autoencoder has number of advantages especially it does not need any prior knowledge on the statistical characteristics of the noise and it performs much better on signals more heavily corrupted with noise. In addition to standard datasets,it’s superior performance over other dictionary learning techniques are demonstrated on speech signals that are sensed on android phones.

机译：语音信号几乎总是损坏，并且与背景噪音或与其他相干语音混合。各种技术用于语音增强，如非负矩阵分解（NMF），独立分量分析（ICA）等。其中一个技术是稀疏编码和字典学习。对于此标准算法方法，使用迭代技术，如KSVD和正交匹配追求（OMP），这需要重大内存和计算时间来成功处理。然而，我们使用使用以前没有用于语音处理的K-Sparse AutoEndoders的新方法。所提出的方法将K-Sparse AutoEncoders延伸为去噪自身额，使我们能够实现显着更好的性能。这项研究工作表明，使用K-Sparse AutoEncoder的使用数量，特别是它不需要任何关于噪声统计特征的先验知识，并且它对噪音更严重的信号更好地表现更好。除了标准数据集之外，它还在其他字典学习技术上的性能卓越，在Android手机上感应的语音信号上进行了演示。

著录项

来源
《International Conference on Artificial Intelligence and Smart Systems》|2021年|518-525|共8页
会议地点
作者
Sujoy Kumar Roy Chowdhury; Amitava Chatterjee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Noise reduction; Matching pursuit algorithms; Machine learning; Speech enhancement; Sparse matrices; Standards; Smart phones;

机译：降噪;匹配追踪算法;机器学习;语音增强;稀疏矩阵;标准;智能手机;

相似文献

外文文献
中文文献
专利

1. Hyperspectral image classification using k-sparse denoising autoencoder and spectral-restricted spatial characteristics [J] . Lan Rushi, Li Zeya, Liu Zhenbing, Applied Soft Computing . 2019,第期

机译：使用k-稀疏的去噪自动化器和光谱限制空间特征的高光谱图像分类
2. A STATISTICAL ANALYSIS ON THE IMPACT OF SPEECH ENHANCEMENT TECHNIQUES ON THE FEATURE VECTORS OF NOISY SPEECH SIGNALS FOR SPEECH RECOGNITION [J] . SWAPNANIL GOGOI, UTPAL BHATTACHARJEE Journal of computer science engineering and information technology research . 2016,第3期

机译：语音增强技术对语音识别中嘈杂语音信号特征向量影响的统计分析
3. A STATISTICAL ANALYSIS ON THE IMPACT OF SPEECH ENHANCEMENT TECHNIQUES ON THE FEATURE VECTORS OF NOISY SPEECH SIGNALS FOR SPEECH RECOGNITION [J] . SWAPNANIL GOGOI, UTPAL BHATTACHARJEE Journal of computer science engineering and information technology research . 2016,第3期

机译：语音增强技术对语音识别中嘈杂语音信号特征向量影响的统计分析
4. Early diagnosis of Alzheimer's disease: A multi-class deep learning framework with modified k-sparse autoencoder classification [C] . Pushkar Bhatkoti, Manoranjan Paul International Conference on Image and Vision Computing New Zealand . 2016

机译：阿尔茨海默氏病的早期诊断：具有改进的k-稀疏自动编码器分类的多类深度学习框架
5. Speech Enhancement Using Speech Synthesis Techniques [D] . Maiti, Soumi. 2021

机译：使用语音合成技术进行语音增强
6. Enhanced protein domain discovery by using language modeling techniques from speech recognition [O] . Lachlan Coin, Alex Bateman, Richard Durbin 2003

机译：通过使用语音识别中的语言建模技术来增强蛋白质结构域发现
7. k-Sparse Autoencoders [O] . Makhzani, Alireza, Frey, Brendan 2014

机译：k-sparse autoencoders

Speech Enhancement using K-Sparse Autoencoder Techniques

摘要

著录项

相似文献

相关主题

期刊订阅