首页> 外文会议>International Conference on Artificial Intelligence and Smart Systems >Speech Enhancement using K-Sparse Autoencoder Techniques
【24h】

Speech Enhancement using K-Sparse Autoencoder Techniques

机译:使用K-Sparse AutoEncoder技术进行语音增强

获取原文

摘要

Speech signals are almost invariably corrupted with either background noise or mixed with other coherent speech. Various techniques are used for speech enhancement like Nonnegative matrix factorization (NMF), Independent component analysis (ICA) etc. One of the techniques is sparse coding and dictionary learning. For this standard algorithmic approaches use iterative techniques like KSVD and Orthogonal Matching Pursuit (OMP) which require significant memory and computation time to process successfully. We, however, use a novel approach of using k-sparse autoencoders which has not been previously used in speech processing. The proposed approach extends k-sparse autoencoders as a denoising autoencoder which allows us to achieve significantly better performance. This research work demonstrate that the use of k-sparse autoencoder has number of advantages especially it does not need any prior knowledge on the statistical characteristics of the noise and it performs much better on signals more heavily corrupted with noise. In addition to standard datasets,it’s superior performance over other dictionary learning techniques are demonstrated on speech signals that are sensed on android phones.
机译:语音信号几乎总是损坏,并且与背景噪音或与其他相干语音混合。各种技术用于语音增强,如非负矩阵分解(NMF),独立分量分析(ICA)等。其中一个技术是稀疏编码和字典学习。对于此标准算法方法,使用迭代技术,如KSVD和正交匹配追求(OMP),这需要重大内存和计算时间来成功处理。然而,我们使用使用以前没有用于语音处理的K-Sparse AutoEndoders的新方法。所提出的方法将K-Sparse AutoEncoders延伸为去噪自身额,使我们能够实现显着更好的性能。这项研究工作表明,使用K-Sparse AutoEncoder的使用数量,特别是它不需要任何关于噪声统计特征的先验知识,并且它对噪音更严重的信号更好地表现更好。除了标准数据集之外,它还在其他字典学习技术上的性能卓越,在Android手机上感应的语音信号上进行了演示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号