Continuous Speech Recognition With Sparse Coding

W.J. Smit; E. Barnard

首页> 外文期刊>Computer speech and language >Continuous Speech Recognition With Sparse Coding

【24h】

Continuous Speech Recognition With Sparse Coding

机译：稀疏编码的连续语音识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sparse coding is an efficient way of coding information. In a sparse code most of the code elements are zero; very few are active. Sparse codes are intended to correspond to the spike trains with which biological neurons communicate. In this article, we show how sparse codes can be used to do continuous speech recognition. We use the TIDIGITS dataset to illustrate the process. First a waveform is transformed into a spectrogram, and a sparse code' for the spectrogram is found by means of a linear generative model. The spike train is classified by making use of a spike train model and dynamic programming. It is computationally expensive to find a sparse code. We use an iterative subset selection algorithm with quadratic programming for this process. This algorithm finds a sparse code in reasonable time if the input is limited to a fairly coarse spectral resolution. At this resolution, our system achieves a word error rate of 19%, whereas a system based on Hidden Markov Models achieves a word error rate of 15% at the same resolution.

机译：稀疏编码是编码信息的有效方法。在稀疏代码中，大多数代码元素为零；很少有人活跃。稀疏代码旨在对应于与生物神经元进行通信的尖峰序列。在本文中，我们展示了稀疏代码如何用于进行连续语音识别。我们使用TIDIGITS数据集来说明该过程。首先，将波形转换成频谱图，然后通过线性生成模型找到频谱图的稀疏代码。通过使用尖峰序列模型和动态编程对尖峰序列进行分类。找到稀疏代码在计算上是昂贵的。对于此过程，我们使用带有二次编程的迭代子集选择算法。如果输入被限制在相当粗糙的光谱分辨率下，该算法会在合理的时间内找到稀疏代码。在此分辨率下，我们的系统可实现19％的单词错误率，而基于隐马尔可夫模型的系统在相同分辨率下可实现15％的单词错误率。

著录项

来源
《Computer speech and language》 |2009年第2期|200-219|共20页
作者
W.J. Smit; E. Barnard;
展开▼
作者单位

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
sparse coding; spike train; speech recognition; linear generative model;

机译：稀疏编码峰值序列语音识别线性生成模型;

相似文献

外文文献
中文文献
专利

1. Discriminative feature extraction for speech recognition using continuous output codes [J] . Omid Dehzangi, Bin Ma, Eng Siong Chng, Pattern recognition letters . 2012,第13期

机译：使用连续输出码进行语音识别的鉴别特征提取
2. A novel speech emotion recognition algorithm based on wavelet kernel sparse classifier in stacked deep auto-encoder model [J] . Wei Pengcheng, Zhao Yu Personal and Ubiquitous Computing . 2019,第3a4期

机译：堆叠深度自动编码器模型中基于小波核稀疏分类器的语音情感识别新算法
3. A novel speech emotion recognition algorithm based on wavelet kernel sparse classifier in stacked deep auto-encoder model [J] . Wei Pengcheng, Zhao Yu Personal and Ubiquitous Computing . 2019,第3a4期

机译：一种基于小波核稀疏分类器的新型语音情感识别算法，堆叠深自动编码器模型
4. Enhancing Large Vocabulary Continuous Speech Recognition System for Urdu-English Conversational Code-Switched Speech [C] . Muhammad Umar Farooq, Farah Adeeba, Sarmad Hussain, Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques . 2020

机译：增强Urdu-English会话代码切换语音的大型词汇连续语音识别系统
5. Objective speech intelligibility assessment using speech recognition and bigram statistics with application to low bit-rate codec evaluation [D] . Teng, Yan 2006

机译：使用语音识别和双字母组统计的客观语音清晰度评估及其在低比特率编解码器评估中的应用
6. Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus [O] . Nicole L. Carlson, Vivienne L. Ming, Michael Robert DeWeese 2012

机译：语音的稀疏代码预测下眼囊的光谱时域接受场
7. Sparse coding of the modulation spectrum for noise-robust automatic speech recognition [O] . Sara Ahmadi, Seyed Mohammad Ahadi, Bert Cranen, 2014

机译：调制频谱的稀疏编码，用于噪声稳定的自动语音识别

Continuous Speech Recognition With Sparse Coding

摘要

著录项

相似文献

相关主题

期刊订阅