Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping

P.H. Yeh; S.L. Yang; C.C. Yang; M.D. Shieh

首页> 外文期刊>Procedia - Social and Behavioral Sciences >Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping

【24h】

Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping

机译：自动识别口吃语音中的重复：使用端点检测和动态时间规整

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This study proposes a methodology for recognizing repetitions in stuttered speech. First, the recorded speech is parameterized by extracting six acoustic features, including volume, zero crossing rate, spectral entropy, high-order derivatives, VH curve, and VE curve. Second, the speech is segmented using the technique of end-point detection according (EPD) to the threshold of VH curve. Third, the features of the segmented speech are processed by dynamic time warping (DTW) to identify similar patterns in neighbouring segments. The proposed method was verified using the artificial stuttering samples of Mandarin Chinese. Ten male subjects were asked to imitate stuttering by speak out 39 predefined repetition settings. These settings are planned by considering three Mandarin Phonetic Symbols ([t], [k], [t‘]) and three kinds of repetitions (part-word repetition, whole-word repetition, multi-syllable word repetition). The experimental results indicate that EPD using VH curve is capable to slice the repetition in artificial stuttered speech. Comparing the results for recognizing the phoneme and single syllable words, there is no significant difference for the threshold of DTW. The performance of DTW in recognizing repetitions had high accuracy of 83%. Therefore, the proposed method combining EPD and DTW is feasible for automatic recognition of repetitions in stuttered speech. However, more real stuttered speech samples are still needed to verify and improve the proposed method.

机译：这项研究提出了一种识别口吃重复的方法。首先，通过提取六个声学特征（包括音量，零交叉率，频谱熵，高阶导数，VH曲线和VE曲线）来对录制的语音进行参数化。其次，使用根据（EPD）的端点检测技术将语音分割为VH曲线的阈值。第三，通过动态时间规整（DTW）处理分段语音的特征，以识别相邻分段中的相似模式。人工口吃的普通话样本验证了该方法的有效性。要求十名男性受试者通过说出39种预定义的重复设置来模仿口吃。通过考虑三个普通话音标（[t]，[k]，[t’]）和三种重复（部分单词重复，全单词重复，多音节单词重复）来计划这些设置。实验结果表明，使用VH曲线的EPD能够对人工口吃语音中的重复进行切片。比较识别音素和单个音节单词的结果，DTW阈值没有显着差异。 DTW在识别重复项方面的性能具有83％的高精度。因此，提出的将EPD和DTW相结合的方法对于自动识别口吃语音中的重复是可行的。但是，仍然需要更多真实的口吃语音样本来验证和改进所提出的方法。

著录项

来源
《Procedia - Social and Behavioral Sciences》 |2015年第2期|共1页
作者
P.H. Yeh; S.L. Yang; C.C. Yang; M.D. Shieh;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类社会科学现状及发展;
关键词

相似文献

外文文献
中文文献
专利

1. Automatic Recognition of Prolongations and Repetitions in Stuttering Speech using ANN [J] . G. Manjula, M. Shiva Kumar, Y. V. Geetha Journal of the Instrument Society of India: Proceedings of the national symposium on instrumentation . 2016,第3期

机译：使用ANN自动识别口吃语音中的延伸和重复
2. An HMM-Like Dynamic Time Warping Scheme for Automatic Speech Recognition [J] . Ing-JrDing, Yen-MingHsu Mathematical Problems in Engineering: Theory, Methods and Applications . 2014,第a期

机译：用于自动语音识别的HMM样动态时间翘曲方案
3. FIELD-PROGRAMMABLE GATE ARRAY IMPLEMENTATION OF THE DYNAMIC TIME WARPING ALGORITHM FOR SPEECH RECOGNITION [J] . John Sahaya Rani Alex, Mitali Bhojwani Asian Journal of Pharmaceutical and Clinical Research . 2017,第13期

机译：语音识别的动态时间规整算法的现场可编程门阵列实现
4. MULTI PATTERN DYNAMIC TIME WARPING FOR AUTOMATIC SPEECH RECOGNITION [C] . Nishanth Ulhas Nair, T. V. Sreenivas IEEE Region 10 Conference . 2008

机译：多模式动态时间翘曲自动语音识别
5. Frequency warping by linear transformation, and vocal tract inversion for speaker normalization in automatic speech recognition. [D] . Panchapagesan, Sankaran. 2008

机译：通过线性变换实现的频率扭曲和声道反转，可在自动语音识别中实现说话人归一化。
6. Development of a Two-Stage Procedure for the Automatic Recognition of Dysfluencies in the Speech of Children Who Stutter: II. ANN Recognition of Repetitions and Prolongations With Supplied Word Segment Markers [O] . Peter Howell, Stevie Sackin, Kazan Glenn -1

机译：自动识别口吃儿童言语中流离失所的两阶段程序的发展：II。具有提供的词段标记的ANN识别重复和延长
7. Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping [O] . Yeh P.H., Yang S.L., Yang C.C., 2015

机译：自动识别口吃语音中的重复：使用端点检测和动态时间规整

Automatic Recognition of Repetitions in Stuttered Speech: Using End-Point Detection and Dynamic Time Warping

摘要

著录项

相似文献

相关主题

期刊订阅