Preprocessing for elderly speech recognition of smart devices

Soonil Kwon; Sung-Jae Kim; Joon Yeon Choeh

首页> 外文期刊>Computer speech and language >Preprocessing for elderly speech recognition of smart devices

【24h】

Preprocessing for elderly speech recognition of smart devices

机译：智能设备老年人语音识别的预处理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Due to the increasing aging population in modern society and to the proliferation of smart devices, there is a need to enhance speech recognition among smart devices in order to make information easily accessible to the elderly as it is to the younger population. In general, speech recognition systems are optimized to an average adult's voice and tend to exhibit a lower accuracy rate when recognizing an elderly person's voice, due to the effects of speech articulation and speaking style. Additional costs are bound to be incurred when adding modifications to current speech recognitions systems for better speech recognition among elderly users. Thus, using a preprocessing application on a smart device can not only deliver better speech recognition but also substantially reduce any added costs. Audio samples of 50 words uttered by 80 elderly and young adults were collected and comparatively analyzed. The speech patterns of the elderly have a slower speech rate with longer inter-syllabic silence length and slightly lower speech intelligibility. The speech recognition rate for elderly adults could be improved by means of increasing the speech rate, adding a 1.5% increase in accuracy, eliminating silence periods, adding another 4.2% increase in accuracy, and boosting the energy of the formant frequency bands for a 6% boost in accuracy. After all the preprocessing, a 12% increase in the accuracy of elderly speech recognition was achieved. Through this study, we show that speech recognition of elderly voices can be improved through modifying specific aspects of differences in speech articulation and speaking style. In the future, we will conduct studies on methods that can precisely measure and adjust speech rate and find additional factors that impact intelligibility.

机译：由于现代社会中老龄化人口的增加以及智能设备的普及，需要增强智能设备之间的语音识别，以使老年人和年轻人一样容易获得信息。通常，由于语音清晰度和说话风格的影响，语音识别系统针对普通成年人的语音进行了优化，并且在识别老年人的语音时倾向于表现出较低的准确率。当增加对当前语音识别系统的修改以在老年人中更好地识别语音时，必然会产生额外的费用。因此，在智能设备上使用预处理应用程序不仅可以提供更好的语音识别，还可以大大降低任何增加的成本。收集并比较了80位老年人和年轻人发出的50个单词的音频样本。老年人的言语模式语速较慢，音节间的沉默时间较长，言语清晰度较低。老年人的语音识别率可以通过提高语音速率，增加1.5％的准确度，消除静音期，增加4.2％的准确度以及增加共振峰频段的能量来提高（6）％提高准确性。经过所有的预处理，老年人语音识别的准确性提高了12％。通过这项研究，我们表明，通过修改语音清晰度和说话风格差异的特定方面，可以改善老年人语音的语音识别。将来，我们将研究可精确测量和调整语速并发现影响清晰度的其他因素的方法。

著录项

来源
《Computer speech and language》 |2016年第3期|110-121|共12页
作者
Soonil Kwon; Sung-Jae Kim; Joon Yeon Choeh;
展开▼
作者单位

Interaction Technology Laboratory, Department of Digital Contents, Sejong University, 98 Gunja-Dong, Gwangjin-Gu, Seoul 143-747, Republic of Korea;

Interaction Technology Laboratory, Department of Digital Contents, Sejong University, 98 Gunja-Dong, Gwangjin-Gu, Seoul 143-747, Republic of Korea;

Interaction Technology Laboratory, Department of Digital Contents, Sejong University, 98 Gunja-Dong, Gwangjin-Gu, Seoul 143-747, Republic of Korea;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Elderly voice interface; Speech recognition; Aging society;

机译：老人语音接口;语音识别;老龄化社会;

相似文献

外文文献
中文文献
专利

1. Noise Cancellation Based on Voice Activity Detection Using Spectral Variation for Speech Recognition in Smart Home Devices [J] . Park Jeong-Sik, Kim Seok-Hoon Intelligent automation and soft computing . 2020,第1期

机译：基于频谱变化的语音活动检测的噪声消除，用于智能家居设备中的语音识别
2. Privacy-Preserving Outsourced Speech Recognition for Smart IoT Devices [J] . Ma Zhuo, Liu Yang, Liu Ximeng, Internet of Things Journal, IEEE . 2019,第5期

机译：智能物联网设备的隐私保护外包语音识别
3. An Overview of Basics Speech Recognition and Autonomous Approach for Smart Home IOT Low Power Devices [J] . Jean-Yves Fourniols, Nadim Nasreddine, Christophe Escriba, Journal of Signal and Information Processing . 2018,第4期

机译：智能家居物联网低功耗设备的语音识别和自主方法基础概述
4. Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition [C] . Meiko Fukuda, Hiromitsu Nishizaki, Yurie Iribe, International Conference on Language Resources and Evaluation . 2020

机译：改善老年人的演讲识别：老年日语语音和语音识别声学建模的调查
5. Noise-Robust Speech Source Localization and Tracking Using Microphone Arrays for Smartphone-Assisted Hearing Aid Devices [D] . Ganguly, Anshuman. 2018

机译：智能手机辅助助听器设备的麦克风阵列噪声鲁棒语音源定位和跟踪
6. Speech Perception for Adult Cochlear Implant Recipients in a Realistic Background Noise: Effectiveness of Preprocessing Strategies and External Options for Improving Speech Recognition in Noise [O] . René H. Gifford, Lawrence J. Revit -1

机译：成人耳蜗植入者在现实背景噪声中的言语感知：预处理策略和外部选择改善噪声语音识别的有效性
7. Speech Rate Control for Improving Elderly Speech Recognition of Smart Devices [O] . SON, G., KWON, S., LIM, Y. 2017

机译：提高智能设备老年人语音识别的语音速率控制

Preprocessing for elderly speech recognition of smart devices

摘要

著录项

相似文献

相关主题

期刊订阅