An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia

Qin Ying; Wu Yuzhong; Lee Tan; Kong Anthony Pak Hin

首页> 外文期刊>Journal of VLSI signal processing systems for signal, image, and video technology >An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia

【24h】

An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia

机译：对粤语讲话的粤语人的自动演讲评估的端到端方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Conventional automatic assessment of pathological speech usually follows two main steps: (1) extraction of pathology-specific features; (2) classification or regression on extracted features. Given the great variety of speech and language disorders, feature design is never a straightforward task, and yet it is most crucial to the performance of assessment. This paper presents an end-to-end approach to automatic speech assessment for Cantonese-speaking People With Aphasia (PWA). The assessment is formulated as a binary classification task to discriminate PWA with high scores of subjective assessment from those with low scores. The 2-layer Gated Recurrent Unit (GRU) and Convolutional Neural Network (CNN) models are applied to realize the end-to-end mapping from basic speech features to the classification outcome. The pathology-specific features used for assessment are learned implicitly by the neural network model. The Class Activation Mapping (CAM) method is utilized to visualize how the learned features contribute to the assessment result. Experimental results show that the end-to-end approach can achieve comparable performance to the conventional two-step approach in the classification task, and the CNN model is able to learn impairment-related features that are similar to the hand-crafted features. The experimental results also indicate that CNN model performs better than 2-layer GRU model in this specific task.

机译：传统的病理语音自动评估通常遵循两个主要步骤：（1）提取病理学特征; （2）提取特征对分类或回归。鉴于各种各样的言语和语言障碍，功能设计绝不是一项直接的任务，但它对于评估性能至关重要。本文介绍了对粤语人物（PWA）的粤语人的自动演讲评估的端到端方法。评估作为二进制分类任务制定，以区分PWA，以极高分数的高度评估。应用2层门控复发单元（GRU）和卷积神经网络（CNN）模型来实现从基本语音特征到分类结果的端到端映射。用于评估的病理学特定特征由神经网络模型隐含地学习。类激活映射（CAM）方法用于可视化学习功能如何促进评估结果。实验结果表明，端到端方法可以在分类任务中实现与传统的两步方法的相当性能，并且CNN模型能够学习与手工制作功能类似的损伤相关的特征。实验结果还表明CNN模型在该特定任务中执行了比2层GRU模型更好。

著录项

来源
《Journal of VLSI signal processing systems for signal, image, and video technology》 |2020年第8期|819-830|共12页
作者
Qin Ying; Wu Yuzhong; Lee Tan; Kong Anthony Pak Hin;
展开▼
作者单位

Chinese Univ Hong Kong Dept Elect Engn Shatin Hong Kong Peoples R China;

Chinese Univ Hong Kong Dept Elect Engn Shatin Hong Kong Peoples R China;

Chinese Univ Hong Kong Dept Elect Engn Shatin Hong Kong Peoples R China;

Univ Cent Florida Sch Commun Sci & Disorders Orlando FL 32816 USA;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Pathological speech assessment; End-to-end; Aphasia; Cantonese; Deep neural network;

机译：病理言论评估;端到端;失语症;粤语;深神经网络;

相似文献

外文文献
中文文献
专利

1. Automatic Assessment of Speech Impairment in Cantonese-Speaking People with Aphasia [J] . Qin Ying, Lee Tan, Kong Anthony Pak Hin Selected Topics in Signal Processing, IEEE Journal of . 2020,第2期

机译：在粤语讲话中自动评估讲话者的失语症
2. Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L) [J] . Odette Scharenborg, Louis ten Bosch, Lou Boves, The Journal of the Acoustical Society of America . 2003,第6期

机译：桥接自动语音识别和心理语言学：将候选清单扩展到人类语音识别的端到端模型（L）
3. The Aphasia Action, Success, and Knowledge Programme: Results from an Australian Phase I Trial of a Speech-Pathology-Led Intervention for People with Aphasia Early Post Stroke [J] . BrookeRyan, KylaHudson, LindaWorrall, Brain impairment : . 2017,第3期

机译：失语症的行动，成功和知识计划：澳大利亚阶段的结果，我对具有性腺早期卒中的患者的讲话病理学导向干预的结果
4. An End-to-End Approach to Automatic Speech Assessment for People with Aphasia [C] . Ying Qin, Tan Lee, Yuzhong Wu, International Symposium on Chinese Spoken Language Processing . 2018

机译：一种用于失语症患者自动语音评估的端到端方法
5. Towards Automatic Speech-Language Assessment for Aphasia Rehabilitation [D] . Le, Duc. 2017

机译：致力于失语康复的自动语音评估
6. Validated automatic speech biomarkers in primary progressive aphasia [O] . Naomi Nevler, Sharon Ash, David J Irwin, 2019

机译：在原发性进行性失语症中验证的自动语音生物标志物
7. People with aphasia’s perspectives of the therapeutic alliance during speech-language intervention: A Q methodological approach [O] . Michelle Lawton, Gillian Haddock, Paul Conroy, 2019

机译：在语言干预期间，具有阿北景的观点的人的观点：Q方法方法

An End-to-End Approach to Automatic Speech Assessment for Cantonese-speaking People with Aphasia

摘要

著录项

相似文献

相关主题

期刊订阅