
A Spanish multispeaker database of esophageal speech



Abstract

A laryngectomee is a person whose larynx has been surgically removed, usually because of laryngeal cancer. After surgery, most laryngectomees are able to speak again using techniques learned with the help of a speech therapist. The resulting speech is termed alaryngeal speech, and esophageal speech (ES) is one of several alaryngeal speech production modes. A considerable amount of research has been dedicated to alaryngeal speech, with aims ranging from helping speech therapists with evaluation and diagnosis to improving its quality and intelligibility using digital signal processing techniques. We present a database of Spanish ES voices, named AhoSLABI, designed to support the development of new assistive technologies for this speech impairment. The database consists primarily of recordings of 31 laryngectomees (27 males and 4 females) pronouncing phonetically balanced sentences. It also includes parallel recordings of the same sentences by 9 healthy speakers (6 males and 3 females) to facilitate speech processing tasks that require small parallel corpora, such as voice conversion or synthetic speech adaptation. Apart from the sentences, the database includes sustained vowels and a small set of isolated words, which can be valuable for research on ES analysis, diagnosis and evaluation. The paper describes the main contents of the database, the recording protocols and procedure, and the labeling process. The main acoustic characteristics of the voices, such as speaking rate and the durations of recordings, phones and silences, are compared with those of a reduced set of healthy voices. In addition, we describe an experiment that uses the database to improve the performance of an ASR system for ES speakers. This new resource will be made available to the scientific community in the hope that it will be used to improve the quality of life of laryngectomees.

Bibliographic record

  • Source
    Computer Speech and Language | 2021, Issue 3 | 101168.1-101168.12 | 12 pages
  • Author affiliations

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain;

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain;

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain; Communications Engineering Department, Faculty of Engineering of Bilbao, University of the Basque Country (UPV/EHU), Spain;

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain;

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain;

    HiTZ Basque Center for Language Technology, University of the Basque Country (UPV/EHU), Bilbao, Spain;

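  • Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification:
  • Keywords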

    Esophageal speech; Voice conversion; Speech databases; Speech intelligibility; Speech analysis;

