【24h】

Large Broadcast News and Read Speech Corpora of Spoken Czech

机译:大型广播新闻和口语捷克语朗读语料库

获取原文
获取原文并翻译 | 示例

摘要

This paper presents the first annotated and phonetically transcribed large speech corpora developed for spoken Czech. All corpora were collected during the last two years at the Department of Cybernetics, University of West Bohemia (UWB) in Pilsen. The first two collections are broadcast news, the third corpus is a high-quality read-speech database. This paper describes the collection conditions, annotation and phonetic transcription process related to each corpus. The basic phonetic and lexical characteristics of all corpora will be given and compared mutually.
机译:本文介绍了为捷克捷克语开发的第一个带注释和语音转录的大型语音语料库。在最近两年中,所有语料库都在比尔森的西波西米亚大学(UWB)的控制论系收集。前两个集合是广播新闻,第三个语料库是高质量的语音朗读数据库。本文介绍了与每个语料库相关的收集条件,注释和语音转录过程。所有语料库的基本语音和词汇特征都将给出并相互比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号