【24h】

A Large Speech Database for Brazilian Portuguese Spoken Language Research

机译:用于巴西葡萄牙语口语研究的大型语音数据库

获取原文
获取原文并翻译 | 示例

摘要

Speech recognition systems use statistical methods based algorithms, and therefore need several training samples to perform properly. Consequently such systems require huge databases for training and testing. The development of large speech corpora in Europe and in the USA was possible only with the cooperation among research centers, universities, private companies and the government. In these countries, the availability of such databases provided the resources which were responsible for the great improvement in speech technologies in the last few years. In Brazil, such consortiums are not even mentioned, and the researchers have to work with small, locally developed databases. In this article we report an effort to develop a large speech corpus for Brazilian Portuguese to fill this crucial gap.
机译:语音识别系统使用基于统计方法的算法,因此需要多个训练样本才能正确执行。因此,此类系统需要庞大的数据库来进行培训和测试。只有在研究中心,大学,私人公司和政府之间的合作下,欧洲和美国大型语音语料库的发展才有可能。在这些国家中,此类数据库的可用性提供了资源,这些资源在过去几年中极大地改善了语音技术。在巴西,甚至没有提到这样的财团,研究人员必须使用本地开发的小型数据库。在本文中,我们报告了为巴西葡萄牙语开发一种大型语音语料库以填补这一关键空白的努力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号