首页> 外文会议> >Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives
【24h】

Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives

机译:自发性言语中的语言和非语言发声的区别:企业内部和企业之间的观点

获取原文

摘要

We present a large-scale study on classification of linguistic and non-linguistic vocalizations including laughter, vocal noise, hesitation and consent on four corpora amounting to 46 h of spontaneous conversational speech. We consider training and testing on speaker-independent subsets of single corpora (intra-corpus) as well as inter-corpus experiments where models built on one or more corpora are evaluated on a disjoint corpus. Our results reveal that while inter-corpus performance is consider ably lower than comparable intra-corpus results, this effect can be countered by data agglomeration; furthermore, we observe that inter-corpus classification accuracies indicate suitability of corpora for building generalizing models.
机译:我们对语言和非语言发声的分类进行了大规模研究,包括对四个语料的笑声,发声噪声,犹豫和同意,总计46小时的自发对话语音。我们考虑对单个语料库(语料库)中与说话者无关的子集进行训练和测试,以及考虑在不连续的语料库上评估基于一个或多个语料库建立的模型的语料间实验。我们的结果表明,尽管语料库间的绩效被认为比同等的语料库内绩效低得多,但这种影响可以通过数据集聚来抵消。此外,我们观察到语料库间分类的准确性表明语料库对于构建泛化模型的适用性。

著录项

  • 来源
    《》|2012年|102-105|共4页
  • 会议地点
  • 作者单位
  • 会议组织
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号