首页> 外文会议>International conference on computational linguistics >Rapid Development of a Corpus with Discourse Annotations using Two-stage Crowdsourcing
【24h】

Rapid Development of a Corpus with Discourse Annotations using Two-stage Crowdsourcing

机译:使用两阶段众包的带有话语注释的语料库的快速发展

获取原文

摘要

We present a novel approach for rapidly developing a corpus with discourse annotations using crowdsourcing. Although discourse annotations typically require much time and cost owing to their complex nature, we realize discourse annotations in an extremely short time while retaining good quality of the annotations by crowdsourcing two annotation subtasks. In fact, our experiment to create a corpus comprising 30,000 Japanese sentences took less than eight hours to run. Based on this corpus, we also develop a supervised discourse parser and evaluate its performance to verify the usefulness of the acquired corpus.
机译:我们提出了一种使用众包快速开发带有话语注释的语料库的新颖方法。尽管由于其复杂的性质,话语注释通常需要大量时间和成本,但我们可以在极短的时间内实现话语注释,同时通过众包两个注释子任务来保持注释的良好质量。实际上,我们创建包含30,000个日语句子的语料库的实验耗时不到八个小时。基于此语料库,我们还开发了一个监督的语篇解析器,并评估其性能以验证所获取语料库的有用性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号