首页> 外文会议>9th International conference on language resources and evaluation >Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution
【24h】

Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution

机译:使用大锤破解坚果?词汇多样性和事件COSEREREDED解决

获取原文

摘要

In this paper we examine the representativeness of the EventCorefBank (ECB) (Bejan and Harabagiu, 2010) with regards to the language population of large-volume streams of news. The ECB corpus is one of the data sets used for evaluation of the task of event coreference resolution. Our analysis shows that the ECB in most cases covers one seminal event per domain, what considerably simplifies event and so language diversity that one comes across in the news. We augmented the corpus with a new corpus component, consisting of 502 texts, describing different instances of event types that were already captured by the 43 topics of the ECB, making it more representative of news articles on the web. The new "ECB+" corpus is available for further research.
机译:在本文中,我们考虑了empercorefbank(欧洲央行)(Bejan和Harabagiu,2010)的代表性,关于大量新闻的语言群体。 ECB语料库是用于评估事件COREREFED分辨率任务的数据集之一。我们的分析表明,欧洲央行在大多数情况下占地一个开创事件,有很大简化的事件,因此语言多样性在新闻中遇到。我们将语料库增强了一个新的语料库组件,由502个文本组成,描述了欧洲央行的43个主题已经捕获的事件类型的不同实例,使其更多代表网络上的新闻文章。新的“欧洲央行+”语料库可用于进一步研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号