Exploring the utility of coreference chains for improved identification of personal names

Abstract

Identifying the real-world entity that a proper name refers to is an important task in many NLP applications. Context plays an important role in disambiguating entities that share a name. In this paper, we discuss a dataset and experimental set-up that allow us to systematically explore the effects of different sizes and types of context in this disambiguation task. We create context by first identifying coreferent expressions in the document and then combining the sentences in which these expressions occur into one informative context. We apply different filters to obtain different levels of coreference-based context. Since hand-labeling a dataset of a decent size is expensive, we investigate the usefulness of an automatically created pseudo-ambiguity dataset. The results on this pseudo-ambiguity dataset show that coreference-based context performs better than a fixed window of context around the entity. The insights from the pseudo-data experiments can be used to predict how the method works with real data. In our experiments on real data we obtain comparable results.

Bibliographic details

Source: 9th International Conference on Language Resources and Evaluation | 2014 | 8 pages
Authors: Andrea Glaser; Jonas Kuhn
Original format: PDF
Chinese Library Classification: Programming, software engineering
Keywords: named entity disambiguation; coreference; pseudo-ambiguity dataset

Similar literature

1. Beyond environmental concerns: using means-end chains to explore the personal psychological values and motivations of leisure/recreational cyclists [J]. Ho Chaang-Iuan, Liao Tsai-Yuan, Huang Shu-Chin. Journal of Sustainable Tourism. 2015, No. 2
2. Which Factors Contribute to Resolving Coreference Chains with Bayesian Networks? [J]. Davy Weissenbacher, Yutaka Sasaki. International Journal of Computational Linguistics and Applications. 2013, No. 2
3. Processing new and repeated names: effects of coreference on repetition priming with speech and fast RSVP [J]. Camblin CC, Ledoux K, Boudewyn M. Brain Research. 2007
4. Exploring the utility of coreference chains for improved identification of personal names [C]. Andrea Glaser, Jonas Kuhn. 9th International Conference on Language Resources and Evaluation. 2014
5. Exploring naming behavior in personal digital image collections: The iconology and language games of Pinterest [D]. Sutcliffe, Tami. 2014
6. Processing new and repeated names: Effects of coreference on repetition priming with speech and fast RSVP [O]. C. Christine Camblin, Kerry Ledoux, Megan Boudewyn
7. Improving Event Coreference Resolution by Modeling Correlations between Event Coreference Chains and Document Topic Structures [O]. Prafulla Kumar Choubey, Ruihong Huang. 2018
