Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

Laura Plaza; Antonio J Jimeno-Yepes; Alberto Díaz; Alan R Aronson

首页> 外文期刊>BMC Bioinformatics >Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

【24h】

Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

机译：研究生物医学文本中不同词义消歧方法与摘要效果之间的相关性

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background Word sense disambiguation (WSD) attempts to solve lexical ambiguities by identifying the correct meaning of a word based on its context. WSD has been demonstrated to be an important step in knowledge-based approaches to automatic summarization. However, the correlation between the accuracy of the WSD methods and the summarization performance has never been studied. Results We present three existing knowledge-based WSD approaches and a graph-based summarizer. Both the WSD approaches and the summarizer employ the Unified Medical Language System (UMLS) Metathesaurus as the knowledge source. We first evaluate WSD directly, by comparing the prediction of the WSD methods to two reference sets: the NLM WSD dataset and the MSH WSD collection. We next apply the different WSD methods as part of the summarizer, to map documents onto concepts in the UMLS Metathesaurus, and evaluate the summaries that are generated. The results obtained by the different methods in both evaluations are studied and compared. Conclusions It has been found that the use of WSD techniques has a positive impact on the results of our graph-based summarizer, and that, when both the WSD and summarization tasks are assessed over large and homogeneous evaluation collections, there exists a correlation between the overall results of the WSD and summarization tasks. Furthermore, the best WSD algorithm in the first task tends to be also the best one in the second. However, we also found that the improvement achieved by the summarizer is not directly correlated with the WSD performance. The most likely reason is that the errors in disambiguation are not equally important but depend on the relative salience of the different concepts in the document to be summarized.

机译：背景技术词义歧义消除（WSD）尝试通过根据单词的上下文识别单词的正确含义来解决词汇歧义。在基于知识的自动摘要方法中，WSD已被证明是重要的一步。但是，从未研究过WSD方法的准确性与摘要性能之间的相关性。结果我们提出了三种现有的基于知识的WSD方法和基于图的摘要器。 WSD方法和摘要器都采用统一医学语言系统（UMLS）元同义词库作为知识源。我们首先通过将WSD方法的预测与两个参考集进行比较来直接评估WSD：NLM WSD数据集和MSH WSD集合。接下来，我们将不同的WSD方法用作摘要程序的一部分，以将文档映射到UMLS Metathesaurus中的概念，并评估所生成的摘要。研究并比较了两种评估中通过不同方法获得的结果。结论已经发现，使用WSD技术对基于图形的汇总器的结果具有积极影响，并且当在大型且均质的评估集合中同时评估WSD和摘要任务时，两者之间存在相关性。水务署的总体结果和总结任务。此外，第一个任务中最好的WSD算法往往在第二个任务中也是最好的。但是，我们还发现，汇总器获得的改进与WSD性能没有直接关系。最可能的原因是，歧义歧义的错误不是同等重要，而是取决于要概括的文档中不同概念的相对重要性。

著录项

来源
《BMC Bioinformatics》 |2011年第1期|共页
作者
Laura Plaza; Antonio J Jimeno-Yepes; Alberto Díaz; Alan R Aronson;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类生物科学;
关键词

相似文献

外文文献
中文文献
专利

1. Improvement of query-based text summarization using word sense disambiguation [J] . Nazreena Rahman, Bhogeswar Borah Complex & Intelligent Systems . 2019,第1期

机译：使用词义消歧改进基于查询的文本摘要
2. deepBioWSD: effective deep neural word sense disambiguation of biomedical text data [J] . Ahmad Pesaranghader, Stan Matwin, Marina Sokolova, Journal of the American Medical Informatics Association : . 2019,第5期

机译：DeepBiowsd：生物医学文本数据的有效深度神经词感义歧义
3. Kernel Fuzzy C-Means Clustering for Word Sense Disambiguation in BioMedical Texts [J] . K. REN, Y.F. REN Journal of digital information management . 2015,第6期

机译：生物医学文本中词义消歧的内核模糊C均值聚类
4. Unsupervised Word Sense Disambiguation in Biomedical Texts with Co-occurrence Network and Graph Kernel [C] . Tae-Gil Noh, Seong-Bae Park, Sang-Jo Lee 4th ACM international workshop on data and text mining in bioinformatics 2010 . 2010

机译：具有共现网络和图核的生物医学文本中的无监督词义消歧
5. Subjectivity word sense disambiguation: A method for sense-aware subjectivity analysis. [D] . Akkaya, Cem. 2014

机译：主观性词义消歧：一种用于感知感知的主观性分析的方法。
6. Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts [O] . Laura Plaza, Antonio J Jimeno-Yepes, Alberto Díaz, 2011

机译：研究生物医学文本中不同词义消歧方法与摘要效果之间的相关性
7. Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts [O] . Laura Plaza, Antonio J Jimeno-Yepes, Alberto Díaz, 2011

机译：研究生物医学文本中不同词义消歧方法与摘要效果之间的相关性

Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts

摘要

著录项

相似文献

相关主题

期刊订阅