一种基于TextRank的单文本关键字提取算法

柳林青; 余瀚; 费宁; 陈春玲

首页> 中文期刊> 《计算机应用研究》 >一种基于TextRank的单文本关键字提取算法

一种基于TextRank的单文本关键字提取算法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

作为一种经典的文本关键字提取和摘要自动生成算法,TextRank将文本看做若干单词组成的集合,并通过对单词节点图的节点权值进行迭代计算,挖掘单词之间的潜在语义关系.在TextRank节点图模型的基础上,将马尔可夫状态转移模型与节点图相结合,提出节点间边权为条件概率的新模型生成算法TextRank Revised.通过对有标记和无标记的验证集进行验证,证明新的算法在不提升时间复杂度的前提下,通过计算单文本得出的单词排序结果相较于原TextRank算法更加吻合人工对文档的关键字提取结果.%As a classical key-word extracting and abstraction auto-generating algorithm,TextRank considered the text as a group of terms,and sought a latent semantic relationship between terms according to iteratively calculating the weights of the terms in the nodes graph.Based on the nodes graph model of TextRank,combined node graph and Markov state transform model,weighted the edge between nodes with conditional probability,proposed a new nodes graph model and corresponding algorithm TextRank_Revised(TR-R).According to the verification on labeled and unlabeled samples,it shows that without promotion of time complexity,the new algorithm can get a key-word sorting consequence which is closer to the manual than the original algorithm from the single text.

著录项

来源
《计算机应用研究》 |2018年第3期|705-710|共6页
作者
柳林青; 余瀚; 费宁; 陈春玲;
展开▼
作者单位

南京邮电大学计算机学院;

南京210003;

南京邮电大学计算机学院;

南京210003;

南京邮电大学计算机学院;

南京210003;

南京邮电大学计算机学院;

南京210003;

展开▼
原文格式 PDF
正文语种 chi
中图分类文字信息处理;算法理论;
关键词
TextRank; 单文本关键字; 提取算法; 有向带权图; 马尔可夫状态转移模型;

相似文献

中文文献
外文文献
专利

1. 基于TextRank的单文本关键字提取算法 [J] . 朱必熙 . 兰州工业学院学报 . 2018,第003期
2. 基于TextRank的单文本关键字提取算法 [J] . 朱必熙 . 兰州工业学院学报 . 2018,第003期
3. 改进TextRank的文本关键词提取算法 [J] . 王俊玲 . 软件导刊 . 2021,第004期
4. 改进TextRank的文本关键词提取算法 [J] . 王俊玲 . 软件导刊 . 2021,第004期
5. 一种基于自适应关联熵的关键字提取算法 [J] . 罗有志 ,陈征明 ,陈明 . 计算机与现代化 . 2020,第004期
6. 一种基于文本关键字模型的Audio音乐情感分类方法 [C] . 刘怡 ,高玥 . 第四届和谐人机环境联合学术会议 . 2008
7. TextRank关键词提取算法与SOM文本聚类模型的优化研究 [A] . 陈万振 . 2016

一种基于TextRank的单文本关键字提取算法

摘要

著录项

相似文献

相关主题

期刊订阅