A visual attention-based keyword extraction for document classification

Wu Xing; Du Zhikang; Guo Yike

首页> 外文期刊>Multimedia Tools and Applications >A visual attention-based keyword extraction for document classification

【24h】

A visual attention-based keyword extraction for document classification

机译：基于视觉注意的关键词提取，用于文档分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Document classification plays an important role in natural language processing. Among that, keyword extraction algorithm shows its great potential in summarizing the entire document. Attention is the process of selectively concentrating on a discrete aspect of information, while ignoring other perceivable information. A new probabilistic keyword extraction algorithm is proposed, which is inspired by the visual attention mechanism. An unsupervised neural network based pre-training method is proposed for training the semantic attention based keyword extraction algorithm, which is helpful in extracting keywords with rich contextual information from the document. A bidirectional Long short-term memory network combined with the proposed semantic keyword extraction algorithm is designed for both topic and sentiment classification tasks. Experiments on four large scale datasets show that the proposed visual attention based keyword extraction algorithm gives a better performance than the baseline methods. The semantic attention based keyword extraction method is significant in summarizing the content of a document, which is very useful for large scale document classification.

机译：文档分类在自然语言处理中起着重要作用。其中，关键词提取算法在总结整个文档方面显示出巨大的潜力。注意是选择性地专注于信息的离散方面，而忽略了其他可感知信息的过程。在视觉注意力机制的启发下，提出了一种新的概率关键字提取算法。提出了一种基于无监督神经网络的预训练方法，用于训练基于语义注意的关键词提取算法，该方法有助于从文档中提取具有丰富上下文信息的关键词。针对主题和情感分类任务设计了双向双向长短期记忆网络，并结合了所提出的语义关键词提取算法。在四个大型数据集上的实验表明，所提出的基于视觉注意力的关键词提取算法比基线方法具有更好的性能。基于语义注意力的关键词提取方法在总结文档内容方面具有重要意义，这对于大规模文档分类非常有用。

著录项

来源
《Multimedia Tools and Applications》 |2018年第19期|25355-25367|共13页
作者
Wu Xing; Du Zhikang; Guo Yike;
展开▼
作者单位

Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China;

Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China;

Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Visual attention; Semantic context; Keyword extraction; Document classification; Long short-term memory;

机译：视觉注意力;语义上下文;关键词提取;文档分类;长期短期记忆;

相似文献

外文文献
中文文献
专利

1. Combining attention-based bidirectional gated recurrent neural network and two-dimensional convolutional neural network for document-level sentiment classification [J] . Liu Fagui, Zheng Jingzhong, Zheng Lailei, Neurocomputing . 2020,第Jana2期

机译：结合基于注意力的双向门控递归神经网络和二维卷积神经网络进行文档级情感分类
2. Attention-based word-level contextual feature extraction and cross-modality fusion for sentiment analysis and emotion classification [J] . International journal of intelligent engineering informatics . 2020,第1期

机译：基于注意力的词级上下文特征提取和跨模态融合，用于情感分析和情感分类
3. Keyword Search over Probabilistic XML Documents Based on Node Classification [J] . Zhao Yue, Yuan Ye, Wang Guoren Mathematical Problems in Engineering . 2015,第PTa9期

机译：基于节点分类的概率XML文档关键词搜索
4. GA Based Optimal Keyword Extraction in an Automatic Chinese Web Document Classification System [C] . Chih-Hsun Chou, Chin-Chuan Han, Ya-Hui Chen ISPA 2007 international workshops, SSDSN, UPWN, WISH, SGC, ParDMCom, HiPCoMB, and IST-AWSN; 20070829-31; Niagara Falls(CA) . 2007

机译：中文文档自动分类系统中基于遗传算法的最佳关键词提取
5. Keywords in the mist: Automated keyword extraction for very large documents and back of the book indexing. [D] . Csomai, Andras. 2008

机译：薄雾中的关键字：自动提取非常大的文档并在书后建立索引的关键字。
6. Hierarchical bi-directional attention-based RNNs for supporting document classification on protein–protein interactions affected by genetic mutations [O] . Aris Fergadis, Christos Baziotis, Dimitris Pappas, 2018

机译：基于分层双向注意的RNN支持受基因突变影响的蛋白质间相互作用的文档分类
7. An Approach for Extraction of Keywords and Weighting Words for Improvement Farsi Documents Classification [O] . Vahideh Rrezaie, Mahid Mohammadpour, Hamid Parvin, 2018

机译：提取改进的关键词和加权词的方法

A visual attention-based keyword extraction for document classification

摘要

著录项

相似文献

相关主题

期刊订阅