Fast Training of a Graph Boosting for Large-Scale Text Classification

机译：图训练的快速训练，用于大规模文本分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a fast training method for graph classification based on a boosting algorithm and its application to sentimental analysis with input texts represented by graphs. Graph format is very suitable for representing texts structured with Natural Language Processing techniques such as morphological analysis, Named Entity Recognition, and parsing. A number of classification methods which represent texts as graphs have been proposed so far. However, many of them limit candidate features in advance because of quite large size of feature space. Instead of limiting search space in advance, we propose two approximation methods for learning of graph-based rules in a boosting. Experimental results on a sentimental analysis dataset show that our method contributes to improved training speed. In addition, the graph representation-based classification method exploits rich structural information of texts, which is impossible to be detected when using other simpler input formats, and shows higher accuracy.

机译：提出了一种基于提升算法的图分类快速训练方法，并将其应用于图表示输入文本的情感分析。图形格式非常适合表示使用自然语言处理技术（例如形态分析，命名实体识别和解析）构造的文本。迄今为止，已经提出了许多将文本表示为图形的分类方法。但是，由于特征空间很大，它们中的许多预先限制了候选特征。代替预先限制搜索空间，我们提出了两种近似方法来学习基于图的规则。在情感分析数据集上的实验结果表明，我们的方法有助于提高训练速度。另外，基于图形表示的分类方法利用了丰富的文本结构信息，这在使用其他更简单的输入格式时是无法检测到的，并且显示出更高的准确性。

著录项

来源
《Pacific Rim international conference on artificial intelligence》|2016年|638-650|共13页
会议地点
作者
Hiyori Yoshikawa; Tomoya Iwakura;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Text classification; Feature engineering; Graph boosting;

机译：文字分类;特征工程;图提升;

相似文献

外文文献
中文文献
专利

1. CogBoost: Boosting for Fast Cost-Sensitive Graph Classification [J] . Pan Shirui, Wu Jia, Zhu Xingquan Knowledge and Data Engineering, IEEE Transactions on . 2015,第11期

机译：CogBoost：提高成本敏感图的快速分类
2. Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification [J] . Peng Hao, Li Jianxin, Wang Senzhang, IEEE Transactions on Knowledge and Data Engineering . 2021,第6期

机译：用于大型多标签文本分类的分类分类分类和注意力图胶囊RCNN
3. Hierarchical Graph Transformer-Based Deep Learning Model for Large-Scale Multi-Label Text Classification [J] . Gong Jibing, Teng Zhiyong, Teng Qi, Quality Control, Transactions . 2020,第期

机译：基于分层图形变换器的大型多标签文本分类的深度学习模型
4. Fast Training of a Graph Boosting for Large-Scale Text Classification [C] . Hiyori Yoshikawa, Tomoya Iwakura Pacific Rim International Conference on Artificial Intelligence . 2016

机译：用于大规模文本分类的图表提升的图表快速训练
5. Algorithms for training large-scale linear programming support vector regression and classification. [D] . Rivas Perea, Pablo. 2011

机译：训练大规模线性规划的算法支持向量回归和分类。
6. Analyzing the Moving Parts of a Large-Scale Multi-Label Text Classification Pipeline: Experiences in Indexing Biomedical Articles [O] . Anthony Rios, Ramakanth Kavuluru -1

机译：分析大型多标签文本分类管道的运动部分：生物医学文章索引的经验
7. Boosting Text Classification Performance on Sexist Tweets by Text Augmentation and Text Generation Using a Combination of Knowledge Graphs [O] . Sima Sharifirad, Borna Jafarpour, Stan Matwin 2018

机译：通过使用知识图形的组合，通过文本增强和文本生成提升文本分类性能。

Fast Training of a Graph Boosting for Large-Scale Text Classification

摘要

著录项

相似文献

相关主题

期刊订阅