International Journal on Informatics Visualization (JOIV)

Efficient processing of GRU based on word embedding for text classification



Abstract

Text classification has become a pressing problem for large organizations that must manage vast amounts of online data, and it has been extensively applied in Natural Language Processing (NLP) tasks. Text classification helps users manage and exploit meaningful information that must be sorted into various categories for further use. To classify texts well, our research develops a deep learning approach that achieves better performance in text classification than other RNN approaches. The main challenge in text classification is improving classification accuracy: the sparsity of the data and the sensitivity of semantics to context often hinder classification performance. To overcome this weakness, in this paper we propose a unified structure to investigate the effects of word embedding and the Gated Recurrent Unit (GRU) on text classification over two benchmark datasets (Google snippets and TREC). The GRU is a well-known type of recurrent neural network (RNN) capable of processing sequential data through its recurrent architecture. Empirically, semantically related words tend to lie near each other in the embedding space. First, the words in each post are converted into vectors via a word embedding technique. Then, the word sequences of sentences are fed to the GRU to extract the contextual semantics between words. The experimental results show that the proposed GRU model can effectively learn word usage in the context of texts given the training data; the quantity and quality of the training data significantly affect performance. We compared the proposed approach with traditional recurrent approaches (RNN, MV-RNN, and LSTM); the proposed approach obtains better results on the two benchmark datasets in terms of accuracy and error rate.
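The embedding-then-GRU pipeline the abstract describes can be sketched in pure Python. This is a minimal illustration only: the `EMBEDDINGS` table, the deterministic weight initialization, and the `encode` helper are assumptions for demonstration, not the authors' implementation, and real systems would use trained embeddings (e.g. word2vec) and learned GRU parameters.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

class GRUCell:
    """Minimal GRU cell (Cho et al., 2014 formulation).

    Deterministic toy weights stand in for trained parameters.
    """
    def __init__(self, input_size, hidden_size, scale=0.1):
        def mat(rows, cols):
            # Small fixed values in {-scale, 0, scale}; purely illustrative.
            return [[scale * ((i + j) % 3 - 1) for j in range(cols)]
                    for i in range(rows)]
        self.Wz, self.Uz = mat(hidden_size, input_size), mat(hidden_size, hidden_size)
        self.Wr, self.Ur = mat(hidden_size, input_size), mat(hidden_size, hidden_size)
        self.Wh, self.Uh = mat(hidden_size, input_size), mat(hidden_size, hidden_size)
        self.hidden_size = hidden_size

    def step(self, x, h):
        # Update gate z_t and reset gate r_t.
        z = [sigmoid(a + b) for a, b in zip(matvec(self.Wz, x), matvec(self.Uz, h))]
        r = [sigmoid(a + b) for a, b in zip(matvec(self.Wr, x), matvec(self.Ur, h))]
        # Candidate state uses the reset-gated previous hidden state.
        rh = [ri * hi for ri, hi in zip(r, h)]
        h_tilde = [math.tanh(a + b)
                   for a, b in zip(matvec(self.Wh, x), matvec(self.Uh, rh))]
        # New state interpolates between the old state and the candidate.
        return [(1 - zi) * hi + zi * hti for zi, hi, hti in zip(z, h, h_tilde)]

# Hypothetical 3-dimensional word embeddings for a toy vocabulary.
EMBEDDINGS = {
    "what": [0.1, -0.2, 0.3],
    "is":   [0.0,  0.4, -0.1],
    "nlp":  [0.5,  0.1,  0.2],
}

def encode(sentence, cell):
    """Run the word sequence through the GRU; the final hidden state
    summarizes the sentence and would feed a softmax classifier."""
    h = [0.0] * cell.hidden_size
    for word in sentence.lower().split():
        h = cell.step(EMBEDDINGS[word], h)
    return h

cell = GRUCell(input_size=3, hidden_size=4)
sentence_vector = encode("what is nlp", cell)
```

In the paper's setting, `sentence_vector` would be passed to a classification layer trained on the Google snippets or TREC labels; here it simply shows how the recurrent update carries contextual information across the word sequence.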
