Naive Bayesian Automatic Classification of Railway Service Complaint Text Based on Eigenvalue Extraction

Li Lifeng; Li Wenxing

Abstract

Railways in China have developed rapidly over several decades. Railway hardware has already reached a world-leading level, but the level of service still has room for improvement. The railway management department receives a large number of passenger complaints every year and records them as text, which needs to be classified and analyzed. Railway complaint text covers a wide range of business areas and many kinds of events, is heavily colloquial, and contains interfering and useless information. As a result, applying traditional text categorization directly yields low classification accuracy. The key to automatically classifying such text lies in eigenvalue extraction: the more accurate the eigenvalue extraction, the higher the accuracy of text classification. In this paper, the TF-IDF, TextRank and Word2vec algorithms are each used to extract text eigenvalues, and a railway complaint text classification method is constructed with a naive Bayesian classifier. The three eigenvalue extraction algorithms are compared, and eigenvalue extraction based on the TF-IDF algorithm achieves the highest automatic text classification accuracy.
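
As a rough illustration of the kind of pipeline the abstract describes (TF-IDF eigenvalue extraction followed by a naive Bayesian classifier), the Python sketch below keeps the highest-scoring TF-IDF tokens of each complaint and trains a multinomial naive Bayes model on them. It is not the authors' implementation. The whitespace tokenization, the toy English complaints, the two labels, the add-one smoothing, and the names `tfidf_keywords`, `NaiveBayes` and `top_k` are all assumptions made for illustration. Real Chinese complaint text would first need word segmentation.

```python
import math
from collections import Counter, defaultdict


def tfidf_keywords(docs, top_k=3):
    """Return the top_k highest TF-IDF tokens for each tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each token.
    df = Counter(tok for doc in docs for tok in set(doc))
    keywords = []
    for doc in docs:
        tf = Counter(doc)
        scores = {
            tok: (count / len(doc)) * math.log(n_docs / df[tok])
            for tok, count in tf.items()
        }
        # Sort by descending score; break ties alphabetically for determinism.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        keywords.append(ranked[:top_k])
    return keywords


class NaiveBayes:
    """Multinomial naive Bayes with Laplace (add-one) smoothing."""

    def fit(self, feature_lists, labels):
        self.class_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        for feats, label in zip(feature_lists, labels):
            self.token_counts[label].update(feats)
        self.vocab = {t for c in self.token_counts.values() for t in c}
        return self

    def predict(self, tokens):
        total = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)  # log prior
            for tok in tokens:
                if tok in self.vocab:  # tokens unseen in training are ignored
                    score += math.log((counts[tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy complaints, tokenized on whitespace.
train_docs = [
    "the train was late and the staff was rude".split(),
    "staff was rude at the ticket window".split(),
    "the toilet was dirty and the seat was broken".split(),
    "dirty carriage and broken air conditioning".split(),
]
train_labels = ["service", "service", "facilities", "facilities"]

model = NaiveBayes().fit(tfidf_keywords(train_docs, top_k=4), train_labels)
print(model.predict("rude staff on the platform".split()))
```

Here the classifier only sees the tokens that TF-IDF ranks highest in each training complaint, so common words such as "the" and "was" never enter the vocabulary. Swapping `tfidf_keywords` for a TextRank or Word2vec based extractor would give the other two variants the abstract compares.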


Bibliographic details

Source: Technical Gazette, 2019, Issue 3, 8 pages
Authors: Li Lifeng; Li Wenxing
Original format: PDF
Chinese Library Classification: General industrial technology
Keywords: automatic classification; eigenvalue; naive Bayes; railway complaint text; TextRank; TF-IDF; Word2vec

Similar literature

1. Artificial intelligence on diabetic retinopathy diagnosis: an automatic classification method based on grey level co-occurrence matrix and naive Bayesian model [J]. Kai Cao, Jie Xu, Wei-Qi Zhao. International Journal of Ophthalmology (English Edition), 2019, Issue 7.
2. Application of improved distributed naive Bayesian algorithms in text classification [J]. Gao Hongyi, Zeng Xi, Yao Chunhua. Journal of Supercomputing, 2019, Issue 9.
3. Bayesian Naive Bayes classifiers to text classification [J]. Shuo Xu. Journal of Information Science, 2018, Issue 1.
4. Research on Chinese text classification based on Naive Bayesian method [C]. Geng Xinglong, Gao Xiuyan, Zhao Bin. Proceedings of the Fifth International Symposium on Test Automation & Instrumentation, 2014.
5. Identification of secondary and tertiary motifs in DNA sequences through naive Bayesian text classification [D]. Villalobos, Rodney V. 2007.
6. Artificial intelligence on diabetic retinopathy diagnosis: an automatic classification method based on grey level co-occurrence matrix and naive Bayesian model [O]. Kai Cao, Jie Xu, Wei-Qi Zhao. 2019.
7. Semi-automatic Classification Based on ICD Code for Thai Text-Based Chief Complaint by Machine Learning Techniques [O]. Jarunee Duangsuwan, Pawin Saeku. 2018.
8. Privacy-Preserving Naive Bayesian Classification [R]. Zhan, Z., Chang, L., Matwin, S. 2004.
