Preprocessing of radicalism dataset to predict radical content in Indonesia

Abstract

By its procedural meaning, radical content is content that invites, provokes, or performs certain acts, or that interprets jihad narrowly, for example as suicide bombing. In Indonesia, radical content is often associated with issues of tribe, religion, and race. Classifying radical content is a challenging technical problem because the content is large in volume, unstructured, and noisy. The more content there is, the more features it produces, which leads to high dimensionality and can degrade the performance of the classification algorithm. One way to address this problem is dimensionality reduction, such as feature selection. In this study, we propose an approach that selects features categorized as radical and non-radical using Human Brain and DF-Threshold. Preprocessing is performed first, then text mining, then feature selection using Human Brain and DF-Threshold. Testing uses 10-fold cross-validation with k-Nearest Neighbor (k-NN) as the classifier. In these trials, the highest accuracy was 66.37%, obtained with k = 7 in k-NN.
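The DF-Threshold and k-NN steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the whitespace tokenizer, the term-count vectors, the Euclidean distance, and the `min_df` cutoff are all assumptions made for illustration, and the Human Brain selection step is omitted because the abstract does not describe it.

```python
from collections import Counter
import math

def tokenize(text):
    """Lowercase and split on whitespace; a placeholder for real preprocessing."""
    return text.lower().split()

def document_frequency(docs):
    """Count, for each term, the number of documents that contain it."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # set() so a term counts once per document
    return df

def select_by_df(docs, min_df):
    """Keep terms whose document frequency is at least min_df (sorted for a stable order)."""
    df = document_frequency(docs)
    return sorted(term for term, count in df.items() if count >= min_df)

def vectorize(tokens, vocabulary):
    """Term-count vector over the selected vocabulary."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]

def knn_predict(train_x, train_y, x, k):
    """Majority vote among the k nearest training vectors (Euclidean distance)."""
    nearest = sorted(range(len(train_x)), key=lambda i: math.dist(train_x[i], x))[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]

def cross_validate(texts, labels, k, min_df, folds=10):
    """Pooled accuracy over `folds` folds; features are selected on each training split only."""
    docs = [tokenize(t) for t in texts]
    correct = 0
    for fold in range(folds):
        test_idx = [i for i in range(len(docs)) if i % folds == fold]
        train_idx = [i for i in range(len(docs)) if i % folds != fold]
        vocab = select_by_df([docs[i] for i in train_idx], min_df)
        train_x = [vectorize(docs[i], vocab) for i in train_idx]
        train_y = [labels[i] for i in train_idx]
        for i in test_idx:
            if knn_predict(train_x, train_y, vectorize(docs[i], vocab), k) == labels[i]:
                correct += 1
    return correct / len(docs)
```

Calling `cross_validate(texts, labels, k=7, min_df=2)` on a labelled corpus mirrors the evaluation setup the abstract reports (10 folds, k = 7). The cutoff `min_df=2` is an arbitrary illustrative value, not one taken from the paper. Selecting the vocabulary inside each fold keeps test documents from influencing which features survive the threshold.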

Bibliographic details

Source
2017 International Electronics Symposium on Knowledge Creation and Intelligent Computing | 2017 | pp. 270-275 | 6 pages
Conference location: Surabaya (ID)
Authors
Muh. Subhan; Amang Sudarsono; Aliridho Barakbah;
Author affiliations

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

Departement of Information and Computer Engineering, Graduate Program of Engineering Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia;

Original format: PDF
Language of text: English
Keywords
Terrorism; Feature extraction; Classification algorithms; Algorithm design and analysis; Labeling; Text mining; Uniform resource locators;

Similar documents

1. Dataset relating to the relationship between teacher self-concept and teacher efficacy as the predictors of burnout: A survey in Indonesian education [J]. Lantip Diat Prasojo, Akhmad Habibi, Mohd Faiz Mohd Yaakob, Data in Brief. 2020, No. 2

2. Evaluation of Phenolic Content and Free Radical Scavenging Activity of Indonesia Wild Honey Collected from Seven Different Regions [J]. Y. Riswahyuli, Abdul Rohman, Francis Setyabudi, Journal of Food Research. 2019, No. 6

3. Determination of γ-oryzanol, Phenolic total content, and (2,2-difenil-1-picrylhydrazyl) Radical Scavenging Activity in Different Varieties of Rice (Oryza sativa) in Yogyakarta, Indonesia [J]. Erna Prawita Setyowati, Andayana Puspitasari Gani Universitas Gadjah Mada. 2018, No. 2

4. Preprocessing of radicalism dataset to predict radical content in Indonesia [C]. Muh. Subhan, Amang Sudarsono, Aliridho Barakbah, International Conference on Knowledge Creation and Intelligent Computing. 2017

5. Creating fast and accurate machine learning ensembles through training dataset preprocessing. [D]. Whitehead, Matthew E. N. 2010

6. Telomere DNA Content in Prostate Biopsies Predicts Early Rise in Prostate Specific Antigen Following Radical Prostatectomy for Prostate Cancer [O]. Eric G. Treat, Christopher M. Heaphy, Larry W. Massie, -1

7. Telomere DNA Content in Prostate Biopsies Predicts Early Rise in Prostate-specific Antigen After Radical Prostatectomy for Prostate Cancer [O]. Eric G. Treat, Christopher M. Heaphy, Larry W. Massie, 2010
