Comparative study of classification techniques on biomedical data from hypertext documents

Rashedur M. Rahman; Sazia Salahuddin

Abstract

In this paper, our goal is to mine biomedical data from hypertext documents (e.g., web content) using data mining algorithms with the help of a biomedical ontology. We collect a number of documents using Google, preprocess the hypertext documents, and extract the text data. The next task is to identify the biomedical data. To determine whether a word is a biomedical entity, we use a biomedical database, the UMLS Metathesaurus. Biomedical entities are mapped from the Metathesaurus by keyword query. The more often biomedical entities occur in a page, the more relevant the page is taken to be, so we can re-rank the documents to find the most important ones. We then test and analyse the performance of seven of the most popular classification algorithms by training them separately on the documents as ranked by Google and as ranked by our algorithm.
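
The re-ranking idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the term set BIOMEDICAL_TERMS is a hypothetical stand-in for keyword queries against the UMLS Metathesaurus, the function names are invented for illustration, and the score is assumed to be a plain count of matching words in the extracted text.

```python
import re
from html.parser import HTMLParser

# Hypothetical stand-in for a UMLS Metathesaurus keyword lookup (assumption).
BIOMEDICAL_TERMS = {"insulin", "diabetes", "glucose", "hypertension"}


class TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def extract_text(html):
    """Preprocessing step: reduce a hypertext document to its visible text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def biomedical_score(html, terms=BIOMEDICAL_TERMS):
    """Count the words in the page text that appear in the term set."""
    words = re.findall(r"[a-z]+", extract_text(html).lower())
    return sum(1 for word in words if word in terms)


def rerank(pages, terms=BIOMEDICAL_TERMS):
    """pages: list of (url, html) in search-engine order.

    Returns the URLs sorted by descending score. Python's sort is stable,
    so pages with equal scores keep their original relative order.
    """
    scored = [(biomedical_score(html, terms), url) for url, html in pages]
    return [url for _, url in sorted(scored, key=lambda pair: -pair[0])]


if __name__ == "__main__":
    pages = [
        ("a.example", "<p>Stock prices and glucose.</p>"),
        ("b.example", "<p>Insulin lowers blood glucose in diabetes.</p>"),
    ]
    print(rerank(pages))
```

In this toy input the second page contains three matching terms and the first only one, so the second page moves to the top. A real system would replace the fixed term set with Metathesaurus queries and would likely need multi-word entity matching, which this sketch does not attempt.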

Bibliographic details

Source
International journal of knowledge engineering and soft data paradigms, 2013, Issue 1, pp. 21-41 (21 pages)
Authors
Rashedur M. Rahman; Sazia Salahuddin
Author affiliation
Department of Electrical Engineering and Computer Science, North South University, Plot-15, Block-B, Bashundhara, Dhaka 1229, Bangladesh (both authors)

Original format: PDF
Language: English
Keywords
data mining; biomedical ontology; classification; performance analysis; document clustering