Data Mining and Predictive Modeling of Biomolecular Network from Biomedical Literature Databases

Xiaohua Hu; Wu D.


Abstract

In this paper, we present Bio-IEDM (biomedical information extraction and data mining), a novel approach that integrates text mining and predictive modeling to analyze biomolecular networks from biomedical literature databases. Our method consists of two phases. In phase 1, we discuss an efficient semisupervised learning approach that automatically extracts biological relationships, such as protein-protein and protein-gene interactions, from the biomedical literature databases to construct the biomolecular network. Our method automatically learns patterns from a few user-supplied seed tuples and then extracts new tuples from the biomedical literature based on the discovered patterns. The derived biomolecular network forms a large scale-free network graph. In phase 2, we present a novel clustering algorithm that analyzes the biomolecular network graph to identify biologically meaningful subnetworks (communities). The clustering algorithm takes into account the characteristics of scale-free network graphs and is based on the local density of a vertex and its neighborhood functions, which can be used to find more meaningful clusters with different density levels. The experimental results indicate that our approach is very effective in extracting biological knowledge from a huge collection of biomedical literature. The integration of data mining and information extraction provides a promising direction for analyzing biomolecular networks.
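The seed-tuple bootstrapping described for phase 1 can be illustrated with a minimal Python sketch. This is not the authors' Bio-IEDM implementation: the function names (`learn_patterns`, `extract_tuples`, `bootstrap`), the toy corpus, and the simplification that a "pattern" is just the literal text between two entity names are all assumptions made for illustration. A real system would presumably need entity recognition and pattern scoring, which the abstract does not detail.

```python
import re
from collections import Counter


def learn_patterns(sentences, tuples, min_support=1):
    """Collect the text that appears between the two members of a known tuple."""
    counts = Counter()
    for sent in sentences:
        for a, b in tuples:
            m = re.search(re.escape(a) + r"\s+(.+?)\s+" + re.escape(b), sent)
            if m:
                counts[m.group(1)] += 1
    # Keep only patterns seen at least min_support times (hypothetical filter).
    return {p for p, n in counts.items() if n >= min_support}


def extract_tuples(sentences, patterns):
    """Apply each learned pattern to find new (entity, entity) pairs."""
    found = set()
    for sent in sentences:
        for p in patterns:
            rx = r"\b(\w+)\s+" + re.escape(p) + r"\s+(\w+)\b"
            for m in re.finditer(rx, sent):
                found.add((m.group(1), m.group(2)))
    return found


def bootstrap(sentences, seeds, rounds=2):
    """Alternate pattern learning and tuple extraction, starting from seeds."""
    tuples = set(seeds)
    for _ in range(rounds):
        patterns = learn_patterns(sentences, tuples)
        new = extract_tuples(sentences, patterns) - tuples
        if not new:
            break  # nothing new was discovered, so stop early
        tuples |= new
    return tuples


# Toy corpus, invented for illustration only.
corpus = [
    "RAD51 interacts with BRCA2 in repair foci.",
    "MDM2 interacts with TP53 under stress.",
    "MDM2 binds to TP53 directly.",
    "CDK2 binds to CCNE1 at the G1/S transition.",
]
network_edges = bootstrap(corpus, {("RAD51", "BRCA2")})
```

Starting from a single seed pair, the first round learns the pattern "interacts with" and picks up a second pair; the second round learns "binds to" from that new pair and reaches a third. The resulting pairs can be read as edges of the network that phase 2 would then cluster.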


Bibliographic details

Source
IEEE/ACM transactions on computational biology and bioinformatics | 2007, Issue 2 | pp. 251-263 | 13 pages
Authors
Xiaohua Hu; Wu D.
Original format: PDF
Language: English
Chinese Library Classification: biomathematical methods; bioinformation theory
Keywords
biochemistry; data mining; genetics; graphs; learning (artificial intelligence); medical information systems; molecular biophysics; prediction theory; statistical analysis; Bio-IEDM; biomedical information extraction; biomedical literature databases; biomolecular ne;

