首页> 外文会议>2016 International Conference on Bioinformatics and Systems Biology >Improving extraction of protein — Protein interaction datasets from KUPS using hashing approach
【24h】

Improving extraction of protein — Protein interaction datasets from KUPS using hashing approach

机译:使用哈希方法改善从KUPS中提取蛋白质-蛋白质相互作用数据集

获取原文
获取原文并翻译 | 示例

摘要

The machine learning approaches frequently address the extraction of training datasets from the online databases to build computational or mathematical models. The training data downloaded from the online server and databases are most often carry redundancy and noise. Heuristics methods are most common to filter the data. Dataset filtering process is time consuming and researcher has to do this tedious work. We propose a more generic filter to detect frequent exceptions to increase the quality of generated datasets based on Perl Hash Programming and regular expression methodology. Future development of noise and error reduction approaches is important to make use of the full potential of available database knowledge. We make use of the datasets of protein - protein interaction generated by The University of Kansas Proteomics Service (KUPS).
机译:机器学习方法经常解决从在线数据库中提取训练数据集以建立计算或数学模型的问题。从在线服务器和数据库下载的培训数据通常带有冗余和噪音。启发式方法最常用来过滤数据。数据集过滤过程非常耗时,研究人员必须完成这项繁琐的工作。我们提出了一种更通用的过滤器,以基于Perl哈希编程和正则表达式方法来检测频繁的异常,以提高生成的数据集的质量。噪声和错误减少方法的未来发展对于充分利用可用数据库知识的潜力很重要。我们利用堪萨斯大学蛋白质组学服务(KUPS)生成的蛋白质-蛋白质相互作用数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号