...
首页> 外文期刊>Entropy >Analysis of Data Complexity in Human DNA for Gene-Containing Zone Prediction†
【24h】

Analysis of Data Complexity in Human DNA for Gene-Containing Zone Prediction†

机译:人类DNA中数据的复杂性分析,以预测基因所在区域†

获取原文
           

摘要

This study delves further into the analysis of genomic data by computing a variety of complexity measures. We analyze the effect of window size and evaluate the precision and recall of the prediction of gene zones, aided with a much larger dataset (full chromosomes). A technique based on the separation of two cases (gene-containing and non-gene-containing) has been developed as a basic gene predictor for automated DNA analysis. This predictor was tested on various sequences of human DNA obtained from public databases, in a set of three experiments. The first one covers window size and other parameters; the second one corresponds to an analysis of a full human chromosome (198 million nucleic acids); and the last one tests subject variability (with five different individual subjects). All three experiments have high-quality results, in terms of recall and precision, thus indicating the effectiveness of the predictor.
机译:这项研究通过计算各种复杂性指标来进一步研究基因组数据。我们借助更大的数据集(完整染色体)来分析窗口大小的影响,并评估基因区域预测的准确性和召回率。已经开发了基于两种情况(含基因和不含基因)分离的技术,作为自动DNA分析的基本基因预测因子。在一组三个实验中,对从公共数据库获得的人类DNA的各种序列进行了测试,测试了该预测因子。第一个覆盖窗口大小和其他参数;第二个对应于完整人类染色体(1.98亿个核酸)的分析;最后一个测试对象的变异性(五个不同的单个对象)。在召回率和准确性方面,所有三个实验均具有高质量的结果,从而表明了预测器的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号