首页> 外文期刊>Current Protocols in Bioinformatics >UNIT 9.4 Using SQL Databases for Sequence Similarity Searching and Analysis
【24h】

UNIT 9.4 Using SQL Databases for Sequence Similarity Searching and Analysis

机译:单元9.4使用SQL数据库进行序列相似性搜索和分析

获取原文
获取原文并翻译 | 示例
       

摘要

Relational databases can integrate diverse types of information and manage large sets of similarity search results, greatly simplifying genome-scale analyses. By focusing on taxonomic subsets of sequences, relational databases can reduce the size andredundancy of sequence libraries and improve the statistical significance of homologs. In addition, by loading similarity search results into a relational database, it becomes possible to explore and summarize the relationships between all of the proteins in an organism and those in other biological kingdoms. This unit describes how to use relational databases to improve the efficiency of sequence similarity searching and demonstrates various large-scale genomic analyses of homology-related data. It also describes the installation and use of a simple protein sequence database, seqdb_demo, which is used as a basis for the other protocols. The unit also introduces search_demo, a database that stores sequence similarity search results. The search_demo database is then used to explore the evolutionary relationships between E. coli proteins and proteins in other organisms in a large-scale comparative genomic analysis.
机译:关系数据库可以集成不同类型的信息,并管理大量的相似性搜索结果,大大简化了基因组级别分析。通过专注于分类序列的子集,关系数据库可以减少序列文库的尺寸,提高同源物的统计学意义。另外,通过将相似性搜索结果加载到关系数据库中,可以探讨和总结生物体中所有蛋白质之间的关系和其他生物王国之间的关系。本机描述了如何使用关系数据库来提高序列相似性搜索效率,并展示与同源相关数据的各种大规模基因组分析。它还介绍了简单的蛋白质序列数据库,SEQDB_DEMO的安装和使用,其用作其他协议的基础。该单元还介绍了Search_Demo,该数据库存储序列相似性搜索结果。然后,搜索_DEMO数据库在大规模的比较基因组分析中探讨其他生物中的大肠杆菌蛋白和蛋白质之间的进化关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号