...
首页> 外文期刊>Journal of Bioinformatics and Computational Biology >FQC: A novel approach for efficient compression, archival, and dissemination of fastq datasets
【24h】

FQC: A novel approach for efficient compression, archival, and dissemination of fastq datasets

机译:FQC:一种有效压缩,存档和传播fastq数据集的新颖方法

获取原文
获取原文并翻译 | 示例
           

摘要

Sequence data repositories archive and disseminate fastq data in compressed format. In spite of having relatively lower compression efficiency, data repositories continue to prefer GZIP over available specialized fastq compression algorithms. Ease of deployment, high processing speed and portability are the reasons for this preference. This study presents FQC, a fastq compression method that, in addition to providing significantly higher compression gains over GZIP, incorporates features necessary for universal adoption by data repositories/end-users. This study also proposes a novel archival strategy which allows sequence repositories to simultaneously store and disseminate lossless as well as (multiple) lossy variants of fastq files, without necessitating any additional storage requirements. For academic users, Linux, Windows, and Mac implementations (both 32 and 64-bit) of FQC are freely available for download at: https://metagenomics.atc.tcs.com/compression/FQC.
机译:序列数据存储库以压缩格式归档和分发fastq数据。尽管压缩效率相对较低,但数据存储库仍然比可用的专用fastq压缩算法更喜欢GZIP。易于部署,高处理速度和可移植性是导致此优先级的原因。这项研究提出了FQC,一种fastq压缩方法,除了提供比GZIP更高的压缩增益外,还融合了数据存储库/最终用户普遍采用的必要功能。这项研究还提出了一种新颖的归档策略,该策略允许序列存储库同时存储和传播fastq文件的无损以及(多种)有损变体,而无需任何其他存储要求。对于学术用户,可以从以下网址免费下载FQC的Linux,Windows和Mac实施(32位和64位):https://metagenomics.atc.tcs.com/compression/FQC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号