首页> 外文期刊>BMC Bioinformatics >On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly
【24h】

On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly

机译:具有多路径的子路径约束的最小路径覆盖的复杂性

获取原文
           

摘要

Background Multi-assembly problems have gathered much attention in the last years, as Next-Generation Sequencing technologies have started being applied to mixed settings, such as reads from the transcriptome (RNA-Seq), or from viral quasi-species. One classical model that has resurfaced in many multi-assembly methods (e.g. in Cufflinks, ShoRAH, BRANCH, CLASS) is the Minimum Path Cover (MPC) Problem, which asks for the minimum number of directed paths that cover all the nodes of a directed acyclic graph. The MPC Problem is highly popular because the acyclicity of the graph ensures its polynomial-time solvability. Results In this paper, we consider two generalizations of it dealing with integrating constraints arising from long reads or paired-end reads; these extensions have also been considered by two recent methods, but not fully solved. More specifically, we study the two problems where also a set of subpaths, or pairs of subpaths, of the graph have to be entirely covered by some path in the MPC. We show that in the case of long reads (subpaths), the generalized problem can be solved in polynomial-time by a reduction to the classical MPC Problem. We also consider the weighted case, and show that it can be solved in polynomial-time by a reduction to a min-cost circulation problem. As a side result, we also improve the time complexity of the classical minimum weight MPC Problem. In the case of paired-end reads (pairs of subpaths), the generalized problem becomes NP-hard, but we show that it is fixed-parameter tractable (FPT) in the total number of constraints. This computational dichotomy between long reads and paired-end reads is also a general insight into multi-assembly problems.
机译:背景技术近年来,随着下一代测序技术已开始应用于混合环境,例如转录组(RNA-Seq)或病毒准物种的读取,多组装问题已引起广泛关注。最小路径覆盖(MPC)问题是一种在许多多重装配方法(例如,袖扣,ShoRAH,BRANCH,CLASS)中浮出水面的经典模型,它要求覆盖定向对象所有节点的定向路径的最小数量非循环图。 MPC问题非常受欢迎,因为图的非循环性确保了它的多项式时间可解性。结果在本文中,我们考虑了它的两种概括,它们涉及长读或成对末端读引起的整合约束。最近的两种方法也考虑了这些扩展,但尚未完全解决。更具体地说,我们研究了两个问题,其中图的一组子路径或子路径对也必须被MPC中的某些路径完全覆盖。我们表明,在长读取(子路径)的情况下,可以通过将经典MPC问题简化为多项式时间来解决广义问题。我们还考虑了加权情况,并表明可以通过减少最小成本流通问题在多项式时间内求解。附带的结果,我们还提高了经典最小权重MPC问题的时间复杂度。在成对读取(成对子路径)的情况下,广义问题变得难以解决,但我们证明它在约束总数中是固定参数易处理的(FPT)。长读取和配对末端读取之间的这种计算二分法也是对多程序集问题的普遍了解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号