Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format

Bustamam Alhadi; Burrage Kevin; Hamilton Nicholas A.

首页> 外文期刊>Computational Biology and Bioinformatics, IEEE/ACM Transactions on >Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format

【24h】

Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format

机译：在具有CUDA和ELLPACK-R稀疏格式的GPU上使用大规模并行计算在生物信息学中进行快速并行Markov聚类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Markov clustering (MCL) is becoming a key algorithm within bioinformatics for determining clusters in networks. However, with increasing vast amount of data on biological networks, performance and scalability issues are becoming a critical limiting factor in applications. Meanwhile, GPU computing, which uses CUDA tool for implementing a massively parallel computing environment in the GPU card, is becoming a very powerful, efficient, and low-cost option to achieve substantial performance gains over CPU approaches. The use of on-chip memory on the GPU is efficiently lowering the latency time, thus, circumventing a major issue in other parallel computing environments, such as MPI. We introduce a very fast Markov clustering algorithm using CUDA (CUDA-MCL) to perform parallel sparse matrix-matrix computations and parallel sparse Markov matrix normalizations, which are at the heart of MCL. We utilized ELLPACK-R sparse format to allow the effective and fine-grain massively parallel processing to cope with the sparse nature of interaction networks data sets in bioinformatics applications. As the results show, CUDA-MCL is significantly faster than the original MCL running on CPU. Thus, large-scale parallel computation on off-the-shelf desktop-machines, that were previously only possible on supercomputing architectures, can significantly change the way bioinformaticians and biologists deal with their data.

机译：马尔可夫聚类（MCL）正在成为生物信息学中确定网络中聚类的关键算法。但是，随着生物网络上海量数据的增加，性能和可伸缩性问题已成为应用程序中的关键限制因素。同时，使用CUDA工具在GPU卡中实现大规模并行计算环境的GPU计算正成为一种非常强大，高效且低成本的选择，以实现比CPU方法更高的性能。在GPU上使用片上内存可有效缩短等待时间，从而避免了其他并行计算环境（例如MPI）中的主要问题。我们介绍一种使用CUDA（CUDA-MCL）的非常快的马尔可夫聚类算法，以执行并行稀疏矩阵计算和并行稀疏马尔可夫矩阵归一化，这是MCL的核心。我们利用ELLPACK-R稀疏格式来进行有效且细粒度的大规模并行处理，以应对生物信息学应用程序中交互网络数据集的稀疏性质。结果表明，CUDA-MCL明显快于在CPU上运行的原始MCL。因此，以前只能在超级计算架构上进行的现成台式计算机上的大规模并行计算可以显着改变生物信息学家和生物学家处理数据的方式。

著录项

来源
《Computational Biology and Bioinformatics, IEEE/ACM Transactions on》 |2012年第3期|p.679-692|共14页
作者
Bustamam Alhadi; Burrage Kevin; Hamilton Nicholas A.;
展开▼
作者单位

The University of Queensland, Brisbane, and University of Indonesia, Depok;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
CUDA; ELLPACK-R sparse format; GPU computing; Markov clustering; PPI networks; bioinformatics.; graphs and networks; parallelism and concurrency; performance evaluation; scalable parallel programming;

机译：CUDA;ELLPACK-R稀疏格式;GPU计算;马尔可夫聚类;PPI网络;生物信息学;图形和网络;并行性和并发性;性能评估;可扩展的并行编程;

相似文献

外文文献
专利

1. A fast Hough Transform algorithm for straight lines detection in an image using GPU parallel computing with CUDA-C [J] . Yam-Uicab R., Lopez-Martinez J. L., Trejo-Sanchez J. A., Journal of supercomputing . 2017,第11期

机译：快速的Hough变换算法，用于使用CUDA-C与GPU并行计算的图像中的直线
2. A novel parallel Markov clustering method in biological interaction network analysis under multi-GPU computing environment [J] . Fu You, Zhou Wei Journal of supercomputing . 2020,第10期

机译：多GPU计算环境下的生物交互网络分析中的一种新的并行马尔可夫聚类方法
3. Massively parallel GPU computing for fast stereo correspondence algorithms [J] . Wang Hongjian, Zhang Naiyu, Creput Jean-Charles, Journal of systems architecture . 2016,第Null期

机译：大规模并行GPU计算，可实现快速立体对应算法
4. A GPU Implementation of Fast Parallel Markov Clustering in Bioinformatics Using EllPACK-R Sparse Data Format [C] . Bustamam Alhadi, Burrage Kevin, Hamilton Nicholas A. 2010 Second International Conference on Advances in Computing, Control and Telecommunication Technologies . 2010

机译：使用EllPACK-R稀疏数据格式的生物信息学中的快速并行马尔可夫聚类的GPU实现
5. GPU data-parallel computing of sequence alignment using CUDA. [D] . Jung, Sungbo. 2008

机译：使用CUDA进行GPU数据并行计算的序列比对。
6. Parallelized pairwise sequence alignment using CUDA on multiple GPUs [O] . Sungbo Jung 2009

机译：在多个GPU上使用CUDA进行并行的成对序列比对
7. An MPI-CUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters [O] . Jacobsen Dana A., Thibault Julien C., Senocak Inanc 2010

机译：MPI-CUDA在多GPU群集上大规模并行不可压缩流量计算的实现

Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format

摘要

著录项

相似文献

相关主题

期刊订阅