Communication Optimization for Multi GPU Implementation of Smith-Waterman Algorithm

Sampath Kumar N V S S P; P. K. Baruah

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 80 - Number 12

Year of Publication: 2013

DOI: 10.5120/13910-1121

Sampath Kumar N V S S P, P. K. Baruah. Communication Optimization for Multi GPU Implementation of Smith-Waterman Algorithm. International Journal of Computer Applications. 80, 12 (October 2013), 1-7. DOI=10.5120/13910-1121

@article{ 10.5120/13910-1121,
author = { Sampath Kumar N V S S P and P. K. Baruah },
title = { Communication Optimization for Multi GPU Implementation of Smith-Waterman Algorithm },
journal = { International Journal of Computer Applications },
issue_date = { October 2013 },
volume = { 80 },
number = { 12 },
month = { October },
year = { 2013 },
issn = { 0975-8887 },
pages = { 1-7 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume80/number12/13910-1121/ },
doi = { 10.5120/13910-1121 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}

Abstract

GPU parallelism can deliver large performance gains for real applications, but CPU-GPU communication is one of the major bottlenecks limiting those gains. Among the libraries developed so far to optimize this communication, DyManD (Dynamically Managed Data) provides better communication optimization strategies and achieves better performance on a single GPU. Smith-Waterman is a well-known algorithm in computational biology for finding functional similarities in a protein database. A CUDA implementation of the algorithm speeds up sequence matching against the protein database, and when the input databases are large, a multi-GPU implementation performs better than a single-GPU one. Because the algorithm operates on large databases, CPU-GPU communication needs to be optimized. DyManD, however, provides efficient data management and communication optimization only for a single GPU. To provide communication optimization on multiple GPUs, an approach that combines DyManD with a multi-threaded framework called GPUWorker is proposed. Our contribution in this work is an optimized CUDA implementation of Smith-Waterman on multiple GPUs, GPUWorker-DyManD, which combines DyManD functionality with GPUWorker to reduce the communication overhead between the CPU and multiple GPUs. The GPUWorker-DyManD implementation achieves a 3.5x performance gain over the default multi-GPU implementation.
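To make the workload concrete, the following is a minimal CPU-only sketch in Python, not the authors' CUDA code. It assumes a linear gap penalty and illustrative scoring values (`match=2`, `mismatch=-1`, `gap=-1`), none of which are stated in the abstract. The function names `sw_score` and `scan_database` are hypothetical, and the worker threads merely stand in for per-GPU workers in the spirit of GPUWorker; no DyManD-style data management is modeled.

```python
from concurrent.futures import ThreadPoolExecutor


def sw_score(query, target, match=2, mismatch=-1, gap=-1):
    """Best Smith-Waterman local alignment score (linear gap penalty).

    The scoring values are illustrative assumptions, not the paper's.
    """
    prev = [0] * (len(target) + 1)  # previous row of the scoring matrix
    best = 0
    for q in query:
        curr = [0]  # column 0 is always zero for local alignment
        for j, t in enumerate(target, start=1):
            diag = prev[j - 1] + (match if q == t else mismatch)
            # Clamp at zero so an alignment can restart anywhere.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def scan_database(query, database, num_workers=2):
    """Score the query against every database sequence.

    The database is split into one interleaved chunk per worker; each
    worker thread is a stand-in for one GPU (an assumption made purely
    for illustration).
    """
    chunks = [database[i::num_workers] for i in range(num_workers)]

    def work(chunk):
        return [(sw_score(query, seq), seq) for seq in chunk]

    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        hits = [hit for part in pool.map(work, chunks) for hit in part]
    return sorted(hits, reverse=True)  # highest-scoring sequences first
```

Calling `scan_database("ACGT", ["ACGT", "TTTT", "ACGA"])` ranks the exact match first. In a real multi-GPU setting, each chunk would have to be copied to its device before scoring, which is the CPU-GPU transfer cost the abstract identifies as the bottleneck.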

Index Terms

Computer Science

Information Sciences

Keywords

DyManD, GPUWorker, Data Management, Communication Optimization