CFP last date
20 December 2024
Reseach Article

Accelerated Parallel Generation of Binomial Coefficients using GPU

by Mohsin Altaf Wani, S. M. K Quadri
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 71 - Number 2
Year of Publication: 2013
Authors: Mohsin Altaf Wani, S. M. K Quadri
10.5120/12329-8570

Mohsin Altaf Wani, S. M. K Quadri . Accelerated Parallel Generation of Binomial Coefficients using GPU. International Journal of Computer Applications. 71, 2 ( June 2013), 11-13. DOI=10.5120/12329-8570

@article{ 10.5120/12329-8570,
author = { Mohsin Altaf Wani, S. M. K Quadri },
title = { Accelerated Parallel Generation of Binomial Coefficients using GPU },
journal = { International Journal of Computer Applications },
issue_date = { June 2013 },
volume = { 71 },
number = { 2 },
month = { June },
year = { 2013 },
issn = { 0975-8887 },
pages = { 11-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume71/number2/12329-8570/ },
doi = { 10.5120/12329-8570 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:34:25.478459+05:30
%A Mohsin Altaf Wani
%A S. M. K Quadri
%T Accelerated Parallel Generation of Binomial Coefficients using GPU
%J International Journal of Computer Applications
%@ 0975-8887
%V 71
%N 2
%P 11-13
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

GPUs (Graphics processing units) can be used for general purpose parallel computation. Developers can develop parallel programs running on GPUs using different computing architectures like CUDA or OpenCL. The Binomial Coefficient Generation is used to generate a table of binomial coefficients each entry in row n and column k of this table contains number of combinations of n objects taken k at a time. It is known that this problem can be solved by dynamic programming technique using O(nk)-time complexity algorithm where the table to be generated has n rows and k columns. The main contribution of this paper is to present a parallel implementation of this O(nk)-time algorithm on a GPU and to analyze the speed up possible when compared to a CPU based implementation.

References
  1. Cormen, T. H. , Lieserson, C. E. , Rivest, R. L. 1990 Introduction to Algorithms. MIT Press
  2. Neapolitan, R and Naimipour, K. 2003 Foundations of Algorithms using C++ pseudocode.
  3. Nvidia Corp. 2011 Nvidia CUDA programming guide version 4. 1.
  4. Nvidia Corp. 2011 CUDA C Best Practices Guide version 4. 1.
  5. W. W. Hwu. 2011 GPU Computing Gems Emerald Edition. Morgan Kaufmann,.
  6. D. Man, K. Uda, Y. Ito, and K. Nakano. Dec. 2011 "A GPU implementation of computing euclidean distance map with efficient memory access," in Proc. of International Conference on Networking and Computing, , pp. 68–76.
  7. A. Uchida, Y. Ito, and K. Nakano. Dec. 2011 "Fast and accurate template matching using pixel rearrangement on the GPU," in Proc. of International Conference on Networking and Computing , pp. 153–159.
  8. AMD. 2011 Introduction to OpenCL programming.
Index Terms

Computer Science
Information Sciences

Keywords

Dynamic programming Parallel algorithm GPU OpenCL.