CFP last date
20 December 2024
Reseach Article

Article:OpenMP Optimization and its Translation to OpenGL

by Santosh Kumar, Dr. V.M.Wadhai, Prasad S.Halgaonkar, Kiran P.Gaikwad
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 8 - Number 5
Year of Publication: 2010
Authors: Santosh Kumar, Dr. V.M.Wadhai, Prasad S.Halgaonkar, Kiran P.Gaikwad
10.5120/1209-1732

Santosh Kumar, Dr. V.M.Wadhai, Prasad S.Halgaonkar, Kiran P.Gaikwad . Article:OpenMP Optimization and its Translation to OpenGL. International Journal of Computer Applications. 8, 5 ( October 2010), 5-9. DOI=10.5120/1209-1732

@article{ 10.5120/1209-1732,
author = { Santosh Kumar, Dr. V.M.Wadhai, Prasad S.Halgaonkar, Kiran P.Gaikwad },
title = { Article:OpenMP Optimization and its Translation to OpenGL },
journal = { International Journal of Computer Applications },
issue_date = { October 2010 },
volume = { 8 },
number = { 5 },
month = { October },
year = { 2010 },
issn = { 0975-8887 },
pages = { 5-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume8/number5/1209-1732/ },
doi = { 10.5120/1209-1732 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:56:40.359520+05:30
%A Santosh Kumar
%A Dr. V.M.Wadhai
%A Prasad S.Halgaonkar
%A Kiran P.Gaikwad
%T Article:OpenMP Optimization and its Translation to OpenGL
%J International Journal of Computer Applications
%@ 0975-8887
%V 8
%N 5
%P 5-9
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

For general purpose high-performance computing, recently GPGPUs have emerged as powerful vehicles. Programming GPGPUs is complex when compared to programming general purpose CPUs and parallel programming models such as OpenMP. Goal of our translation is to improve programmability and make existing OpenMP applications to be able to execute on GPGPUs. OpenMP has established itself as an important method and language extension for programming shared-memory parallel computers. Our translator works well on regular applications, leading to performance improvements of up to 50X over the un-optimized translation.

References
  1. OpenMP [online]. available: http://openmp.org/wp/
  2. NVIDIA CUDA [online]. available: http://developer.nvidia.com/object/cuda home.html
  3. S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded GPU. International Symposium on Code Generation and Optimization (CGO), 2008.
  4. S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, andW.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 73–82, 2008.
  5. M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. ACM International Conference on Supercomputing (ICS), 2008.
  6. Seung-Jai Min, Ayon Basumallik, and Rudolf Eigenmann. Optimizing OpenMP programs on software distributed shared memory systems. International Journel of Parallel Programming (IJPP), 31:225–249, June 2003.
  7. Seung-Jai Min and Rudolf Eigenmann. Optimizing irregular sharedmemory applications for clusters. ACM International Conference on Supercomputing (ICS), pages 256–265, 2008.
  8. Ayon Basumallik and Rudolf Eigenmann. Towards automatic translation of OpenMP to MPI. ACM International Conference on Supercomputing (ICS), pages 189–198, 2005.
  9. K. O’Brien, K. O’Brien, Z. Sura, T. Chen, and T. Zhang. Supporting OpenMP on Cell. International Journel of Parallel Programming (IJPP), 36(3):289–311, June 2008.
  10. Haitao Wei and Junqing Yu. Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip. International Workshop on OpenMP (IWOMP), 2007.
  11. J. A. Stratton, S. S. Stone, and W. W. Hwu. MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008.
  12. Narayanan Sundaram, Anand Raghunathan, and Srimat T. Chakradhar. A framework for efficient and scalable execution of domainspecific templates on GPUs. IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 2009.
Index Terms

Computer Science
Information Sciences

Keywords

OpenMP GPU Brook+ Automatic translation