CFP last date
20 January 2025
Reseach Article

Evaluating Embedded GPUs Performance via Computer Vision Applications

by Paulo S. S. De Souza, Arthur F. Lorenzon, Marcelo C. Luizelli, Fabio D. Rossi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 176 - Number 36
Year of Publication: 2020
Authors: Paulo S. S. De Souza, Arthur F. Lorenzon, Marcelo C. Luizelli, Fabio D. Rossi
10.5120/ijca2020920518

Paulo S. S. De Souza, Arthur F. Lorenzon, Marcelo C. Luizelli, Fabio D. Rossi . Evaluating Embedded GPUs Performance via Computer Vision Applications. International Journal of Computer Applications. 176, 36 ( Jul 2020), 7-11. DOI=10.5120/ijca2020920518

@article{ 10.5120/ijca2020920518,
author = { Paulo S. S. De Souza, Arthur F. Lorenzon, Marcelo C. Luizelli, Fabio D. Rossi },
title = { Evaluating Embedded GPUs Performance via Computer Vision Applications },
journal = { International Journal of Computer Applications },
issue_date = { Jul 2020 },
volume = { 176 },
number = { 36 },
month = { Jul },
year = { 2020 },
issn = { 0975-8887 },
pages = { 7-11 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume176/number36/31433-2020920518/ },
doi = { 10.5120/ijca2020920518 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:44:19.664000+05:30
%A Paulo S. S. De Souza
%A Arthur F. Lorenzon
%A Marcelo C. Luizelli
%A Fabio D. Rossi
%T Evaluating Embedded GPUs Performance via Computer Vision Applications
%J International Journal of Computer Applications
%@ 0975-8887
%V 176
%N 36
%P 7-11
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Computer vision applications usually present significant demand for computing resources, which limit its usage on embedded systems, since such devices typically have limited processing capacity. In this sense, hybrid embedded architectures are becoming more popular by offering higher levels of parallelism through Graphics Processing Units (GPUs). Despite the similarities with generalpurpose architectures that already exploit the benefits of GPUs, this new kind of embedded devices presents some architectural singularities, such as differences in memory access bandwidth. In this paper, we present an evaluation, considering how much these differences affect GPUs’ gains in the context of embedded systems. The results show that, despite the architectural limitations, using such devices can lead to a speed-up of 8 times compared to traditional embedded systems processing data only on CPUs.

References
  1. Nicola Bombieri, Sara Vinco, Valeria Bertacco, and Debapriya Chatterjee. Systemc simulation on gp-gpus: Cuda vs. opencl. In Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, pages 343–352. ACM, 2012.
  2. Jiaxing Che, Chunjiang Zhao, Yunhe Zhang, Cheng Wang, Xiaojun Qiao, and Xinlu Zhang. Plant stem diameter measuring device based on computer vision and embedded system. In World Automation Congress (WAC), 2010, pages 51–55. IEEE, 2010.
  3. Djamila Dekkiche, Bastien Vincke, and Alain Merigot. Investigation and performance analysis of openvx optimizations on computer vision applications. In Control, Automation, Robotics and Vision (ICARCV), 2016 14th International Conference on, pages 1–6. IEEE, 2016.
  4. Djamila Dekkiche, Bastien Vincke, and Alain Merigot. Targeting system-level and kernel-level optimizations of computer vision applications on embedded systems. Journal of Low Power Electronics, 13(4):607–615, 2017.
  5. Mark Harris. Many-core GPU computing with nvidia cuda. In Proceedings of the 22Nd Annual International Conference on Supercomputing, ICS ’08, pages 1–1, New York, NY, USA, 2008. ACM.
  6. Dominik Honegger, Helen Oleynikova, and Marc Pollefeys. Real-time and low latency embedded computer vision hardware based on a combination of fpga and mobile cpu. In Intelligent Robots and Systems (IROS 2014), 2014 IEEE/RSJ International Conference on, pages 4930–4935. IEEE, 2014.
  7. Andr´es Felipe Hurtado, Jairo Alejandro G´omez, V´ictor Manuel Pe˜ne˜nory, Iv´an Mauricio Cabezas, and Felipe Elvira Garc´ia. Proposal of a computer vision system to detect and track vehicles in real time using an embedded platform enabled with a graphical processing unit. In Mechatronics, Electronics and Automotive Engineering (ICMEAE), 2015 International Conference on, pages 76–80. IEEE, 2015.
  8. Faria Kalim, Shadi A. Noghabi, and Shiv Verma. To edge or not to edge? In Proceedings of the 2017 Symposium on Cloud Computing, SoCC ’17, pages 629–629, New York, NY, USA, 2017. ACM.
  9. D. Kumar and M. A. Qadeer. Fast heterogeneous computing with cuda compatible tesla gpu computing processor (personal supercomputing). In Proceedings of the International Conference and Workshop on Emerging Trends in Technology, ICWET 10, page 925930, New York, NY, USA, 2010. Association for Computing Machinery.
  10. Vijay P. Kumar and Anshul Gupta. Analyzing scalability of parallel algorithms and architectures. Journal of parallel and distributed computing, 22(3):379–391, 1994.
  11. Rui Li, Qiming Hou, and Kun Zhou. Efficient gpu path rendering using scanline rasterization. ACM Trans. Graph., 35(6):228:1–228:12, November 2016.
  12. Jacobo Lobeiras, Margarita Amor, and Ramon Doallo. Designing efficient index-digit algorithms for cuda gpu architectures. IEEE Transactions on Parallel and Distributed Systems, 27(5):1331–1343, 2016.
  13. Hongying Meng, Nick Pears, and Chris Bailey. A human action recognition system for embedded computer vision application. In Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on, pages 1–6. IEEE, 2007.
  14. Nicholas Pauly and Nader I Rafla. An automated embedded computer vision system for object measurement. In Circuits and Systems (MWSCAS), 2013 IEEE 56th International Midwest Symposium on, pages 1108–1111. IEEE, 2013.
  15. SV Viraktamath, Mukund Katti, Aditya Khatawkar, and Pavan Kulkarni. Face detection and tracking using opencv. The SIJ Transactions on Computer Networks & Communication Engineering (CNCE), 1(3):2321–2403, 2013.
  16. Sunil Kumar Vishwakarma, Divakar Singh Yadav, et al. Analysis of lane detection techniques using opencv. In India Conference (INDICON), 2015 Annual IEEE, pages 1–4. IEEE, 2015.
  17. Rui Wang, Xianjin Yang, Yazhen Yuan, Wei Chen, Kavita Bala, and Hujun Bao. Automatic shader simplification using surface signal approximation. ACM Trans. Graph., 33(6):226:1–226:11, November 2014.
Index Terms

Computer Science
Information Sciences

Keywords

Embedded Systems GPGPU Computer Vision Performance Analysis