CFP last date
20 February 2025
Reseach Article

You Only Look Once (YOLO): Object Detection Algorithm

Published on January 2025 by Akmal Anorbaev, Prerna Agarwal, Pranav Shrivastava
International Conference on Artificial Intelligence and Data Science Applications - 2023
Control System labs
ICAIDSC2023 - Number 3
January 2025
Authors: Akmal Anorbaev, Prerna Agarwal, Pranav Shrivastava
10.5120/icaidsc202419

Akmal Anorbaev, Prerna Agarwal, Pranav Shrivastava . You Only Look Once (YOLO): Object Detection Algorithm. International Conference on Artificial Intelligence and Data Science Applications - 2023. ICAIDSC2023, 3 (January 2025), 9-14. DOI=10.5120/icaidsc202419

@article{ 10.5120/icaidsc202419,
author = { Akmal Anorbaev, Prerna Agarwal, Pranav Shrivastava },
title = { You Only Look Once (YOLO): Object Detection Algorithm },
journal = { International Conference on Artificial Intelligence and Data Science Applications - 2023 },
issue_date = { January 2025 },
volume = { ICAIDSC2023 },
number = { 3 },
month = { January },
year = { 2025 },
issn = 0975-8887,
pages = { 9-14 },
numpages = 6,
url = { /proceedings/icaidsc2023/number3/you-only-look-once-yolo-object-detection-algorithm/ },
doi = { 10.5120/icaidsc202419 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Artificial Intelligence and Data Science Applications - 2023
%A Akmal Anorbaev
%A Prerna Agarwal
%A Pranav Shrivastava
%T You Only Look Once (YOLO): Object Detection Algorithm
%J International Conference on Artificial Intelligence and Data Science Applications - 2023
%@ 0975-8887
%V ICAIDSC2023
%N 3
%P 9-14
%D 2025
%I International Journal of Computer Applications
Abstract

In their groundbreaking research paper titled "You Only Look Once: Unified, Real-Time Object Detection,"[1] Joseph Redmon, Santosha Divvala, Ross Girshick, and Ali Farhadi introduced the innovative “YOLO (You Only Look Once)” algorithm. This algorithm revolutionizes real-time object detection by providing a unified and efficient methodology. The rapid advancements in computer vision have led to the emergence of real-time object detection as a pivotal challenge with applications ranging from surveillance to autonomous vehicles. This study delves deeply into the groundbreaking "You Only Look Once" (YOLO) algorithm, a cutting-edge approach in real-time object detection. YOLO has transformed the landscape of object detection by seamlessly incorporating object localization and classification within a single pass of a neural network.. This innovative method ensures outstanding efficiency while maintaining exceptional accuracy, marking a significant advancement in the field of computer vision. The central aim of this research endeavor is to comprehensively elucidate YOLO's architecture, methodology, and performance. The novel grid-based approach and holistic end-to-end detection process are highlighted. Through theoretical experiments on benchmark datasets and custom scenarios, YOLO's accuracy and processing speed are rigorously evaluated. The mean Average Precision (mAP) metric is employed to assess accuracy across various Intersection over Union (IoU) thresholds, showcasing YOLO's robustness in object identification. Additionally, high frames per second (FPS) figures underscore YOLO's real-time processing capabilities [2,3]. The paper discusses YOLO's strengths in efficiency, accuracy, and adaptability across different versions and variations. It also addresses potential limitations in detecting small objects, close-packed objects, and complex scenes. The implications of YOLO's performance are discussed, emphasizing its significance in applications like robotics, autonomous vehicles, and industrial automation. Looking ahead, future developments in fine-grained detection, 3D object detection, multi-modal fusion, and domain-specific customization are anticipated. The exceptional performance, efficiency, and adaptability of YOLO position it as a transformative force in the realm of real-time object detection, shaping the landscape of various industries and fostering innovation. This paper equips researchers, practitioners, and enthusiasts with a comprehensive understanding of YOLO, enabling them to harness its capabilities effectively and explore its potential for addressing complex challenges in computer vision and object detection.

References
  1. Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi. "You Only Look Once: Unified, Real-Time Object Detection", 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
  2. R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 580–587. IEEE, 2014.
  3. R. B. Girshick. Fast R-CNN. CoRR, abs/1504.08083, 2015.
  4. J. Yan, Z. Lei, L. Wen, and S. Z. Li. The fastest deformable part model for object detection. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2497–2504. IEEE, 2014
  5. K. He, X. Zhang, S. Ren, and J. Sun. Spatial pyramid pooling in deep convolutional networks for visual recognition. arXiv preprint arXiv:1406.4729, 2014.
  6. P. Jiang, D. Ergu, F. Liu, Y. Cai, B.Ma, “A Review of Yolo Algorithm Developments”, Procedia Computer Science, Volume 199,2022,Pages 1066-1073,ISSN 1877-0509, https://doi.org/10.1016/j.procs.2022.01.135.
  7. Y. Tian, G. Yang, Z. Wang, H. Wang, E. Li, Z. Liang, “Apple detection during different growth stages in orchards using the improved YOLO-V3 model”, Computers and Electronics in Agriculture, Volume 157, 2019, Pages 417-426, ISSN 0168-1699, https://doi.org/10.1016/j.compag.2019.01.012.
  8. R. Huang, J. Pedoeem and C. Chen, "YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers," 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 2018, pp. 2503-2510, doi: 10.1109/BigData.2018.8621865.
  9. W. Fang, L. Wang and P. Ren, "Tinier-YOLO: A Real-Time Object Detection Method for Constrained Environments," in IEEE Access, vol. 8, pp. 1935-1944, 2020, doi: 10.1109/ACCESS.2019.2961959.
  10. W. Lan, J. Dang, Y. Wang and S. Wang, "Pedestrian Detection Based on YOLO Network Model," 2018 IEEE International Conference on Mechatronics and Automation (ICMA), Changchun, China, 2018, pp. 1547-1551, doi: 10.1109/ICMA.2018.8484698.
  11. Diwan, T., Anirudh, G. & Tembhurne, J.V. Object detection using YOLO: challenges, architectural successors, datasets and applications. Multimed Tools Appl 82, 9243–9275 (2023). https://doi.org/10.1007/s11042-022-13644-y
  12. Juan Du 2018 J. Phys.: Conf. Ser. 1004 012029, DOI 10.1088/1742-6596/1004/1/012029
  13. Terven, J.; Córdova-Esparza, D.-M.; Romero-González, J.-A. A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS. Mach. Learn. Knowl. Extr. 2023, 5, 1680-1716. https://doi.org/10.3390/make5040083
  14. Liu, W., Ren, G., Yu, R., Guo, S., Zhu, J., & Zhang, L. (2022). Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 1792-1800. https://doi.org/10.1609/aaai.v36i2.20072
  15. Gai, R., Chen, N. & Yuan, H. A detection algorithm for cherry fruits based on the improved YOLO-v4 model. Neural Comput & Applic 35, 13895–13906 (2023). https://doi.org/10.1007/s00521-021-06029-z Z. Huang, J. Wang, X. Fu, T. Yu, Y. Guo, R. Wang, “DC-SPP-YOLO: Dense connection and spatial pyramid pooling based YOLO for object detection”, Information Sciences, Volume 522, 2020, Pages 241-258, ISSN 0020-0255, https://doi.org/10.1016/j.ins.2020.02.067.
Index Terms

Computer Science
Information Sciences

Keywords

YOLO object detection real-time convolutional neural network bounding boxes