CFP last date
20 March 2025
Reseach Article

Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms

by Balaji Thadagam Kandavel
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 55
Year of Publication: 2024
Authors: Balaji Thadagam Kandavel
10.5120/ijca2024924270

Balaji Thadagam Kandavel . Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms. International Journal of Computer Applications. 186, 55 ( Dec 2024), 14-19. DOI=10.5120/ijca2024924270

@article{ 10.5120/ijca2024924270,
author = { Balaji Thadagam Kandavel },
title = { Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2024 },
volume = { 186 },
number = { 55 },
month = { Dec },
year = { 2024 },
issn = { 0975-8887 },
pages = { 14-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number55/serverless-machine-learning-framework-for-efficient-training-and-deployment-of-models-across-multiple-cloud-platforms/ },
doi = { 10.5120/ijca2024924270 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-12-27T02:45:45.129970+05:30
%A Balaji Thadagam Kandavel
%T Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 55
%P 14-19
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The rise of serverless computing has revolutionized the deployment and scaling of applications, including machine learning (ML). Traditional cloud-based ML systems often incur high costs, complexity in scaling, and infrastructure management. Serverless computing offers a simplified alternative, abstracting the underlying infrastructure to reduce operational overhead. This paper proposes a serverless machine learning framework that enables efficient training and deployment of ML models across multiple cloud platforms such as AWS Lambda, Google Cloud Functions, and Azure Functions. The framework optimizes the allocation of compute resources dynamically based on workload, significantly reducing both time and cost for training and inference processes. We implemented the framework using Kubernetes for container orchestration, and applied it to various machine learning tasks, including image classification and natural language processing. Results demonstrate up to 45% cost savings and a 50% reduction in deployment time compared to traditional cloud setups. We conclude that a serverless ML framework provides scalable, cost-effective, and reliable solutions for ML operations while simplifying infrastructure management across cloud platforms.

References
  1. A. Muhammad, A. Aseere, H. Chiroma, H. Shah, A. Y. Gital, and I. A. Hashem, "Deep learning application in smart cities: recent development, taxonomy, challenges and research prospects," Neural Computing and Applications, vol. 33, pp. 2973-3009, 2020.
  2. M. A. Wani, M. Kantardzic, and M. Sayed-Mouchaweh, Deep Learning Applications. Springer, 2020.
  3. M. G. Murshed, C. Murphy, D. Hou, N. Khan, G. Ananthanarayanan, and F. Hussain, "Machine learning at the network edge: A survey," ArXiv, abs/1908.00080, 2019.
  4. Y. Cheng, D. Wang, P. Zhou, and T. Zhang, "A survey of model compression and acceleration for deep neural networks," ArXiv, abs/1710.09282, 2017.
  5. S. Ramos, S. Gehrig, P. Pinggera, U. Franke, and C. Rother, "Detecting unexpected obstacles for self-driving cars: fusing deep learning and geometric modeling," in IEEE Intelligent Vehicles Symposium (IV), 2017.
  6. H. Su, Y. Zhang, J. Li, and J. Hu, "The shopping assistant robot design based on ROS and deep learning," in 2016 2nd International Conference on Cloud Computing and Internet of Things (CCIOT), Dalian, China, 2016, pp. 173-176.
  7. B. Tang, Z. Chen, G. Hefferman, S. Pei, T. Wei, H. He, and Q. Yang, "Incorporating intelligence in fog computing for big data analysis in smart cities," IEEE Transactions on Industrial Informatics, 2017.
  8. M. Liu, J. Niu, and X. Wang, "An autopilot system based on ROS distributed architecture and deep learning," in IEEE 15th International Conference on Industrial Informatics (INDIN), Emden, 2017, pp. 1229-1234.
  9. C. C. Hsu, M. Y. Wang, H. C. H. Shen, R. H. Chiang, and C. H. P. Wen, "FallCare+: An IoT surveillance system for fall detection," in 2017 International Conference on Applied System Innovation (ICASI), Sapporo, Japan, 2017, pp. 921-922.
  10. Y. Chang, P. Chung, and H. Lin, "Deep learning for object identification in ROS-based mobile robots," in IEEE International Conference on Applied System Invention (ICASI), Chiba, 2018, pp. 66-69.
  11. X. Zhang, X. Zhou, M. Lin, and J. Sun, "ShuffleNet: An extremely efficient convolutional neural network for mobile devices," in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6848-6856.
  12. Q. Zhang, M. Zhang, T. Chen, Z. Sun, Y. Ma, and B. Yu, "Recent advances in convolutional neural network acceleration," Neurocomputing, vol. 323, pp. 37-51, 2019.
  13. M. Tan and Q. V. Le, "EfficientNet: Rethinking model scaling for convolutional neural networks," ArXiv, abs/1905.11946, 2019.
Index Terms

Computer Science
Information Sciences

Keywords

Serverless Computing Machine Learning Cloud Platforms Deployment Efficiency Multi-cloud Architecture