Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms

Balaji Thadagam Kandavel

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 186 - Number 55

Year of Publication: 2024

Authors: Balaji Thadagam Kandavel

10.5120/ijca2024924270

Balaji Thadagam Kandavel. Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms. International Journal of Computer Applications. 186, 55 (Dec 2024), 14-19. DOI=10.5120/ijca2024924270

@article{ 10.5120/ijca2024924270,
author = { Balaji Thadagam Kandavel },
title = { Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2024 },
volume = { 186 },
number = { 55 },
month = { Dec },
year = { 2024 },
issn = { 0975-8887 },
pages = { 14-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number55/serverless-machine-learning-framework-for-efficient-training-and-deployment-of-models-across-multiple-cloud-platforms/ },
doi = { 10.5120/ijca2024924270 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}

%0 Journal Article

%1 2024-12-27T02:45:45.129970+05:30

%A Balaji Thadagam Kandavel

%T Serverless Machine Learning Framework for Efficient Training and Deployment of Models Across Multiple Cloud Platforms

%J International Journal of Computer Applications

%@ 0975-8887

%V 186

%N 55

%P 14-19

%D 2024

%I Foundation of Computer Science (FCS), NY, USA

Abstract

The rise of serverless computing has transformed how applications, including machine learning (ML) workloads, are deployed and scaled. Traditional cloud-based ML systems often incur high costs, are complex to scale, and require substantial infrastructure management. Serverless computing offers a simpler alternative that abstracts the underlying infrastructure and reduces operational overhead. This paper proposes a serverless ML framework that enables efficient training and deployment of models across multiple cloud platforms through services such as AWS Lambda, Google Cloud Functions, and Azure Functions. The framework dynamically allocates compute resources according to workload, which significantly reduces the time and cost of both training and inference. We implemented the framework using Kubernetes for container orchestration and applied it to several ML tasks, including image classification and natural language processing. The results show up to 45% cost savings and a 50% reduction in deployment time compared with traditional cloud setups. We conclude that a serverless ML framework provides a scalable, cost-effective, and reliable solution for ML operations while simplifying infrastructure management across cloud platforms.
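
One common way to target several serverless platforms from a single code base is to keep the inference logic platform-neutral and wrap it in thin per-platform adapters. The Python sketch below illustrates that idea only; it is not the implementation described in the paper. The names predict, run_inference, and gcf_handler are hypothetical, and predict is a stand-in that averages its inputs rather than a trained model. The AWS adapter assumes an API Gateway proxy event with a JSON string under "body", and the Google Cloud Functions adapter assumes an HTTP trigger that passes a Flask request object. An Azure Functions adapter would follow the same pattern but is omitted because it needs the azure.functions package.

```python
import json
from typing import Any, Dict, List, Tuple


def predict(features: List[float]) -> Dict[str, Any]:
    """Hypothetical stand-in for a trained model: averages the inputs."""
    if not features:
        raise ValueError("features must be a non-empty list")
    return {"score": sum(features) / len(features)}


def run_inference(payload: Any) -> Tuple[int, Dict[str, Any]]:
    """Platform-neutral core: returns (HTTP status, JSON-serializable body)."""
    try:
        features = [float(x) for x in payload["features"]]
        return 200, predict(features)
    except (KeyError, TypeError, ValueError) as exc:
        # Missing key, wrong payload type, or non-numeric input.
        return 400, {"error": str(exc)}


def lambda_handler(event: Dict[str, Any], context: Any) -> Dict[str, Any]:
    """Adapter for AWS Lambda, assuming an API Gateway proxy event."""
    try:
        payload = json.loads(event.get("body") or "{}")
    except json.JSONDecodeError:
        payload = {}  # run_inference will report the missing field
    status, body = run_inference(payload)
    return {
        "statusCode": status,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(body),
    }


def gcf_handler(request: Any) -> Tuple[str, int, Dict[str, str]]:
    """Adapter for an HTTP-triggered Google Cloud Function (Flask request)."""
    payload = request.get_json(silent=True) or {}
    status, body = run_inference(payload)
    return json.dumps(body), status, {"Content-Type": "application/json"}
```

Because each adapter only translates the request and response shapes, the same run_inference function can be packaged once (for example in a container image) and exposed through whichever platform entry point is needed.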

Index Terms

Computer Science

Information Sciences

Keywords

Serverless Computing, Machine Learning, Cloud Platforms, Deployment Efficiency, Multi-cloud Architecture