CFP last date
20 March 2025
Reseach Article

Managing Machine Learning Complexity with Advanced Version Control Techniques

by Koushik Balaji Venkatesan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 65
Year of Publication: 2025
Authors: Koushik Balaji Venkatesan
10.5120/ijca2025924421

Koushik Balaji Venkatesan . Managing Machine Learning Complexity with Advanced Version Control Techniques. International Journal of Computer Applications. 186, 65 ( Feb 2025), 19-26. DOI=10.5120/ijca2025924421

@article{ 10.5120/ijca2025924421,
author = { Koushik Balaji Venkatesan },
title = { Managing Machine Learning Complexity with Advanced Version Control Techniques },
journal = { International Journal of Computer Applications },
issue_date = { Feb 2025 },
volume = { 186 },
number = { 65 },
month = { Feb },
year = { 2025 },
issn = { 0975-8887 },
pages = { 19-26 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number65/managing-machine-learning-complexity-with-advanced-version-control-techniques/ },
doi = { 10.5120/ijca2025924421 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-02-03T23:25:44.048514+05:30
%A Koushik Balaji Venkatesan
%T Managing Machine Learning Complexity with Advanced Version Control Techniques
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 65
%P 19-26
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Managing the complexity of machine learning workflows is a significant challenge, as these projects often involve not just code but also large datasets, model maintenance, and extensive experimentation. While traditional version control tools like Git are effective for software development, they do not fully accommodate the unique requirements of ML workflows, such as tracking multiple dataset versions, managing evolving models, and maintaining experiment histories. Specific utilities and frameworks have been developed to address these challenges, and this paper explores some of these available tools in detail. Incorporating structured workflows and best practices for managing artifacts helps ML practitioners improve reproducibility, scalability, and collaboration across teams. Furthermore, these tools can be leveraged as part of an end-to-end ML pipeline combined with CI/CD practices to facilitate tasks such as data preprocessing, model training, and deployment solutions. Through a hands-on case study of a retail recommendation system, this paper demonstrates how these techniques effectively tackle real-world challenges, including handling dynamic datasets, optimizing iterative experimentation, and maintaining model integrity. Finally, the paper explores emerging trends such as automation and sustainability in ML workflows, highlighting how integrating these strategies can enhance scalability and enable teams to build more efficient and production-ready ML systems.

References
  1. D. Sculley, G. Holt, D. Golovin, E. Davydov, T. Phillips, D. Ebner, V. Chaudhary, M. Young, and J.-F. Crespo, "Hidden technical debt in machine learning systems," in *Proc. 28th Int. Conf. Neural Inf. Process. Syst. (NeurIPS)*, 2015, pp. 2503–2511.
  2. DVC Documentation, "Data Version Control (DVC): Git for Data & Models," Available: https://dvc.org/, Accessed: Jan. 2024.
  3. MLflow Documentation, "MLflow: Open-source platform for machine learning lifecycle," Available: https://mlflow.org/, Accessed: Jan. 2024.
  4. Weights & Biases Documentation, "Experiment tracking for machine learning teams," Available: https://wandb.ai/, Accessed: Jan. 2024.
  5. M. Zaharia, A. Chen, A. Davidson, A. Ghodsi, M. Hong, and A. Konwinski, "Accelerating the machine learning lifecycle with MLflow," in *Proc. Conf. Mach. Learn. Syst. (MLSys)*, 2016.
  6. D. Baylor, E. Breck, H. Cheng, N. Fiedel, and N. Polyzotis, "TFX: A TensorFlow-based production-scale machine learning platform," in *Proc. 23rd ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining (KDD)*, 2017.
  7. J. F. Pimentel, L. Murta, V. Braganholo, and J. Freire, "A large-scale study about quality and reproducibility of Jupyter notebooks," in *Proc. 16th Int. Conf. Mining Softw. Repositories (MSR)*, 2019.
  8. GitHub Documentation, "Git: Distributed version control system," Available: https://github.com/, Accessed: Jan. 2024.
  9. TensorFlow Model Garden, "Pre-trained models for reproducibility," Available: https://www.tensorflow.org/lite/models/, Accessed: Jan. 2024.
  10. E. Muškardin and T. Burgstaller, "Active model learning of Git version control system," in *Proc. IEEE Int. Conf. Softw. Test. Verification, and Validation Workshops*, 2024.
  11. M. Zaharia, A. Chen, et al., "Data pipeline in MapReduce," in *Proc. IEEE 9th Int. Conf. e-Science*, 2013.
  12. Kubeflow Documentation, "Machine learning toolkit for Kubernetes," Available: https://www.kubeflow.org/, Accessed: Jan. 2024.
  13. J. F. Pimentel, et al., "Version control challenges in ML," *Int. J. Softw. Eng.*, 2019.
Index Terms

Computer Science
Information Sciences
Algorithms
Machine Learning
Software Engineering
Data Management
Reproducibility

Keywords

Version Control Machine Learning Engineering Data Version Control (DVC) Experiment Tracking Model Reproducibility