CFP last date
20 May 2025
Reseach Article

Integration of Software Engineering Principles in Machine Learning Pipeline Development

by Sambedana Lenka, Suryasmita Sahoo, Rajesh Sahoo
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 78
Year of Publication: 2025
Authors: Sambedana Lenka, Suryasmita Sahoo, Rajesh Sahoo
10.5120/ijca2025924249

Sambedana Lenka, Suryasmita Sahoo, Rajesh Sahoo . Integration of Software Engineering Principles in Machine Learning Pipeline Development. International Journal of Computer Applications. 186, 78 ( Apr 2025), 16-20. DOI=10.5120/ijca2025924249

@article{ 10.5120/ijca2025924249,
author = { Sambedana Lenka, Suryasmita Sahoo, Rajesh Sahoo },
title = { Integration of Software Engineering Principles in Machine Learning Pipeline Development },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2025 },
volume = { 186 },
number = { 78 },
month = { Apr },
year = { 2025 },
issn = { 0975-8887 },
pages = { 16-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number78/integration-of-software-engineering-principles-in-machine-learning-pipeline-development/ },
doi = { 10.5120/ijca2025924249 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-04-26T02:19:17.142464+05:30
%A Sambedana Lenka
%A Suryasmita Sahoo
%A Rajesh Sahoo
%T Integration of Software Engineering Principles in Machine Learning Pipeline Development
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 78
%P 16-20
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Although machine learning (ML) has transformed many sectors, issues with scalability, robustness, and maintainability are frequently encountered during deployment and upkeep. To ensure that AI systems are durable, scalable, and maintainable, software engineering concepts must be incorporated into the creation of machine learning pipelines. In the context of developing machine learning pipelines, this study examines many software engineering techniques, including version control, modular design, testing methodologies, and continuous integration/continuous deployment (CI/CD).

References
  1. Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E., & Nagappan, N. (2019). "Software Engineering for Machine Learning: A Case Study." International Conference on Software Engineering (ICSE), 291-300.
  2. Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., & Dennison, D. (2015). "Hidden Technical Debt in Machine Learning Systems." Advances in Neural Information Processing Systems, 28, 2503-2511.
  3. Zhang, H., & Chen, Y. (2020). "SEML: A Framework for Software Engineering in Machine Learning Pipeline Development." Journal of Systems and Software, 165, 110576.
  4. Bosch, J., Olsson, H. H., Björk, J., &Ljungblad, J. (2021). "Software Engineering Challenges for Machine Learning Systems." IEEE Software, 38(6), 36-45.
  5. Breck, E., Cai, S., Nielsen, E., Salib, M., & Sculley, D. (2017). "The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction." IEEE International Conference on Big Data, 1123-1132.
  6. Nourani, M., & Babbar, D. (2020). "Incorporating Software Engineering Principles in Machine Learning Projects." ACM Transactions on Software Engineering and Methodology, 29(4), 27.
  7. Amershi, S., & Nagappan, N. (2019). "Software Engineering Best Practices in Machine Learning." IEEE Software, 36(1), 92-100.
  8. Zhang, X., & Zhu, Y. (2021). "Towards a Unified Software Engineering Framework for Machine Learning." Journal of Software: Evolution and Process, 33(6), e2345.
  9. Hynes, N., & Cho, H. (2020). "Automating the End-to-End Lifecycle of Machine Learning Models with Software Engineering Practices." Proceedings of the 2020 Conference on Machine Learning Systems, 112-125.
  10. Zou, D., & Wei, Y. (2021). "An Agile Approach to Integrating Machine Learning Models in Software Engineering." International Journal of Agile Systems and Management, 14(3), 201-216.
  11. Sato, T., & Toyama, S. (2021). "Machine Learning Operations: Integrating DevOps Principles into Machine Learning Projects." Journal of Information Processing, 29, 123-134.
  12. Breck, E., &Polyzotis, N. (2019). "Data Management Challenges in Production Machine Learning Systems." Proceedings of the VLDB Endowment, 12(12), 2126-2138.
  13. Rausch, T., &Dustdar, S. (2020). "Engineering Production-Ready Machine Learning Pipelines." IEEE Internet Computing, 24(2), 23-31.
  14. Chen, Z., & Kwon, S. (2021). "Continuous Integration and Deployment for Machine Learning Pipelines." ACM Transactions on Software Engineering and Methodology, 30(2), 11.
  15. Lwakatare, L. E., & Kuvaja, P. (2020). "DevOps for AI: MLOps in Practice." IEEE Software, 37(5), 97-103.
  16. Arpteg, A., &Brinne, B. (2020). "Software Engineering for AI-Enabled Systems: A Case Study." Journal of Software Engineering Research and Development, 8(1), 5.
  17. Van der Aalst, W. M. P., & Carmona, J. (2021). "Challenges in Integrating Process Mining and Machine Learning." International Journal of Software and Informatics, 15(1), 1-16.
  18. Krejca, M. S., & Ribeiro, B. (2020). "Machine Learning in Software Engineering: A Study of Practitioners' Perspectives." Empirical Software Engineering, 25, 138-162.
  19. Yazdani, A., &Zowghi, D. (2021). "Adopting Software Engineering Practices in Machine Learning Projects: A Systematic Literature Review." Information and Software Technology, 132, 106485.
  20. Menzies, T., & Zimmermann, T. (2020). "Software Engineering for AI-Based Systems: A Survey of Practices and Challenges." IEEE Transactions on Software Engineering, 47(1), 55-70.
Index Terms

Computer Science
Information Sciences

Keywords

Machine learning software engineering pipeline development version control modular design continuous integration continuous deployment