Asset management in machine learning: State-of-research and state-of-practice

S Idowu, D Strüber, T Berger - ACM Computing Surveys, 2022 - dl.acm.org
Machine learning components are essential for today's software systems, causing a need to
adapt traditional software engineering practices when develo** machine-learning-based …

An empirical study of code smells in transformer-based code generation techniques

ML Siddiq, SH Majumder, MR Mim… - 2022 IEEE 22nd …, 2022 - ieeexplore.ieee.org
Prior works have developed transformer-based language learning models to automatically
generate source code for a task without compilation errors. The datasets used to train these …

Machine learning model development from a software engineering perspective: A systematic literature review

G Lorenzoni, P Alencar, N Nascimento… - arxiv preprint arxiv …, 2021 - arxiv.org
Data scientists often develop machine learning models to solve a variety of problems in the
industry and academy but not without facing several challenges in terms of Model …

A large-scale comparison of Python code in Jupyter notebooks and scripts

K Grotov, S Titov, V Sotnikov, Y Golubev… - Proceedings of the 19th …, 2022 - dl.acm.org
In recent years, Jupyter notebooks have grown in popularity in several domains of software
engineering, such as data science, machine learning, and computer science education …

The prevalence of code smells in machine learning projects

B Van Oort, L Cruz, M Aniche… - 2021 IEEE/ACM 1st …, 2021 - ieeexplore.ieee.org
Artificial Intelligence (AI) and Machine Learning (ML) are pervasive in the current computer
science landscape. Yet, there still exists a lack of software engineering experience and best …

Code smells for machine learning applications

H Zhang, L Cruz, A Van Deursen - … of the 1st international conference on …, 2022 - dl.acm.org
The popularity of machine learning has wildly expanded in recent years. Machine learning
techniques have been heatedly studied in academia and applied in the industry to create …

A Large-Scale Study of Model Integration in ML-Enabled Software Systems

Y Sens, H Knopp, S Peldszus, T Berger - arxiv preprint arxiv:2408.06226, 2024 - arxiv.org
The rise of machine learning (ML) and its embedding in systems has drastically changed the
engineering of software-intensive systems. Traditionally, software engineering focuses on …

Comparative analysis of real issues in open-source machine learning projects

TD Lai, A Simmons, S Barnett, JG Schneider… - Empirical Software …, 2024 - Springer
Context In the last decade of data-driven decision-making, Machine Learning (ML) systems
reign supreme. Because of the different characteristics between ML and traditional Software …

Lint-based warnings in python code: Frequency, awareness and refactoring

N Oliveira, M Ribeiro, R Bonifácio… - 2022 IEEE 22nd …, 2022 - ieeexplore.ieee.org
Python is a popular programming language characterized by its simple syntax and easy
learning curve. Like many languages, Python has a set of best practices that should be …

A large-scale study of ml-related python projects

S Idowu, Y Sens, T Berger, J Krüger… - Proceedings of the 39th …, 2024 - dl.acm.org
The rise of machine learning (ML) for solving current and future problems increased the
production of ML-enabled software systems. Unfortunately, standardized tool chains for …