What do users ask in open-source AI repositories? An empirical study of GitHub issues

Z Yang, C Wang, J Shi, T Hoang… - 2023 IEEE/ACM 20th …, 2023 - ieeexplore.ieee.org
Artificial Intelligence (AI) systems, which benefit from the availability of large-scale datasets
and increasing computational power, have become effective solutions to various critical …

[HTML][HTML] Enterprise architecture-based metamodel for machine learning projects and its management

H Takeuchi, JH Husen, HT Tun, H Washizaki… - Future generation …, 2024 - Elsevier
In this study, we consider projects for develo** service systems using machine learning
(ML) techniques. These projects involve collaboration between various stakeholders …

Peatmoss: A dataset and initial analysis of pre-trained models in open-source software

W Jiang, J Yasmin, J Jones, N Synovic… - 2024 IEEE/ACM 21st …, 2024 - ieeexplore.ieee.org
The development and training of deep learning models have become increasingly costly
and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for …

Exploring Hyperparameter Usage and Tuning in Machine Learning Research

S Simon, N Kolyada, C Akiki, M Potthast… - 2023 IEEE/ACM 2nd …, 2023 - ieeexplore.ieee.org
The success of machine learning (ML) models depends on careful experimentation and
optimization of their hyperparameters. Tuning can affect the reliability and accuracy of a …

Prevalence of code smells in reinforcement learning projects

N Cardozo, I Dusparic… - 2023 IEEE/ACM 2nd …, 2023 - ieeexplore.ieee.org
Reinforcement Learning (RL) is being increasingly used to learn and adapt application
behavior in many domains, including large-scale and safety critical systems, as for example …

Lint-based warnings in python code: Frequency, awareness and refactoring

N Oliveira, M Ribeiro, R Bonifácio… - 2022 IEEE 22nd …, 2022 - ieeexplore.ieee.org
Python is a popular programming language characterized by its simple syntax and easy
learning curve. Like many languages, Python has a set of best practices that should be …

Do Code Quality and Style Issues Differ Across (Non-) Machine Learning Notebooks? Yes!

MS Siddik, CP Bezemer - 2023 IEEE 23rd International Working …, 2023 - ieeexplore.ieee.org
The popularity of computational notebooks is rapidly increasing because of their interactive
code-output visualization and on-demand non-sequential code block execution. These …

Unboxing default argument breaking changes in Scikit Learn

JE Montandon, LL Silva, C Politowski… - 2023 IEEE 23rd …, 2023 - ieeexplore.ieee.org
Machine Learning (ML) has revolutionized the field of computer software development,
enabling data-based predictions and decision-making across several domains. Following …

Automated detection of inter-language design smells in multi-language deep learning frameworks

Z Li, X Zhang, W Wang, P Liang, R Mo, J Tan… - Information and Software …, 2025 - Elsevier
Context: Nowadays, most deep learning frameworks (DLFs) use multilingual programming
of Python and C/C++, facilitating the flexibility and performance of the DLF. However …

Contract-based Validation of Conceptual Design Bugs for Engineering Complex Machine Learning Software

W Meijer - Proceedings of the ACM/IEEE 27th International …, 2024 - dl.acm.org
Context. Modern software systems increasingly commonly contain one or multiple machine
learning (ML) components. Current development practices are generally on a trial-and-error …