Distilled GPT for source code summarization

CY Su, C McMillan - Automated Software Engineering, 2024 - Springer
A code summary is a brief natural language description of source code. Summaries are
usually only a single sentence long, and yet form the backbone of developer documentation …

Deep learning model reuse in the huggingface community: Challenges, benefit and trends

M Taraghi, G Dorcelus, A Foundjem… - … on Software Analysis …, 2024 - ieeexplore.ieee.org
The ubiquity of large-scale Pre-Trained Models (PTMs) is on the rise, sparking interest in
model hubs, and dedicated platforms for hosting PTMs. Despite this trend, a comprehensive …

Communicating study design trade-offs in software engineering

MP Robillard, DM Arya, NA Ernst, JLC Guo… - ACM Transactions on …, 2024 - dl.acm.org
Reflecting on the limitations of a study is a crucial part of the research process. In software
engineering studies, this reflection is typically conveyed through discussions of study …

What Do Developers Feel About Fast-Growing Programming Languages? An Exploratory Study

J Kumar, S Chimalakonda - Proceedings of the 32nd IEEE/ACM …, 2024 - dl.acm.org
The developer community has witnessed an unprecedented surge in recent years, with over
100 million active developers on the GitHub platform in 2023. Along with it, there is a …

Negativity in self-admitted technical debt: how sentiment influences prioritization

N Cassee, N Ernst, N Novielli, A Serebrenik - Empirical Software …, 2025 - Springer
Abstract Self-Admitted Technical Debt, or SATD, is a self-admission of technical debt present
in a software system. The presence of SATD in software systems negatively affects …

A bot identification model and tool based on GitHub activity sequences

N Chidambaram, A Decan, T Mens - Journal of Systems and Software, 2025 - Elsevier
Identifying whether GitHub contributors are automated bots is important for empirical
research on collaborative software development practices. Multiple such bot identification …

Semantic similarity loss for neural source code summarization

CY Su, C McMillan - Journal of Software: Evolution and …, 2024 - Wiley Online Library
This paper presents a procedure for and evaluation of using a semantic similarity metric as a
loss function for neural source code summarization. Code summarization is the task of …

[PDF][PDF] Distinguishing Bots from Human Developers Based on Their GitHub Activity Types.

N Chidambaram, A Decan, T Mens - SATToSE, 2023 - orbi.umons.ac.be
Abstract Development bots are being used by maintainers of GitHub repositories to perform
repetitive or error-prone tasks. While multiple approaches have been proposed in the past to …