Prompt-tuned code language model as a neural knowledge base for type inference in statically-typed partial code

Q Huang, Z Yuan, Z **ng, X Xu, L Zhu… - Proceedings of the 37th …, 2022 - dl.acm.org
Partial code usually involves non-fully-qualified type names (non-FQNs) and undeclared
receiving objects. Resolving the FQNs of these non-FQN types and undeclared receiving …

Toxic code snippets on stack overflow

C Ragkhitwetsagul, J Krinke, M Paixao… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Online code clones are code fragments that are copied from software projects or online
sources to Stack Overflow as examples. Due to an absence of a checking mechanism after …

Predicting the objective and priority of issue reports in software repositories

M Izadi, K Akbari, A Heydarnoori - Empirical Software Engineering, 2022 - Springer
Software repositories such as GitHub host a large number of software entities. Developers
collaboratively discuss, implement, use, and share these entities. Proper documentation …

GitHub Discussions: An exploratory study of early adoption

H Hata, N Novielli, S Baltes, RG Kula… - Empirical Software …, 2022 - Springer
Discussions is a new feature of GitHub for asking questions or discussing topics outside of
specific Issues or Pull Requests. Before being available to all projects in December 2020, it …

9.6 million links in source code comments: Purpose, evolution, and decay

H Hata, C Treude, RG Kula… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Links are an essential feature of the World Wide Web, and source code repositories are no
exception. However, despite their many undisputed benefits, links can suffer from decay …

Open source license inconsistencies on github

T Wolter, A Barcomb, D Riehle… - ACM Transactions on …, 2023 - dl.acm.org
Almost all software, open or closed, builds on open source software and therefore needs to
comply with the license obligations of the open source code. Not knowing which licenses to …

Ethics in the mining of software repositories

NE Gold, J Krinke - Empirical Software Engineering, 2022 - Springer
Abstract Research in Mining Software Repositories (MSR) is research involving human
subjects, as the repositories usually contain data about developers' and users' interactions …

Siamese: scalable and incremental code clone search via multiple code representations

C Ragkhitwetsagul, J Krinke - Empirical Software Engineering, 2019 - Springer
This paper presents a novel code clone search technique that is accurate, incremental, and
scalable to hundreds of million lines of code. Our technique incorporates multiple code …

An empirical study of c++ vulnerabilities in crowd-sourced code examples

M Verdi, A Sami, J Akhondali, F Khomh… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
Software developers share programming solutions in Q&A sites like Stack Overflow, Stack
Exchange, Android forum, and so on. The reuse of crowd-sourced code snippets can …

Why aren't regular expressions a lingua franca? an empirical study on the re-use and portability of regular expressions

JC Davis, LG Michael IV, CA Coghlan… - Proceedings of the …, 2019 - dl.acm.org
This paper explores the extent to which regular expressions (regexes) are portable across
programming languages. Many languages offer similar regex syntaxes, and it would be …