Symlm: Predicting function names in stripped binaries via context-sensitive execution-aware code embeddings

X **, K Pei, JY Won, Z Lin - Proceedings of the 2022 ACM SIGSAC …, 2022 - dl.acm.org
Predicting function names in stripped binaries is an extremely useful but challenging task, as
it requires summarizing the execution behavior and semantics of the function in human …

A survey on machine learning techniques for source code analysis

T Sharma, M Kechagia, S Georgiou, R Tiwari… - arxiv preprint arxiv …, 2021 - arxiv.org
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

Learning to represent programs with heterogeneous graphs

K Zhang, W Wang, H Zhang, G Li, Z ** - Proceedings of the 30th IEEE …, 2022 - dl.acm.org
Code representation, which transforms programs into vectors with semantics, is essential for
source code processing. We have witnessed the effectiveness of incorporating structural …

[HTML][HTML] A survey on machine learning techniques applied to source code

T Sharma, M Kechagia, S Georgiou, R Tiwari… - Journal of Systems and …, 2024 - Elsevier
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

Learning to recommend method names with global context

F Liu, G Li, Z Fu, S Lu, Y Hao, Z ** - Proceedings of the 44th …, 2022 - dl.acm.org
In programming, the names for the program entities, especially for the methods, are the
intuitive characteristic for understanding the functionality of the code. To ensure the …

Natural is the best: Model-agnostic code simplification for pre-trained large language models

Y Wang, X Li, TN Nguyen, S Wang, C Ni… - Proceedings of the ACM …, 2024 - dl.acm.org
Pre-trained Large Language Models (LLM) have achieved remarkable successes in several
domains. However, code-oriented LLMs are often heavy in computational complexity, and …

Lightweight global and local contexts guided method name recommendation with prior knowledge

S Wang, M Wen, B Lin, X Mao - Proceedings of the 29th ACM Joint …, 2021 - dl.acm.org
The quality of method names is critical for the readability and maintainability of source code.
However, it is often challenging to construct concise method names. To alleviate this …

Analysis of Program Representations Based on Abstract Syntax Trees and Higher-Order Markov Chains for Source Code Classification Task

AV Gorchakov, LA Demidova, PN Sovietov - Future Internet, 2023 - mdpi.com
In this paper we consider the research and development of classifiers that are trained to
predict the task solved by source code. Possible applications of such task detection …

Dataset of Program Source Codes Solving Unique Programming Exercises Generated by Digital Teaching Assistant

LA Demidova, EG Andrianova, PN Sovietov… - Data, 2023 - mdpi.com
This paper presents a dataset containing automatically collected source codes solving
unique programming exercises of different types. The programming exercises were …

Machine learning-based automated grading and feedback tools for programming: A meta-analysis

M Messer, NCC Brown, M Kölling, M Shi - Proceedings of the 2023 …, 2023 - dl.acm.org
Research into automated grading has increased as Computer Science courses grow.
Dynamic and static approaches are typically used to implement these graders, the most …