A novel neural source code representation based on abstract syntax tree

J Zhang, X Wang, H Zhang, H Sun… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Exploiting machine learning techniques for analyzing programs has attracted much
attention. One key problem is how to represent code fragments well for follow-up analysis …

Predictive models in software engineering: Challenges and opportunities

Y Yang, X **a, D Lo, T Bi, J Grundy… - ACM Transactions on …, 2022 - dl.acm.org
Predictive models are one of the most important techniques that are widely applied in many
areas of software engineering. There have been a large number of primary studies that …

Exploring emerging hacker assets and key hackers for proactive cyber threat intelligence

S Samtani, R Chinn, H Chen… - Journal of Management …, 2017 - Taylor & Francis
Cyber attacks cost the global economy approximately $445 billion per year. To mitigate
attacks, many companies rely on cyber threat intelligence (CTI), or threat intelligence related …

Graphsearchnet: Enhancing gnns via capturing global dependencies for semantic code search

S Liu, X **e, J Siow, L Ma, G Meng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Code search aims to retrieve accurate code snippets based on a natural language query to
improve software productivity and quality. With the massive amount of available programs …

Similarity-based analyses on software applications: A systematic literature review

M Auch, M Weber, P Mandl, C Wolff - Journal of Systems and Software, 2020 - Elsevier
In empirical studies on processes, practices, and techniques of software engineering,
automation and machine learning are gaining popularity. In order to extract knowledge from …

Why my code summarization model does not work: Code comment improvement with category prediction

Q Chen, X **a, H Hu, D Lo, S Li - ACM Transactions on Software …, 2021 - dl.acm.org
Code summarization aims at generating a code comment given a block of source code and
it is normally performed by training machine learning algorithms on existing code block …

Can pre-trained code embeddings improve model performance? Revisiting the use of code embeddings in software engineering tasks

Z Ding, H Li, W Shang, THP Chen - Empirical Software Engineering, 2022 - Springer
Word representation plays a key role in natural language processing (NLP). Various
representation methods have been developed, among which pre-trained word embeddings …

Clustering mobile apps based on mined textual features

AA Al-Subaihin, F Sarro, S Black, L Capra… - Proceedings of the 10th …, 2016 - dl.acm.org
Context: Categorising software systems according to their functionality yields many benefits
to both users and developers. Goal: In order to uncover the latent clustering of mobile apps …

CRaDLe: Deep code retrieval based on semantic dependency learning

W Gu, Z Li, C Gao, C Wang, H Zhang, Z Xu, MR Lyu - Neural Networks, 2021 - Elsevier
Code retrieval is a common practice for programmers to reuse existing code snippets in the
open-source repositories. Given a user query (ie, a natural language description), code …

Exploring hacker assets in underground forums

S Samtani, R Chinn, H Chen - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
Many large companies today face the risk of data breaches via malicious software,
compromising their business. These types of attacks are usually executed using hacker …