Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arxiv preprint arxiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

A comprehensive survey on deep graph representation learning

W Ju, Z Fang, Y Gu, Z Liu, Q Long, Z Qiao, Y Qin… - Neural Networks, 2024 - Elsevier
Graph representation learning aims to effectively encode high-dimensional sparse graph-
structured data into low-dimensional dense vectors, which is a fundamental task that has …

Structure-based drug design with equivariant diffusion models

A Schneuing, C Harris, Y Du, K Didi… - Nature Computational …, 2024 - nature.com
Abstract Structure-based drug design (SBDD) aims to design small-molecule ligands that
bind with high affinity and specificity to pre-determined protein targets. Generative SBDD …

[HTML][HTML] 3D printing of biodegradable polymers and their composites–Current state-of-the-art, properties, applications, and machine learning for potential future …

SAV Dananjaya, VS Chevali, JP Dear, P Potluri… - Progress in Materials …, 2024 - Elsevier
This review paper comprehensively examines the dynamic landscape of 3D printing and
Machine Learning utilizing biodegradable polymers and their composites, presenting a …

Scientific large language models: A survey on biological & chemical domains

Q Zhang, K Ding, T Lv, X Wang, Q Yin, Y Zhang… - ACM Computing …, 2024 - dl.acm.org
Large Language Models (LLMs) have emerged as a transformative power in enhancing
natural language comprehension, representing a significant stride toward artificial general …

Diffusion-based molecule generation with informative prior bridges

L Wu, C Gong, X Liu, M Ye… - Advances in Neural …, 2022 - proceedings.neurips.cc
AI-based molecule generation provides a promising approach to a large area of biomedical
sciences and engineering, such as antibody design, hydrolase engineering, or vaccine …

Generating 3d molecules for target protein binding

M Liu, Y Luo, K Uchino, K Maruhashi, S Ji - arxiv preprint arxiv …, 2022 - arxiv.org
A fundamental problem in drug discovery is to design molecules that bind to specific
proteins. To tackle this problem using machine learning methods, here we propose a novel …

Diffbp: Generative diffusion of 3d molecules for target protein binding

H Lin, Y Huang, O Zhang, S Ma, M Liu, X Li, L Wu… - Chemical …, 2025 - pubs.rsc.org
Generating molecules that bind to specific proteins is an important but challenging task in
drug discovery. Most previous works typically generate atoms autoregressively, with element …

Molecular geometry pretraining with se (3)-invariant denoising distance matching

S Liu, H Guo, J Tang - arxiv preprint arxiv:2206.13602, 2022 - arxiv.org
Molecular representation pretraining is critical in various applications for drug and material
discovery due to the limited number of labeled molecules, and most existing work focuses …

Midi: Mixed graph and 3d denoising diffusion for molecule generation

C Vignac, N Osman, L Toni, P Frossard - Joint European Conference on …, 2023 - Springer
This work introduces MiDi, a novel diffusion model for jointly generating molecular graphs
and their corresponding 3D atom arrangements. Unlike existing methods that rely on …