Molecular design in drug discovery: a comprehensive review of deep generative models

Y Cheng, Y Gong, Y Liu, B Song… - Briefings in …, 2021 - academic.oup.com
Deep generative models have been an upsurge in the deep learning community since they
were proposed. These models are designed for generating new synthetic data including …

Unifying the design space and optimizing linear and nonlinear truss metamaterials by generative modeling

L Zheng, K Karapiperis, S Kumar… - Nature …, 2023 - nature.com
The rise of machine learning has fueled the discovery of new materials and, especially,
metamaterials—truss lattices being their most prominent class. While their tailorable …

[BOK][B] Dive into deep learning

A Zhang, ZC Lipton, M Li, AJ Smola - 2023 - books.google.com
Deep learning has revolutionized pattern recognition, introducing tools that power a wide
range of technologies in such diverse fields as computer vision, natural language …

A comprehensive survey for generative data augmentation

Y Chen, Z Yan, Y Zhu - Neurocomputing, 2024 - Elsevier
Generative data augmentation (GDA) has emerged as a promising technique to alleviate
data scarcity in machine learning applications. This thesis presents a comprehensive survey …

Interpretable-through-prototypes deepfake detection for diffusion models

A Aghasanli, D Kangin… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The process of recognizing and distinguishing between real content and content generated
by deep learning algorithms, often referred to as deepfakes, is known as deepfake detection …

Neural categorical priors for physics-based character control

Q Zhu, H Zhang, M Lan, L Han - ACM Transactions on Graphics (TOG), 2023 - dl.acm.org
Recent advances in learning reusable motion priors have demonstrated their effectiveness
in generating naturalistic behaviors. In this paper, we propose a new learning framework in …

Generating features with increased crop-related diversity for few-shot object detection

J Xu, H Le, D Samaras - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Two-stage object detectors generate object proposals and classify them to detect objects in
images. These proposals often do not perfectly contain the objects but overlap with them in …

Disentangling cognitive diagnosis with limited exercise labels

X Chen, L Wu, F Liu, L Chen, K Zhang… - Advances in …, 2023 - proceedings.neurips.cc
Cognitive diagnosis is an important task in intelligence education, which aims at measuring
students' proficiency in specific knowledge concepts. Given a fully labeled exercise-concept …

Lumina-t2x: Transforming text into any modality, resolution, and duration via flow-based large diffusion transformers

P Gao, L Zhuo, D Liu, R Du, X Luo, L Qiu… - arxiv preprint arxiv …, 2024 - arxiv.org
Sora unveils the potential of scaling Diffusion Transformer for generating photorealistic
images and videos at arbitrary resolutions, aspect ratios, and durations, yet it still lacks …

Mm-tts: Multi-modal prompt based style transfer for expressive text-to-speech synthesis

W Guan, Y Li, T Li, H Huang, F Wang, J Lin… - Proceedings of the …, 2024 - ojs.aaai.org
The style transfer task in Text-to-Speech (TTS) refers to the process of transferring style
information into text content to generate corresponding speech with a specific style …