[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches
A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …
A survey of controllable text generation using transformer-based pre-trained language models
Controllable Text Generation (CTG) is an emerging area in the field of natural language
generation (NLG). It is regarded as crucial for the development of advanced text generation …
generation (NLG). It is regarded as crucial for the development of advanced text generation …
Edge: Editable dance generation from music
Dance is an important human art form, but creating new dances can be difficult and time-
consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art …
consuming. In this work, we introduce Editable Dance GEneration (EDGE), a state-of-the-art …
Learning fine-grained bimanual manipulation with low-cost hardware
Fine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously
difficult for robots because they require precision, careful coordination of contact forces, and …
difficult for robots because they require precision, careful coordination of contact forces, and …
A survey on trajectory-prediction methods for autonomous driving
In order to drive safely in a dynamic environment, autonomous vehicles should be able to
predict the future states of traffic participants nearby, especially surrounding vehicles, similar …
predict the future states of traffic participants nearby, especially surrounding vehicles, similar …
High-resolution image synthesis with latent diffusion models
By decomposing the image formation process into a sequential application of denoising
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …
Repurposing diffusion-based image generators for monocular depth estimation
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …
from a single image is geometrically ill-posed and requires scene understanding so it is not …
Ambiguous medical image segmentation using diffusion models
A Rahman, JMJ Valanarasu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Collective insights from a group of experts have always proven to outperform an individual's
best diagnostic for clinical tasks. For the task of medical image segmentation, existing …
best diagnostic for clinical tasks. For the task of medical image segmentation, existing …
Motionclip: Exposing human motion generation to clip space
We introduce MotionCLIP, a 3D human motion auto-encoder featuring a latent embedding
that is disentangled, well behaved, and supports highly semantic textual descriptions …
that is disentangled, well behaved, and supports highly semantic textual descriptions …
Diffusion policies as an expressive policy class for offline reinforcement learning
Offline reinforcement learning (RL), which aims to learn an optimal policy using a previously
collected static dataset, is an important paradigm of RL. Standard RL methods often perform …
collected static dataset, is an important paradigm of RL. Standard RL methods often perform …