How discrete and continuous diffusion meet: Comprehensive analysis of discrete diffusion models via a stochastic integral framework
Discrete diffusion models have gained increasing attention for their ability to model complex
distributions with tractable sampling and inference. However, the error analysis for discrete …
distributions with tractable sampling and inference. However, the error analysis for discrete …
Diffusion-based diverse audio captioning with retrieval-guided Langevin dynamics
Y Zhu, A Men, L **ao - Information Fusion, 2025 - Elsevier
Audio captioning, a comprehensive task of audio understanding, aims to provide a natural-
language description of an audio clip. Beyond accuracy, diversity is also a critical …
language description of an audio clip. Beyond accuracy, diversity is also a critical …
[HTML][HTML] Comparing human pose estimation through deep learning approaches: An overview
In the everyday IoT ecosystem, many devices and systems are interconnected in an
intelligent living environment to create a comfortable and efficient living space. In this …
intelligent living environment to create a comfortable and efficient living space. In this …
Beyond autoregression: Discrete diffusion for complex reasoning and planning
Autoregressive language models, despite their impressive capabilities, struggle with
complex reasoning and long-term planning tasks. We introduce discrete diffusion models as …
complex reasoning and long-term planning tasks. We introduce discrete diffusion models as …
Diffusion of thoughts: Chain-of-thought reasoning in diffusion language models
Recently, diffusion models have garnered significant interest in the field of text processing
due to their many potential advantages compared to conventional autoregressive models. In …
due to their many potential advantages compared to conventional autoregressive models. In …
Fast solvers for discrete diffusion models: Theory and applications of high-order algorithms
Discrete diffusion models have emerged as a powerful generative modeling framework for
discrete data with successful applications spanning from text generation to image synthesis …
discrete data with successful applications spanning from text generation to image synthesis …
MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …
Table-to-Text Generation with Pretrained Diffusion Models
Diffusion models have demonstrated significant potential in achieving state-of-the-art
performance across various text generation tasks. In this systematic study, we investigate …
performance across various text generation tasks. In this systematic study, we investigate …
Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …
applications in service robot manipulation and extended reality. To achieve this, some …
Private Synthetic Text Generation with Diffusion Models
How capable are diffusion models of generating synthetics texts? Recent research shows
their strengths, with performance reaching that of auto-regressive LLMs. But are they also …
their strengths, with performance reaching that of auto-regressive LLMs. But are they also …