How discrete and continuous diffusion meet: Comprehensive analysis of discrete diffusion models via a stochastic integral framework

Y Ren, H Chen, GM Rotskoff, L Ying - arXiv preprint arXiv:2410.03601, 2024 - arxiv.org
Discrete diffusion models have gained increasing attention for their ability to model complex
distributions with tractable sampling and inference. However, the error analysis for discrete …

Diffusion-based diverse audio captioning with retrieval-guided Langevin dynamics

Y Zhu, A Men, L Xiao - Information Fusion, 2025 - Elsevier
Audio captioning, a comprehensive task of audio understanding, aims to provide a natural-
language description of an audio clip. Beyond accuracy, diversity is also a critical …

Comparing human pose estimation through deep learning approaches: An overview

G Dibenedetto, S Sotiropoulos, M Polignano… - Computer Vision and …, 2025 - Elsevier
In the everyday IoT ecosystem, many devices and systems are interconnected in an
intelligent living environment to create a comfortable and efficient living space. In this …

Beyond autoregression: Discrete diffusion for complex reasoning and planning

J Ye, J Gao, S Gong, L Zheng, X Jiang, Z Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Autoregressive language models, despite their impressive capabilities, struggle with
complex reasoning and long-term planning tasks. We introduce discrete diffusion models as …

Diffusion of thoughts: Chain-of-thought reasoning in diffusion language models

J Ye, S Gong, L Chen, L Zheng, J Gao, H Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, diffusion models have garnered significant interest in the field of text processing
due to their many potential advantages compared to conventional autoregressive models. In …

Fast solvers for discrete diffusion models: Theory and applications of high-order algorithms

Y Ren, H Chen, Y Zhu, W Guo, Y Chen… - arXiv preprint arXiv …, 2025 - arxiv.org
Discrete diffusion models have emerged as a powerful generative modeling framework for
discrete data with successful applications spanning from text generation to image synthesis …

MADiff: Motion-aware mamba diffusion models for hand trajectory prediction on egocentric videos

J Ma, X Chen, W Bao, J Xu, H Wang - arXiv preprint arXiv:2409.02638, 2024 - arxiv.org
Understanding human intentions and actions through egocentric videos is important on the
path to embodied artificial intelligence. As a branch of egocentric vision techniques, hand …

Table-to-Text Generation with Pretrained Diffusion Models

A Krylov, O Somov - IEEE Access, 2024 - ieeexplore.ieee.org
Diffusion models have demonstrated significant potential in achieving state-of-the-art
performance across various text generation tasks. In this systematic study, we investigate …

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

J Ma, J Xu, X Chen, H Wang - arXiv preprint arXiv:2405.04370, 2024 - arxiv.org
Understanding how humans would behave during hand-object interaction is vital for
applications in service robot manipulation and extended reality. To achieve this, some …

Private Synthetic Text Generation with Diffusion Models

S Ochs, I Habernal - arXiv preprint arXiv:2410.22971, 2024 - arxiv.org
How capable are diffusion models of generating synthetic texts? Recent research shows
their strengths, with performance reaching that of auto-regressive LLMs. But are they also …