- Academic Search

A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Cited by 92

Sora: A review on background, technology, limitations, and opportunities of large vision models

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

Cited by 225

Patch diffusion: Faster and more data-efficient training of diffusion models

Z Wang, Y Jiang, H Zheng, P Wang… - Advances in neural …, 2024 - proceedings.neurips.cc

Diffusion models are powerful, but they require a lot of time and data to train. We propose
Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training …

Cited by 202

FACT: Frame-action cross-attention temporal modeling for efficient action segmentation

Z Lu, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

We study supervised action segmentation whose goal is to predict framewise action labels
of a video. To capture temporal dependencies over long horizons, prior works either improve …

Cited by 13

Progress-aware online action segmentation for egocentric procedural task videos

Y Shen, E Elhamifar - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

We address the problem of online action segmentation for egocentric procedural task
videos. While previous studies have mostly focused on offline action segmentation where …

Cited by 7

Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Cited by 70

Learning to schedule in diffusion probabilistic models

Y Wang, X Wang, AD Dinh, B Du, C Xu - Proceedings of the 29th ACM …, 2023 - dl.acm.org

Recently, the field of generative models has seen a significant advancement with the
introduction of Diffusion Probabilistic Models (DPMs). The Denoising Diffusion Implicit Model …

Cited by 25

Action Detection via an Image Diffusion Process

LG Foo, T Li, H Rahmani, J Liu - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Action detection aims to localize the starting and ending points of action instances in
untrimmed videos and predict the classes of those instances. In this paper we make the …

Cited by 4

Rethinking conditional diffusion sampling with progressive guidance

AD Dinh, D Liu, C Xu - Advances in Neural Information …, 2024 - proceedings.neurips.cc

This paper tackles two critical challenges encountered in classifier guidance for diffusion
generative models, i.e., the lack of diversity and the presence of adversarial effects. These …

Cited by 6

ActSonic: Recognizing Everyday Activities from Inaudible Acoustic Wave Around the Body

S Mahmud, V Parikh, Q Liang, K Li, R Zhang… - Proceedings of the …, 2024 - dl.acm.org

We present ActSonic, an intelligent, low-power active acoustic sensing system integrated
into eyeglasses that can recognize 27 different everyday activities (e.g., eating, drinking …

Cited by 2
