Audio-visual segmentation by exploring cross-modal mutual semantics
The audio-visual segmentation (AVS) task aims to segment sounding objects from a given
video. Existing works mainly focus on fusing audio and visual features of a given video to …
BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge
Given an audio-visual pair, audio-visual segmentation (AVS) aims to locate sounding
sources by predicting pixel-wise maps. Previous methods assume that each sound …
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
Generating vivid and emotional 3D co-speech gestures is crucial for virtual avatar animation
in human-machine interaction applications. While the existing methods enable generating …
Chain of generation: Multi-modal gesture synthesis via cascaded conditional control
This study aims to improve the generation of 3D gestures by utilizing multimodal information
from human speech. Previous studies have focused on incorporating additional modalities …
The DiffuseStyleGesture+ entry to the GENEA Challenge 2023
In this paper, we introduce the DiffuseStyleGesture+, our solution for the Generation and
Evaluation of Non-verbal Behavior for Embodied Agents (GENEA) Challenge 2023, which …
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
In this work, we present Semantic Gesticulator, a novel framework designed to synthesize
realistic gestures accompanying speech with strong semantic correspondence. Semantically …
BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics
The recently emerging text-to-motion advances have inspired numerous attempts for
convenient and interactive human motion generation. Yet existing methods are largely …
MambaTalk: Efficient holistic gesture synthesis with selective state space models
Gesture synthesis is a vital realm of human-computer interaction, with wide-ranging
applications across various fields like film, robotics, and virtual reality. Recent advancements …
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
We propose EMAGE, a framework to generate full-body human gestures from audio and
masked gestures, encompassing facial, local body, hand, and global movements. To achieve …
Learning Transferable Compound Expressions from Masked AutoEncoder Pretraining
Video-based Compound Expression Recognition (CER) aims to identify compound
expressions in everyday interactions per frame. Unlike rapid progress in Facial Expression …