Yongxin Zhu
Title
Cited by
Year
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
arXiv preprint arXiv:2406.07476, 2024
Cited by: 156, Year: 2024
Empowering diffusion models on the embedding space for text generation
Z Gao, J Guo, X Tan, Y Zhu, F Zhang, J Bian, L Xu
arXiv preprint arXiv:2212.09412, 2022
Cited by: 52, Year: 2022
Sequence-to-action: Grammatical error correction with action guided sequence generation
J Li, J Guo, Y Zhu, X Sheng, D Jiang, B Ren, L Xu
AAAI 2022 36 (10), 10974-10982, 2022
Cited by: 28, Year: 2022
Span-level aspect-based sentiment analysis via table filling
M Zhang, Y Zhu, Z Liu, Z Bao, Y Wu, X Sun, L Xu
ACL 2023-main, 9273-9284, 2023
Cited by: 19, Year: 2023
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Y Zhu, Z Liu, Y Liang, X Li, H Liu, C Bao, L Xu
AAAI 2023 37 (9), 11479-11487, 2023
Cited by: 9, Year: 2023
Difformer: Empowering diffusion models on the embedding space for text generation
Z Gao, J Guo, X Tan, Y Zhu, F Zhang, J Bian, L Xu
NAACL 2024-main, 2022
Cited by: 7, Year: 2022
Addressing representation collapse in vector quantized models with one linear layer
Y Zhu, B Li, Y Xin, L Xu
arXiv preprint arXiv:2411.02038, 2024
Cited by: 6, Year: 2024
Visual Hallucination Elevates Speech Recognition
F Zhang, Y Zhu, X Wang, H Chen, X Sun, L Xu
AAAI 2024 38 (17), 19542-19550, 2024
Cited by: 5, Year: 2024
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
Y Zhu, Z Gao, X Zhou, Z Ye, L Xu
EMNLP 2023-main, 11573-11583, 2023
Cited by: 3, Year: 2023
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
H Yan, Y Zhu, K Zheng, B Liu, H Cao, D Jiang, L Xu
ACL 2024-main 1, 15009–15022, 2024
Cited by: 2, Year: 2024
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer
Y Zhu, D Su, L He, L Xu, D Yu
ACL 2024-main 1, 1764–1775, 2024
Cited by: 2, Year: 2024
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Y Zhu, B Li, H Zhang, X Li, L Xu, L Bing
NeurIPS 2024, 2024
Cited by: 1, Year: 2024
ITrievalKD: An Iterative Retrieval Framework Assisted with Knowledge Distillation for Noisy Text-to-Image Retrieval
Z Liu, Y Zhu, Z Gao, X Sheng, L Xu
PAKDD 2023, 257-268, 2023
Cited by: 1, Year: 2023
Summarizing Like Human: Edit-Based Text Summarization with Keywords
Y Liang, J Guo, Y Zhu, L Xu
ICANN 2024, 333-351, 2024
Year: 2024
Articles 1–14