Jiayi Ji
Xiamen University (XMU) & National University of Singapore (NUS)
Verified email at xmu.edu.cn
Title
Cited by
Year
Dual-level collaborative transformer for image captioning
Y Luo, J Ji, X Sun, L Cao, Y Wu, F Huang, CW Lin, R Ji
AAAI 2021 35 (3), 2286-2293, 2021
Cited by: 331, Year: 2021
RSTNet: Captioning with adaptive attention on visual and non-visual words
X Zhang, X Sun, Y Luo, J Ji, Y Zhou, Y Wu, F Huang, R Ji
CVPR 2021, 15465-15474, 2021
Cited by: 262, Year: 2021
Improving image captioning by leveraging intra- and inter-layer global representation in transformer network
J Ji, Y Luo, X Sun, F Chen, G Luo, Y Wu, Y Gao, R Ji
AAAI 2021 35 (2), 1655-1663, 2021
Cited by: 193, Year: 2021
Towards local visual modeling for image captioning
Y Ma, J Ji, X Sun, Y Zhou, R Ji
Pattern Recognition 138, 109420, 2023
Cited by: 74, Year: 2023
Knowing what to learn: a metric-oriented focal mechanism for image captioning
J Ji, Y Ma, X Sun, Y Zhou, Y Wu, R Ji
IEEE Transactions on Image Processing 31, 4321-4335, 2022
Cited by: 43, Year: 2022
Towards Semantic Equivalence of Tokenization in Multimodal LLM
S Wu, H Fei, X Li, J Ji, H Zhang, TS Chua, S Yan
ICLR 2025, 2024
Cited by: 40, Year: 2024
X-Mesh: Towards fast and accurate text-driven 3D stylization via dynamic textual guidance
Y Ma, X Zhang, X Sun, J Ji, H Wang, G Jiang, W Zhuang, R Ji
ICCV 2023, 2749-2760, 2023
Cited by: 38, Year: 2023
Variational structured semantic inference for diverse image captioning
F Chen, R Ji, J Ji, X Sun, B Zhang, X Ge, Y Wu, F Huang, Y Wang
NeurIPS 2019, 1931-1941, 2019
Cited by: 35, Year: 2019
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
S Liu, Y Ma, X Zhang, H Wang, J Ji*, X Sun, R Ji
CVPR 2024, 2024
Cited by: 33, Year: 2024
Knowing what it is: semantic-enhanced dual attention transformer
Y Ma, J Ji, X Sun, Y Zhou, Y Wu, F Huang, R Ji
IEEE Transactions on Multimedia, 2022
Cited by: 26, Year: 2022
Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning
J Ji, X Huang, X Sun, Y Zhou, G Luo, L Cao, J Liu, L Shao, R Ji
IEEE Transactions on Multimedia, 2022
Cited by: 21, Year: 2022
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
C Wu, Y Ma, Q Chen, H Wang, G Luo, J Ji*, X Sun
AAAI 2024, 2024
Cited by: 17, Year: 2024
Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
H Wang, J Ji, Y Zhou, Y Wu, X Sun
AAAI 2023, 2023
Cited by: 17, Year: 2023
BEAT: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval
Y Ma, X Sun, J Ji, G Jiang, W Zhuang, R Ji
ACM MM 2023, 4157-4168, 2023
Cited by: 15, Year: 2023
Attacking image captioning towards accuracy-preserving target words removal
J Ji, X Sun, Y Zhou, R Ji, F Chen, J Liu, Q Tian
ACM MM 2020, 4226-4234, 2020
Cited by: 14, Year: 2020
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models
M Wu, J Ji*, O Huang, J Li, Y Wu, X Sun, R Ji
ICML 2024, 2024
Cited by: 13*, Year: 2024
X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks
Z Qian, Y Ma, J Ji, X Sun
AAAI 2024 38 (5), 4551-4559, 2024
Cited by: 13, Year: 2024
Beyond first impressions: Integrating joint multi-modal cues for comprehensive 3D representation
H Wang, J Tang, J Ji, X Sun, R Zhang, Y Ma, M Zhao, L Li, Z Zhao, T Lv, ...
ACM MM 2023, 3403-3414, 2023
Cited by: 12, Year: 2023
Creating High-quality 3D Content by Bridging the Gap Between Text-to-2D and Text-to-3D Generation
Y Ma, Y Fan, J Ji, H Wang, H Yin, X Sun, R Ji
ACM ToMM, 2024
Cited by: 9*, Year: 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
M Wu, X Cai, J Ji*, J Li, O Huang, G Luo, H Fei, X Sun, R Ji
NeurIPS 2024, 2024
Cited by: 8, Year: 2024
Articles 1–20