ViTMatte: Boosting image matting with pre-trained plain vision transformers J Yao, X Wang, S Yang, B Wang Information Fusion 103, 102091, 2024 | 55 | 2024 |
Matte anything: Interactive natural image matting with segment anything model J Yao, X Wang, L Ye, W Liu Image and Vision Computing 147, 105067, 2024 | 35 | 2024 |
LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels Z Cui, J Yao, L Zeng, J Yang, W Liu, X Wang arXiv preprint arXiv:2407.18054, 2024 | 4 | 2024 |
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification J Yao, W Cheng, W Liu, X Wang NeurIPS 2024, 2024 | 3 | 2024 |
EVA-X: A Foundation Model for General Chest X-ray Analysis with Self-supervised Learning J Yao, X Wang, Y Song, H Zhao, J Ma, Y Chen, W Liu, B Wang arXiv preprint arXiv:2405.05237, 2024 | 3 | 2024 |
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models J Yao, X Wang arXiv preprint arXiv:2501.01423, 2025 | 1 | 2025 |
ViTGaze: Gaze Following with Interaction Features in Vision Transformers Y Song, X Wang, J Yao, W Liu, J Zhang, X Xu arXiv preprint arXiv:2403.12778, 2024 | 1 | 2024 |