Ziheng Wu
ByteDance
Verified email at bytedance.com
Title
Cited by
Year
Scale-aware modulation meet transformer
W Lin, Z Wu, J Chen, J Huang, L Jin
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 100 | Year: 2023
Beautifulprompt: Towards automatic prompt engineering for text-to-image synthesis
T Cao, C Wang, B Liu, Z Wu, J Zhu, J Huang
arXiv preprint arXiv:2311.06752, 2023
Cited by: 22 | Year: 2023
Hierarchical side-tuning for vision transformers
W Lin, Z Wu, W Yang, M Huang, J Huang, L Jin
arXiv preprint arXiv:2310.05393, 2023
Cited by: 10 | Year: 2023
YOLOX-PAI: an improved YOLOX, stronger and faster than YOLOv6
Z Wu, X Zou, W Zhou, J Huang
arXiv preprint arXiv:2208.13040, 2022
Cited by: 9 | Year: 2022
Facechain: A playground for identity-preserving portrait generation
Y Liu, C Yu, L Shang, Z Wu, X Wang, Y Zhao, L Zhu, C Cheng, W Chen, ...
arXiv preprint arXiv:2308.14256, 2023
Cited by: 7 | Year: 2023
M2SD: Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning
J Lin, Z Wu, W Lin, J Huang, RH Luo
Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3422-3431, 2024
Cited by: 6 | Year: 2024
Elastic-link for binarized neural networks
J Hu, Z Wu, V Tan, Z Lu, M Zeng, E Wu
Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 942-950, 2022
Cited by: 6 | Year: 2022
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
Z Chu, J Chen, C Chen, C Wang, Z Wu, J Huang, W Qian
Proceedings of the 2024 SIAM International Conference on Data Mining (SDM …, 2024
Cited by: 4 | Year: 2024
EasyPhoto: your smart AI photo generator
Z Wu, J Xu, X Zou, K Huang, X Shi, J Huang
arXiv preprint arXiv:2310.04672, 2023
Cited by: 4 | Year: 2023
Rapid diffusion: Building domain-specific text-to-image synthesizers with fast inference speed
B Liu, W Lin, Z Duan, C Wang, W Ziheng, Z Zipeng, K Jia, L Jin, C Chen, ...
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
Cited by: 4 | Year: 2023
Diffsynth: Latent in-iteration deflickering for realistic video synthesis
Z Duan, L You, C Wang, C Chen, Z Wu, W Qian, J Huang
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2024
Cited by: 2 | Year: 2024
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
Z Wu, Z Chen, R Luo, C Zhang, Y Gao, Z He, X Wang, H Lin, M Qiu
arXiv preprint arXiv:2501.05901, 2025
Year: 2025
Articles 1–12