Beyond appearance: a semantic controllable self-supervised learning framework for human-centric visual tasks W Chen, X Xu, J Jia, H Luo, Y Wang, F Wang, R Jin, X Sun Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 112 | 2023 |
Rethinking of pedestrian attribute recognition: A reliable evaluation under zero-shot pedestrian identity setting J Jia, H Huang, X Chen, K Huang arXiv preprint arXiv:2107.03576, 2021 | 99* | 2021 |
Spatial and semantic consistency regularizations for pedestrian attribute recognition J Jia, X Chen, K Huang Proceedings of the IEEE/CVF international conference on computer vision, 962-971, 2021 | 71 | 2021 |
Queryprop: Object query propagation for high-performance video object detection F He, N Gao, J Jia, X Zhao, K Huang Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 834-842, 2022 | 33 | 2022 |
Learning disentangled attribute representations for robust pedestrian attribute recognition J Jia, N Gao, F He, X Chen, K Huang Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 1069-1077, 2022 | 33 | 2022 |
Panopticdepth: A unified framework for depth-aware panoptic segmentation N Gao, F He, J Jia, Y Shan, H Zhang, X Zhao, K Huang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 26 | 2022 |
Inspro: Propagating instance query and proposal for online video instance segmentation F He, H Zhang, N Gao, J Jia, Y Shan, X Zhao, K Huang Advances in Neural Information Processing Systems 35, 19370-19383, 2022 | 16 | 2022 |
Rethinking of pedestrian attribute recognition: A reliable evaluation under zero-shot pedestrian identity setting. arXiv 2021 J Jia, H Huang, X Chen, K Huang arXiv preprint arXiv.2107.03576 2107, 0 | 5 | |
Learning disentangled label representations for multi-label classification J Jia, F He, N Gao, X Chen, K Huang arXiv preprint arXiv:2212.01461, 2022 | 4 | 2022 |
Knowledge adaptation from large language model to recommendation for practical industrial application J Jia, Y Wang, Y Li, H Chen, X Bai, Z Liu, J Liang, Q Chen, H Li, P Jiang, ... arXiv preprint arXiv:2405.03988, 2024 | 3 | 2024 |
ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval R Zhao, J Jia, Y Li, X Bai, Q Chen, H Li, P Jiang, X Li arXiv preprint arXiv:2408.02978, 2024 | 2 | 2024 |
Source-target coordinated training with multi-head hybrid-attention for domain adaptive semantic segmentation J Jia, W Chen, J Yuan, X Sun | 2 | 2022 |
SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization Z Tan, B Xue, J Jia, J Wang, W Ye, S Shi, M Sun, W Wu, Q Chen, P Jiang arXiv preprint arXiv:2412.10443, 2024 | 1 | 2024 |
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads S Kou, J Jin, C Liu, Y Ma, J Jia, Q Chen, P Jiang, Z Deng arXiv preprint arXiv:2412.00127, 2024 | 1 | 2024 |
Knowledge Condensation and Reasoning for Knowledge-based VQA D Hao, J Jia, L Guo, Q Wang, T Yang, Y Li, Y Cheng, B Wang, Q Chen, ... arXiv preprint arXiv:2403.10037, 2024 | 1 | 2024 |
Split Semantic Detection in Sandplay Images X Feng, X Chen, J Jia, K Huang arXiv preprint arXiv:2203.00907, 2022 | 1 | 2022 |
From Principles to Applications: A Comprehensive Survey of Discrete Tokenizers in Generation, Comprehension, Recommendation, and Information Retrieval J Jia, J Gao, B Xue, J Wang, Q Cai, Q Chen, X Zhao, P Jiang, K Gai arXiv preprint arXiv:2502.12448, 2025 | | 2025 |
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy T Yang, J Jia, X Zhu, W Zhao, B Wang, Y Cheng, Y Li, S Liu, Q Chen, ... arXiv preprint arXiv:2411.15453, 2024 | | 2024 |
Spatiotemporal Fine-grained Video Description for Short Videos T Yang, J Jia, B Wang, Y Cheng, Y Li, D Hao, X Cao, Q Chen, H Li, ... Proceedings of the 32nd ACM International Conference on Multimedia, 3945-3954, 2024 | | 2024 |
BCT: Three-branch Coordinated Training for Domain Adaptive Semantic Segmentation C Liang, J Jia, J Wang, J Yuan, X Zhao, W Chen | | |