Shitao Xiao
Verified email at bupt.edu.cn
Title
Cited by
Year
C-Pack: Packed resources for general Chinese embeddings
S Xiao, Z Liu, P Zhang, N Muennighoff, D Lian, JY Nie
Proceedings of the 47th international ACM SIGIR conference on research and …, 2024
Cited by: 451, Year: 2024
BGE M3-Embedding: Multi-lingual, multi-functionality, multi-granularity text embeddings through self-knowledge distillation
J Chen*, S Xiao*, P Zhang, K Luo, D Lian, Z Liu
arXiv preprint arXiv:2402.03216, 2024
Cited by: 325*, Year: 2024
GraphFormers: GNN-nested transformers for representation learning on textual graph
J Yang, Z Liu, S Xiao, C Li, D Lian, S Agrawal, A Singh, G Sun, X Xie
Advances in Neural Information Processing Systems 34, 28798-28810, 2021
Cited by: 169, Year: 2021
RetroMAE: Pre-training Retrieval-oriented Transformers via Masked Auto-Encoder
S Xiao, Z Liu, Y Shao, Z Cao
arXiv preprint arXiv:2205.12035, 2022
Cited by: 160*, Year: 2022
Retrieve anything to augment large language models
P Zhang*, S Xiao*, Z Liu, Z Dou, JY Nie
arXiv preprint arXiv:2310.07554, 2023
Cited by: 101, Year: 2023
MLVU: A comprehensive benchmark for multi-task long video understanding
J Zhou, Y Shu, B Zhao, B Wu, S Xiao, X Yang, Y Xiong, B Zhang, T Huang, ...
arXiv preprint arXiv:2406.04264, 2024
Cited by: 70, Year: 2024
Soaring from 4K to 400K: Extending LLM’s context with activation beacon
P Zhang, Z Liu, S Xiao, N Shao, Q Ye, Z Dou
arXiv preprint arXiv:2401.03462, 2024
Cited by: 50*, Year: 2024
Making large language models a better foundation for dense retrieval
C Li, Z Liu, S Xiao, Y Shao
arXiv preprint arXiv:2312.15503, 2023
Cited by: 47*, Year: 2023
LECF: recommendation via learnable edge collaborative filtering
S Xiao, Y Shao, Y Li, H Yin, Y Shen, B Cui
Science China Information Sciences 65 (1), 112101, 2022
Cited by: 41, Year: 2022
Training large-scale news recommenders with pretrained language models in the loop
S Xiao, Z Liu, Y Shao, T Di, B Middha, F Wu, X Xie
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
Cited by: 39, Year: 2022
OmniGen: Unified image generation
S Xiao, Y Wang, J Zhou, H Yuan, X Xing, R Yan, S Wang, T Huang, Z Liu
arXiv preprint arXiv:2409.11340, 2024
Cited by: 28, Year: 2024
Uni-Retriever: Towards learning the unified embedding based retriever in Bing sponsored search
J Zhang, Z Liu, W Han, S Xiao, R Zheng, Y Shao, H Sun, H Zhu, ...
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
Cited by: 28, Year: 2022
Distill-VQ: Learning retrieval oriented vector quantization by distilling knowledge from dense embeddings
S Xiao, Z Liu, W Han, J Zhang, D Lian, Y Gong, Q Chen, F Yang, H Sun, ...
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
Cited by: 28, Year: 2022
RetroMAE-2: Duplex masked auto-encoder for pre-training retrieval-oriented language models
Z Liu, S Xiao, Y Shao, Z Cao
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
Cited by: 25, Year: 2023
LM-Cocktail: Resilient tuning of language models via model merging
S Xiao, Z Liu, P Zhang, X Xing
arXiv preprint arXiv:2311.13534, 2023
Cited by: 20, Year: 2023
Progressively optimized bi-granular document representation for scalable embedding based retrieval
S Xiao, Z Liu, W Han, J Zhang, Y Shao, D Lian, C Li, H Sun, D Deng, ...
Proceedings of the ACM Web Conference 2022, 286-296, 2022
Cited by: 16, Year: 2022
BGE landmark embedding: A chunking-free embedding method for retrieval augmented long-context large language models
K Luo, Z Liu, S Xiao, K Liu
arXiv preprint arXiv:2402.11573, 2024
Cited by: 15, Year: 2024
VISTA: visualized text embedding for universal multi-modal retrieval
J Zhou, Z Liu, S Xiao, B Zhao, Y Xiong
arXiv preprint arXiv:2406.04292, 2024
Cited by: 14, Year: 2024
Making text embedders few-shot learners
C Li, MH Qin, S Xiao, J Chen, K Luo, Y Shao, D Lian, Z Liu
arXiv preprint arXiv:2409.15700, 2024
Cited by: 12, Year: 2024
MindSim: user simulator for news recommenders
X Luo, Z Liu, S Xiao, X Xie, D Li
Proceedings of the ACM Web Conference 2022, 2067-2077, 2022
Cited by: 12, Year: 2022
Articles 1–20