Towards general text embeddings with multi-stage contrastive learning Z Li, X Zhang, Y Zhang, D Long, P Xie, M Zhang arXiv preprint arXiv:2308.03281, 2023 | 134 | 2023 |
Learning diverse document representations with deep query interactions for dense retrieval Z Li, N Yang, L Wang, F Wei arXiv preprint arXiv:2208.04232, 2022 | 11 | 2022 |
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search Z Li, J Zhang, C Yin, Y Ouyang, W Rong arXiv preprint arXiv:2403.16702, 2024 | 4 | 2024 |
Language Models are Universal Embedders X Zhang, Z Li, Y Zhang, D Long, P Xie, M Zhang, M Zhang arXiv preprint arXiv:2310.08232, 2023 | 3 | 2023 |
Text Representation Distillation via Information Bottleneck Principle Y Zhang, D Long, Z Li, P Xie arXiv preprint arXiv:2311.05472, 2023 | 2 | 2023 |
Challenging Decoder helps in Masked Auto-Encoder Pre-training for Dense Passage Retrieval Z Li, Y Zhang, D Long, P Xie arXiv preprint arXiv:2305.13197, 2023 | | 2023 |