[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arxiv preprint arxiv …, 2024 - ai.radensa.ru
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

I3 retriever: incorporating implicit interaction in pre-trained language models for passage retrieval

Q Dong, Y Liu, Q Ai, H Li, S Wang, Y Liu… - Proceedings of the 32nd …, 2023 - dl.acm.org
Passage retrieval is a fundamental task in many information systems, such as web search
and question answering, where both efficiency and effectiveness are critical concerns. In …

User retention-oriented recommendation with decision transformer

K Zhao, L Zou, X Zhao, M Wang, D Yin - Proceedings of the ACM Web …, 2023 - dl.acm.org
Improving user retention with reinforcement learning (RL) has attracted increasing attention
due to its significant importance in boosting user engagement. However, training the RL …

Model-based unbiased learning to rank

D Luo, L Zou, Q Ai, Z Chen, D Yin… - Proceedings of the …, 2023 - dl.acm.org
Unbiased Learning to Rank (ULTR), ie, learning to rank documents with biased user
feedback data, is a well-known challenge in information retrieval. Existing methods in …

Pre-training with Large Language Model-based Document Expansion for Dense Passage Retrieval

G Ma, X Wu, P Wang, Z Lin, S Hu - arxiv preprint arxiv:2308.08285, 2023 - arxiv.org
In this paper, we systematically study the potential of pre-training with Large Language
Model (LLM)-based document expansion for dense passage retrieval. Concretely, we …

LLMProxy: Reducing Cost to Access Large Language Models

N Martin, AB Faisal, H Eltigani, R Haroon… - arxiv preprint arxiv …, 2024 - arxiv.org
In this paper, we make a case for a proxy for large language models which has explicit
support for cost-saving optimizations. We design LLMProxy, which supports three key …

Dual cycle generative adversarial networks for web search

Y Lin, C Ying, B Xu, H Lin - Applied Soft Computing, 2024 - Elsevier
In this work, the IRGAN model is revisited to tackle semi-supervised information retrieval (IR)
problems, considering the premature convergence of IRGAN caused by mismatching the …

CHIFRAUD: A Long-term Web Text Dataset for Chinese Fraud Detection

M Tang, L Zou, Z **, SJ Cui, SN Liang… - Proceedings of the 31st …, 2025 - aclanthology.org
Detecting fraudulent online text is essential, as these manipulative messages exploit human
greed, deceive individuals, and endanger societal security. Currently, this task remains …

LT2R: Learning to Online Learning to Rank for Web Search

X Chu, C Hao, S Wang, D Yin, J Zhao… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Online learning to rank (OLTR), which directly optimizes the ranker with interactive user
feedback, has gained considerable attention in both academia and industry. However, most …

Divided at the Edge-Measuring Performance and the Digital Divide of Cloud Edge Data Centers

N Martin, F Dogar - Proceedings of the ACM on Networking, 2023 - dl.acm.org
Cloud providers are highly incentivized to reduce latency. One way they do this is by
locating data centers as close to users as possible. These “cloud edge” data centers are …