[PDF][PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with llms, and …
Large language models (LLM) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …
question answering, and reasoning, facilitating various tasks and domains. Despite their …
I3 retriever: incorporating implicit interaction in pre-trained language models for passage retrieval
Passage retrieval is a fundamental task in many information systems, such as web search
and question answering, where both efficiency and effectiveness are critical concerns. In …
and question answering, where both efficiency and effectiveness are critical concerns. In …
User retention-oriented recommendation with decision transformer
Improving user retention with reinforcement learning (RL) has attracted increasing attention
due to its significant importance in boosting user engagement. However, training the RL …
due to its significant importance in boosting user engagement. However, training the RL …
Model-based unbiased learning to rank
Unbiased Learning to Rank (ULTR), ie, learning to rank documents with biased user
feedback data, is a well-known challenge in information retrieval. Existing methods in …
feedback data, is a well-known challenge in information retrieval. Existing methods in …
Pre-training with Large Language Model-based Document Expansion for Dense Passage Retrieval
In this paper, we systematically study the potential of pre-training with Large Language
Model (LLM)-based document expansion for dense passage retrieval. Concretely, we …
Model (LLM)-based document expansion for dense passage retrieval. Concretely, we …
LLMProxy: Reducing Cost to Access Large Language Models
In this paper, we make a case for a proxy for large language models which has explicit
support for cost-saving optimizations. We design LLMProxy, which supports three key …
support for cost-saving optimizations. We design LLMProxy, which supports three key …
Dual cycle generative adversarial networks for web search
In this work, the IRGAN model is revisited to tackle semi-supervised information retrieval (IR)
problems, considering the premature convergence of IRGAN caused by mismatching the …
problems, considering the premature convergence of IRGAN caused by mismatching the …
CHIFRAUD: A Long-term Web Text Dataset for Chinese Fraud Detection
Detecting fraudulent online text is essential, as these manipulative messages exploit human
greed, deceive individuals, and endanger societal security. Currently, this task remains …
greed, deceive individuals, and endanger societal security. Currently, this task remains …
LT2R: Learning to Online Learning to Rank for Web Search
Online learning to rank (OLTR), which directly optimizes the ranker with interactive user
feedback, has gained considerable attention in both academia and industry. However, most …
feedback, has gained considerable attention in both academia and industry. However, most …
Divided at the Edge-Measuring Performance and the Digital Divide of Cloud Edge Data Centers
Cloud providers are highly incentivized to reduce latency. One way they do this is by
locating data centers as close to users as possible. These “cloud edge” data centers are …
locating data centers as close to users as possible. These “cloud edge” data centers are …