Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
An overview of multi-agent reinforcement learning from game theoretical perspective
Y Yang, J Wang - arxiv preprint arxiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …
Decision-focused learning: Foundations, state of the art, benchmark and future opportunities
Decision-focused learning (DFL) is an emerging paradigm that integrates machine learning
(ML) and constrained optimization to enhance decision quality by training ML models in an …
(ML) and constrained optimization to enhance decision quality by training ML models in an …
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons
We provide a theoretical framework for Reinforcement Learning with Human Feedback
(RLHF). We show that when the underlying true reward is linear, under both Bradley-Terry …
(RLHF). We show that when the underlying true reward is linear, under both Bradley-Terry …
Dense text retrieval based on pretrained language models: A survey
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …
required to return relevant information resources to user's queries in natural language. From …
Efficiently teaching an effective dense retriever with balanced topic aware sampling
A vital step towards the widespread adoption of neural retrieval models is their resource
efficiency throughout the training, indexing and query workflows. The neural IR community …
efficiency throughout the training, indexing and query workflows. The neural IR community …
Rocketqav2: A joint training method for dense passage retrieval and passage re-ranking
In various natural language processing tasks, passage retrieval and passage re-ranking are
two key procedures in finding and ranking relevant information. Since both the two …
two key procedures in finding and ranking relevant information. Since both the two …
[BUKU][B] Pretrained transformers for text ranking: Bert and beyond
The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in
response to a query. Although the most common formulation of text ranking is search …
response to a query. Although the most common formulation of text ranking is search …
Identification of potent antimicrobial peptides via a machine-learning pipeline that mines the entire space of peptide sequences
J Huang, Y Xu, Y Xue, Y Huang, X Li, X Chen… - Nature Biomedical …, 2023 - nature.com
Systematically identifying functional peptides is difficult owing to the vast combinatorial
space of peptide sequences. Here we report a machine-learning pipeline that mines the …
space of peptide sequences. Here we report a machine-learning pipeline that mines the …
Dcn v2: Improved deep & cross network and practical lessons for web-scale learning to rank systems
Learning effective feature crosses is the key behind building recommender systems.
However, the sparse and large feature space requires exhaustive search to identify effective …
However, the sparse and large feature space requires exhaustive search to identify effective …
Leveraging large language models in conversational recommender systems
L Friedman, S Ahuja, D Allen, Z Tan… - arxiv preprint arxiv …, 2023 - arxiv.org
A Conversational Recommender System (CRS) offers increased transparency and control to
users by enabling them to engage with the system through a real-time multi-turn dialogue …
users by enabling them to engage with the system through a real-time multi-turn dialogue …