Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Transformers in reinforcement learning: a survey
Transformers have significantly impacted domains like natural language processing,
computer vision, and robotics, where they improve performance compared to other neural …
computer vision, and robotics, where they improve performance compared to other neural …
Deep reinforcement learning
SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …
decision strategies. However, in many cases, it is desirable to learn directly from …
Simulation-guided beam search for neural combinatorial optimization
Neural approaches for combinatorial optimization (CO) equip a learning mechanism to
discover powerful heuristics for solving complex real-world problems. While neural …
discover powerful heuristics for solving complex real-world problems. While neural …
Stochastic beams and where to find them: The gumbel-top-k trick for sampling sequences without replacement
Abstract The well-known Gumbel-Max trick for sampling from a categorical distribution can
be extended to sample $ k $ elements without replacement. We show how to implicitly apply …
be extended to sample $ k $ elements without replacement. We show how to implicitly apply …
Deep learning for approximate nearest neighbour search: A survey and future directions
Approximate nearest neighbour search (ANNS) in high-dimensional space is an essential
and fundamental operation in many applications from many domains such as multimedia …
and fundamental operation in many applications from many domains such as multimedia …
Learning deductive reasoning from synthetic corpus based on formal logic
We study a synthetic corpus based approach for language models (LMs) to acquire logical
deductive reasoning ability. The previous studies generated deduction examples using …
deductive reasoning ability. The previous studies generated deduction examples using …
Learning optimal tree models under beam search
Retrieving relevant targets from an extremely large target set under computational limits is a
common challenge for information retrieval and recommendation systems. Tree models …
common challenge for information retrieval and recommendation systems. Tree models …
Lambdabeam: Neural program search with higher-order functions and lambdas
Search is an important technique in program synthesis that allows for adaptive strategies
such as focusing on particular search directions based on execution results. Several prior …
such as focusing on particular search directions based on execution results. Several prior …
Estimating gradients for discrete random variables by sampling without replacement
We derive an unbiased estimator for expectations over discrete random variables based on
sampling without replacement, which reduces variance as it avoids duplicate samples. We …
sampling without replacement, which reduces variance as it avoids duplicate samples. We …
Reinforcement routing on proximity graph for efficient recommendation
We focus on Maximum Inner Product Search (MIPS), which is an essential problem in many
machine learning communities. Given a query, MIPS finds the most similar items with the …
machine learning communities. Given a query, MIPS finds the most similar items with the …