Fairness in information access systems
Recommendation, information retrieval, and other information access systems pose unique
challenges for investigating and applying the fairness and non-discrimination concepts that …
Scaling laws do not scale
F Diaz, M Madaio - Proceedings of the AAAI/ACM Conference on AI …, 2024 - ojs.aaai.org
Recent work has advocated for training AI models on ever-larger datasets, arguing that as
the size of a dataset increases, the performance of a model trained on that dataset will …
Aligning offline metrics and human judgments of value for code generation models
Large language models have demonstrated great potential to assist programmers in
generating code. For such human-AI pair programming scenarios, we empirically …
Preference-based offline evaluation
A core step in production model research and development involves the offline evaluation of
a system before production deployment. Traditional offline evaluation of search …
Measuring commonality in recommendation of cultural content to strengthen cultural citizenship
Recommender systems have become the dominant means of curating cultural content,
significantly influencing the nature of individual cultural experience. While the majority of …
Recall, robustness, and lexicographic evaluation
Although originally developed to evaluate sets of items, recall is often used to evaluate
rankings of items, including those produced by recommender, retrieval, and other machine …
Best-Case Retrieval Evaluation: Improving the Sensitivity of Reciprocal Rank with Lexicographic Precision
F Diaz - arXiv preprint arXiv:2306.07908, 2023 - arxiv.org
Across a variety of ranking tasks, researchers use reciprocal rank to measure the
effectiveness for users interested in exactly one relevant item. Despite its widespread use …
Offline Evaluation of Set-Based Text-to-Image Generation
Text-to-Image (TTI) systems often support people during ideation, the early stages of a
creative process when exposure to a broad set of relevant or partially relevant images can …
Unified browsing models for linear and grid layouts
Many information access systems operationalize their results in terms of rankings, which are
then displayed to users in various ranking layouts such as linear lists or grids. User …
Mixed method development of evaluation metrics
Designers of online search and recommendation services often need to develop metrics to
assess system performance. This tutorial focuses on mixed methods approaches to …