Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Teaching models to express their uncertainty in words
We show that a GPT-3 model can learn to express uncertainty about its own answers in
natural language--without use of model logits. When given a question, the model generates …
natural language--without use of model logits. When given a question, the model generates …
Approaching human-level forecasting with language models
Forecasting future events is important for policy and decision making. In this work, we study
whether language models (LMs) can forecast at the level of competitive human forecasters …
whether language models (LMs) can forecast at the level of competitive human forecasters …
Natural selection favors AIs over humans
D Hendrycks - arxiv preprint arxiv:2303.16200, 2023 - arxiv.org
For billions of years, evolution has been the driving force behind the development of life,
including humans. Evolution endowed humans with high intelligence, which allowed us to …
including humans. Evolution endowed humans with high intelligence, which allowed us to …
Back to the future: Towards explainable temporal reasoning with large language models
Temporal reasoning is a crucial natural language processing (NLP) task, providing a
nuanced understanding of time-sensitive contexts within textual data. Although recent …
nuanced understanding of time-sensitive contexts within textual data. Although recent …
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
As artificial intelligence systems grow more powerful, there has been increasing interest in"
AI safety" research to address emerging and future risks. However, the field of AI safety …
AI safety" research to address emerging and future risks. However, the field of AI safety …
Temporal knowledge graph forecasting without knowledge using in-context learning
Temporal knowledge graph (TKG) forecasting benchmarks challenge models to predict
future facts using knowledge of past facts. In this paper, we apply large language models …
future facts using knowledge of past facts. In this paper, we apply large language models …
Evaluating superhuman models with consistency checks
If machine learning models were to achieve superhuman abilities at various reasoning or
decision-making tasks, how would we go about evaluating such models, given that humans …
decision-making tasks, how would we go about evaluating such models, given that humans …
[KNJIGA][B] Introduction to AI safety, ethics, and society
D Hendrycks - 2025 - library.oapen.org
As AI technology is rapidly progressing in capability and being adopted more widely across
society, it is more important than ever to understand the potential risks AI may pose and how …
society, it is more important than ever to understand the potential risks AI may pose and how …
[HTML][HTML] Humans vs. large language models: Judgmental forecasting in an era of advanced AI
This study investigates the forecasting accuracy of human experts versus large language
models (LLMs) in the retail sector, particularly during standard and promotional sales …
models (LLMs) in the retail sector, particularly during standard and promotional sales …
Mirai: Evaluating llm agents for event forecasting
Recent advancements in Large Language Models (LLMs) have empowered LLM agents to
autonomously collect world information, over which to conduct reasoning to solve complex …
autonomously collect world information, over which to conduct reasoning to solve complex …