Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Prism: A framework for decoupling and assessing the capabilities of vlms
Abstract Vision Language Models (VLMs) demonstrate remarkable proficiency in addressing
a wide array of visual questions, which requires strong perception and reasoning faculties …
a wide array of visual questions, which requires strong perception and reasoning faculties …
A peek into token bias: Large language models are not yet genuine reasoners
This study introduces a hypothesis-testing framework to assess whether large language
models (LLMs) possess genuine reasoning abilities or primarily depend on token bias. We …
models (LLMs) possess genuine reasoning abilities or primarily depend on token bias. We …
Evaluating ChatGPT-4 Vision on Brazil's National Undergraduate Computer Science Exam
NC Mendonça - ACM Transactions on Computing Education, 2024 - dl.acm.org
The recent integration of visual capabilities into Large Language Models (LLMs) has the
potential to play a pivotal role in science and technology education, where visual elements …
potential to play a pivotal role in science and technology education, where visual elements …
What is the visual cognition gap between humans and multimodal llms?
Recently, Multimodal Large Language Models (MLLMs) have shown great promise in
language-guided perceptual tasks such as recognition, segmentation, and object detection …
language-guided perceptual tasks such as recognition, segmentation, and object detection …
Bongard in Wonderland: Visual Puzzles that Still Make AI Go Mad?
Recently, newly developed Vision-Language Models (VLMs), such as OpenAI's GPT-4o,
have emerged, seemingly demonstrating advanced reasoning capabilities across text and …
have emerged, seemingly demonstrating advanced reasoning capabilities across text and …
Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything
Large Visual Language Model\textbfs (VLMs) such as GPT-4V have achieved remarkable
success in generating comprehensive and nuanced responses. Researchers have …
success in generating comprehensive and nuanced responses. Researchers have …
VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Multimodal Large Language Models (MLLMs) have demonstrated remarkable
advancements in multimodal understanding; however, their fundamental visual cognitive …
advancements in multimodal understanding; however, their fundamental visual cognitive …
Visual scratchpads: Enabling global reasoning in vision
Modern vision models have achieved remarkable success in benchmarks where local
features provide critical information about the target. There is now a growing interest in …
features provide critical information about the target. There is now a growing interest in …
Towards Learning to Reason: Comparing LLMs with Neuro-Symbolic on Arithmetic Relations in Abstract Reasoning
This work compares large language models (LLMs) and neuro-symbolic approaches in
solving Raven's progressive matrices (RPM), a visual abstract reasoning test that involves …
solving Raven's progressive matrices (RPM), a visual abstract reasoning test that involves …
Benchmarking Visual Cognition of Multimodal LLMs via Matrix Reasoning
Recently, Multimodal Large Language Models (MLLMs) and Vision Language Models
(VLMs) have shown great promise in language-guided perceptual tasks such as recognition …
(VLMs) have shown great promise in language-guided perceptual tasks such as recognition …