Chatting about ChatGPT: how may AI and GPT impact academia and libraries?

BD Lund, T Wang - Library Hi Tech News, 2023 - emerald.com
Purpose: This paper aims to provide an overview of key definitions related to ChatGPT, a
public tool developed by OpenAI, and its underlying technology, Generative Pretrained …

One small step for generative AI, one giant leap for AGI: A complete survey on ChatGPT in AIGC era

C Zhang, C Zhang, C Li, Y Qiao, S Zheng… - arXiv preprint arXiv …, 2023 - arxiv.org
OpenAI has recently released GPT-4 (aka ChatGPT plus), which is demonstrated to be one
small step for generative AI (GAI), but one giant leap for artificial general intelligence (AGI) …

MiniCPM-V: A GPT-4V level MLLM on your phone

Y Yao, T Yu, A Zhang, C Wang, J Cui, H Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally
reshaped the landscape of AI research and industry, shedding light on a promising path …

What if the TV was off? Examining counterfactual reasoning abilities of multi-modal language models

L Zhang, X Zhai, Z Zhao, Y Zong… - Proceedings of the …, 2024 - openaccess.thecvf.com
Counterfactual reasoning, a fundamental aspect of human cognition, involves contemplating
alternatives to established facts or past events, significantly enhancing our abilities in …

DeepSeek-VL2: Mixture-of-experts vision-language models for advanced multimodal understanding

Z Wu, X Chen, Z Pan, X Liu, W Liu, D Dai… - arXiv preprint arXiv …, 2024 - arxiv.org
We present DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-
Language Models that significantly improves upon its predecessor, DeepSeek-VL, through …

Battle of the wordsmiths: Comparing ChatGPT, GPT-4, Claude, and Bard

A Borji, M Mohammadian - GPT-4, Claude, and Bard (June 12 …, 2023 - researchgate.net
Although informal evaluations of modern LLMs can be found on social media, blogs, and
news outlets, a formal and comprehensive comparison among them has yet to be …

KiVA: Kid-inspired visual analogies for testing large multimodal models

E Yiu, M Qraitem, C Wong, AN Majhi, Y Bai… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper investigates visual analogical reasoning in large multimodal models (LMMs)
compared to human adults and children. A "visual analogy" is an abstract rule inferred from …

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

A Cherian, KC Peng, S Lohit, J Matthiesen… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent years have seen significant progress in the general-purpose problem solving
abilities of large vision and language models (LVLMs), such as ChatGPT, Gemini, etc.; some …

Towards an Exhaustive Evaluation of Vision-Language Foundation Models

E Salin, S Ayache, B Favre - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Vision-language foundation models have seen a considerable increase in performance in the
last few years. However, there is still a lack of comprehensive evaluation methods able to …

Herausforderungen und Entwicklungsmöglichkeiten für die Mathematikdidaktik durch generative KI-Sprachmodelle [Challenges and development opportunities for mathematics education posed by generative AI language models]

N Buchholtz, L Baumanns… - … für Didaktik der …, 2023 - ojs.didaktik-der-mathematik.de
Challenges and development opportunities for mathematics education posed by generative
AI language models. GDM-Mitteilungen 114 · 2023, Magazin 19 …