The llama 3 herd of models

A Dubey, A Jauhri, A Pandey, A Kadian… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern artificial intelligence (AI) systems are powered by foundation models. This paper
presents a new set of foundation models, called Llama 3. It is a herd of language models …

AI-driven research in pure mathematics and theoretical physics

YH He - Nature Reviews Physics, 2024 - nature.com
The past five years have seen a dramatic increase in the usage of artificial intelligence (AI)
algorithms in pure mathematics and theoretical sciences. This might appear counter-intuitive …

Knowledge mechanisms in large language models: A survey and perspective

M Wang, Y Yao, Z Xu, S Qiao, S Deng, P Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Understanding knowledge mechanisms in Large Language Models (LLMs) is crucial for
advancing towards trustworthy AGI. This paper reviews knowledge mechanism analysis …

A systematic assessment of openai o1-preview for higher order thinking in education

E Latif, Y Zhou, S Guo, Y Gao, L Shi… - arxiv preprint arxiv …, 2024 - arxiv.org
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable
to human intelligence, with significant potential to transform education and workforce …

Ai-assisted generation of difficult math questions

V Shah, D Yu, K Lyu, S Park, J Yu, Y He, NR Ke… - arxiv preprint arxiv …, 2024 - arxiv.org
Current LLM training positions mathematical reasoning as a core capability. With publicly
available sources fully tapped, there is unmet demand for diverse and challenging math …

Towards building specialized generalist ai with system 1 and system 2 fusion

K Zhang, B Qi, B Zhou - arxiv preprint arxiv:2407.08642, 2024 - arxiv.org
In this perspective paper, we introduce the concept of Specialized Generalist Artificial
Intelligence (SGAI or simply SGI) as a crucial milestone toward Artificial General Intelligence …

Large language models for base station siting: Intelligent deployment based on prompt or agent

Y Wang, MM Afzal, Z Li, J Zhou, C Feng, S Guo… - arxiv preprint arxiv …, 2024 - arxiv.org
Traditional base station siting (BSS) methods rely heavily on drive testing and user
feedback, which are laborious and require extensive expertise in communication …

[PDF][PDF] Explicit memory learning with expectation maximization

Z Yin, Q Sun, Q Guo, Z Zeng, Q Cheng… - Proceedings of the …, 2024 - aclanthology.org
Abstract Large Language Models (LLMs) have revolutionized the landscape of natural
language processing, demonstrating remarkable abilities across various complex tasks …

Smart vision-language reasoners

D Roberts, L Roberts - arxiv preprint arxiv:2407.04212, 2024 - arxiv.org
In this article, we investigate vision-language models (VLM) as reasoners. The ability to form
abstractions underlies mathematical reasoning, problem-solving, and other Math AI tasks …

Can LLMs learn by teaching for better reasoning? A preliminary study

X Ning, Z Wang, S Li, Z Lin, P Yao, T Fu… - arxiv preprint arxiv …, 2024 - arxiv.org
Teaching to improve student models (eg, knowledge distillation) is an extensively studied
methodology in LLMs. However, for humans, teaching improves not only students but also …