Astronomia ex machina: a history, primer and outlook on neural networks in astronomy

MJ Smith, JE Geach - Royal Society Open Science, 2023 - royalsocietypublishing.org
In this review, we explore the historical development and future prospects of artificial
intelligence (AI) and deep learning in astronomy. We trace the evolution of connectionism in …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arXiv preprint arXiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

Driving and suppressing the human language network using large language models

G Tuckute, A Sathe, S Srikant, M Taliaferro… - Nature Human …, 2024 - nature.com
Transformer models such as GPT generate human-like language and are predictive of
human brain responses to language. Here, using functional-MRI-measured brain responses …

Generative representational instruction tuning

N Muennighoff, H Su, L Wang, N Yang, F Wei… - arXiv preprint arXiv …, 2024 - arxiv.org
All text-based language problems can be reduced to either generation or embedding.
Current models only perform well at one or the other. We introduce generative …

Can neural networks do arithmetic? A survey on the elementary numerical skills of state-of-the-art deep learning models

A Testolin - Applied Sciences, 2024 - mdpi.com
Creating learning models that can exhibit sophisticated reasoning abilities is one of the
greatest challenges in deep learning research, and mathematics is rapidly becoming one of …

Risks and opportunities of open-source generative AI

F Eiras, A Petrov, B Vidgen, C Schroeder… - arXiv preprint arXiv …, 2024 - arxiv.org
Applications of Generative AI (Gen AI) are expected to revolutionize a number of different
areas, ranging from science & medicine to education. The potential for these seismic …

Scaling law for recommendation models: Towards general-purpose user representations

K Shin, H Kwak, SY Kim, MN Ramström… - Proceedings of the …, 2023 - ojs.aaai.org
Recent advancement of large-scale pretrained models such as BERT, GPT-3, CLIP, and
Gopher, has shown astonishing achievements across various task domains. Unlike vision …

InstructProtein: Aligning human and protein language via knowledge instruction

Z Wang, Q Zhang, K Ding, M Qin, X Zhuang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have revolutionized the field of natural language
processing, but they fall short in comprehending biological sequences such as proteins. To …

Infusing behavior science into large language models for activity coaching

N Hegde, M Vardhan, D Nathani… - PLOS Digital …, 2024 - journals.plos.org
Large language models (LLMs) have shown promise for task-oriented dialogue across a
range of domains. The use of LLMs in health and fitness coaching is under-explored …

Processamento de Linguagem Natural: conceitos, técnicas e aplicações em português

HM Caseli, MGV Nunes - 2024 - repositorio.usp.br
Natural Language Processing (NLP) emerged at practically the same time as
computers, around the 1940s, since machine translation between languages …