From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

J Russin, SW McGrath, DJ Williams… - ar** an unknown word (such as a novel
noun Raun) to an inflected form (such as the plural Rauns), has historically proven a …

Recurrent Transformers Trade-off Parallelism for Length Generalization on Regular Languages

P Soulos, A Terzic, M Hersche… - The First Workshop on …, 2024 - openreview.net
Transformers have achieved remarkable success in Natural Language Processing but
struggle with state tracking and algorithmic reasoning tasks, such as modeling Regular …