Modeling similarity and psychological space

BD Roads, BC Love - Annual Review of Psychology, 2024 - annualreviews.org
Similarity and categorization are fundamental processes in human cognition that help
complex organisms make sense of the cacophony of information in their environment. These …

[HTML][HTML] Analyzing biological and artificial neural networks: challenges with opportunities for synergy?

DGT Barrett, AS Morcos, JH Macke - Current opinion in neurobiology, 2019 - Elsevier
Highlights•Artificial and biological neural networks can be analyzed using similar
methods.•Neural analysis has revealed similarities between the representations in artificial …

Memorization without overfitting: Analyzing the training dynamics of large language models

K Tirumala, A Markosyan… - Advances in …, 2022 - proceedings.neurips.cc
Despite their wide adoption, the underlying training and memorization dynamics of very
large language models is not well understood. We empirically study exact memorization in …

Do vision transformers see like convolutional neural networks?

M Raghu, T Unterthiner, S Kornblith… - Advances in neural …, 2021 - proceedings.neurips.cc
Convolutional neural networks (CNNs) have so far been the de-facto model for visual data.
Recent work has shown that (Vision) Transformer models (ViT) can achieve comparable or …

A toy model of universality: Reverse engineering how networks learn group operations

B Chughtai, L Chan, N Nanda - International Conference on …, 2023 - proceedings.mlr.press
Universality is a key hypothesis in mechanistic interpretability–that different models learn
similar features and circuits when trained on similar tasks. In this work, we study the …

Layer-wise analysis of a self-supervised speech representation model

A Pasad, JC Chou, K Livescu - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
Recently proposed self-supervised learning approaches have been successful for pre-
training speech representation models. The utility of these learned representations has been …

Learning from failure: De-biasing classifier from biased classifier

J Nam, H Cha, S Ahn, J Lee… - Advances in Neural …, 2020 - proceedings.neurips.cc
Neural networks often learn to make predictions that overly rely on spurious corre-lation
existing in the dataset, which causes the model to be biased. While previous work tackles …

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

Similarity of neural network representations revisited

S Kornblith, M Norouzi, H Lee… - … conference on machine …, 2019 - proceedings.mlr.press
Recent work has sought to understand the behavior of neural networks by comparing
representations between layers and between different trained models. We examine methods …