Etai Littwin

Cited by

	All	Since 2020
Citations	580	535
h-index	11	10
i10-index	13	11

Citations per year

Year	Citations
2016	5
2017	13
2018	10
2019	12
2020	15
2021	24
2022	60
2023	117
2024	248
2025	71

Public access

3 articles available, 0 articles not available (based on funding mandates)

Co-authors

Joshua M Susskind, Apple (verified email at apple.com)
Lior Wolf, The School of Computer Science at Tel Aviv University (verified email at cs.tau.ac.il)
Shuangfei Zhai, Apple, Machine Learning Research (verified email at apple.com)
Greg Yang, xAI (verified email at x.ai)
Tomer Galanti, AI Researcher (verified email at tamu.edu)
Ben Myara, Computer Vision, Apple / Research Data Scientist (verified email at apple.com)
Sima Sabah, Apple (verified email at apple.com)
Daniel Cohen-Or, Professor of Computer Science, Tel Aviv University (verified email at tau.ac.il)
Hadar Averbuch-Elor, Assistant Professor, Cornell University (verified email at cornell.edu)


Etai Littwin

Research Scientist at Apple

Verified email at apple.com

Theoretical and practical aspects of machine learning


Title	Cited by	Year
What algorithms can transformers learn? A study in length generalization. H Zhou, A Bradley, E Littwin, N Razin, O Saremi, J Susskind, S Bengio, ... arXiv preprint arXiv:2310.16028, 2023	113	2023
Tensor programs IIb: Architectural universality of neural tangent kernel training dynamics. G Yang, E Littwin. International Conference on Machine Learning, 11762-11772, 2021	73	2021
Stabilizing transformer training by preventing attention entropy collapse. S Zhai, T Likhomanenko, E Littwin, D Busbridge, J Ramapuram, Y Zhang, ... International Conference on Machine Learning, 40770-40803, 2023	62	2023
The slingshot mechanism: An empirical study of adaptive optimizers and the grokking phenomenon. V Thilak, E Littwin, S Zhai, O Saremi, R Paiss, J Susskind. arXiv preprint arXiv:2206.04817, 2022	50	2022
Biometric authentication techniques. DS Prakash, LE Ballard, JV Hauck, F Tang, E Littwin, PKA Vasu, G Littwin, ... US Patent 10,929,515, 2021	41	2021
Transformers learn through gradual rank increase. E Boix-Adsera, E Littwin, E Abbe, S Bengio, J Susskind. Advances in Neural Information Processing Systems 36, 24519-24551, 2023	38*	2023
The multiverse loss for robust transfer learning. E Littwin, L Wolf. Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016	34	2016
On infinite-width hypernetworks. E Littwin, T Galanti, L Wolf, G Yang. Advances in Neural Information Processing Systems 33, 13226-13237, 2020	30*	2020
Tensor programs IVb: Adaptive optimization in the infinite-width limit. G Yang, E Littwin. arXiv preprint arXiv:2308.01814, 2023	22	2023
The loss surface of residual networks: Ensembles and the role of batch normalization. E Littwin, L Wolf. arXiv preprint arXiv:1611.02525, 2016	14	2016
Regularizing by the variance of the activations' sample-variances. E Littwin, L Wolf. Advances in Neural Information Processing Systems 31, 2018	12	2018
When can transformers reason with abstract symbols? E Boix-Adsera, O Saremi, E Abbe, S Bengio, E Littwin, J Susskind. arXiv preprint arXiv:2310.09753, 2023	10	2023
Collegial ensembles. E Littwin, B Myara, S Sabah, J Susskind, S Zhai, O Golan. Advances in Neural Information Processing Systems 33, 18738-18748, 2020	10	2020
Adaptive Optimization in the ∞-Width Limit. E Littwin, G Yang. The Eleventh International Conference on Learning Representations, 2023	8	2023
LiDAR: Sensing linear probing performance in joint embedding SSL architectures. V Thilak, C Huang, O Saremi, L Dinh, H Goh, P Nakkiran, JM Susskind, ... arXiv preprint arXiv:2312.04000, 2023	7	2023
On random kernels of residual architectures. E Littwin, T Galanti, L Wolf. Uncertainty in Artificial Intelligence, 897-907, 2021	7	2021
Biometric authentication techniques. DS Prakash, LE Ballard, JV Hauck, F Tang, E Littwin, PKA Vasu, G Littwin, ... US Patent 11,151,235, 2021	7	2021
Spherical embedding of inlier silhouette dissimilarities. E Littwin, H Averbuch-Elor, D Cohen-Or. Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015	7	2015
Vanishing gradients in reinforcement finetuning of language models. N Razin, H Zhou, O Saremi, V Thilak, A Bradley, P Nakkiran, J Susskind, ... arXiv preprint arXiv:2310.20703, 2023	6	2023
How JEPA avoids noisy features: The implicit bias of deep linear self distillation networks. E Littwin, O Saremi, M Advani, V Thilak, P Nakkiran, C Huang, J Susskind. Advances in Neural Information Processing Systems 37, 91300-91336, 2025	5	2025
