Segui
Behnam Neyshabur
Behnam Neyshabur
Member of Technical Staff, Anthropic
Email verificata su anthropic.com - Home page
Titolo
Citata da
Citata da
Anno
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
24642023
Exploring generalization in deep learning
B Neyshabur, S Bhojanapalli, D McAllester, N Srebro
Advances in Neural Information Processing Systems, 2017
15142017
Sharpness-Aware Minimization for Efficiently Improving Generalization
P Foret, A Kleiner, H Mobahi, B Neyshabur
International Conference on Learning Representations, 2021
14942021
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
Transactions on Machine Learning Research, 2023
12912023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
9722024
In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning
B Neyshabur, R Tomioka, N Srebro
International Conference on Learning Representations, 2015
7542015
Stronger generalization bounds for deep nets via a compression approach
S Arora, R Ge, B Neyshabur, Y Zhang
The 35th International Conference on Machine Learning, 2018
7202018
A pac-bayesian approach to spectrally-normalized margin bounds for neural networks
B Neyshabur, S Bhojanapalli, N Srebro
International Conference on Learning Representations, 2018
7052018
Fantastic Generalization Measures and Where to Find Them
Y Jiang, B Neyshabur, H Mobahi, D Krishnan, S Bengio
International Conference on Learning Representations, 2020
7042020
Norm-Based Capacity Control in Neural Networks
B Neyshabur, R Tomioka, N Srebro
Conference on Learning Theory, 1376–1401, 2015
6762015
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems, 2022
6702022
Towards understanding the role of over-parametrization in generalization of neural networks
B Neyshabur, Z Li, S Bhojanapalli, Y LeCun, N Srebro
International Conference on Learning Representations, 2019
6362019
What is being transferred in transfer learning?
B Neyshabur, H Sedghi, C Zhang
Advances in Neural Information Processing Systems, 2020
5822020
Implicit regularization in matrix factorization
S Gunasekar, BE Woodworth, S Bhojanapalli, B Neyshabur, N Srebro
Advances in neural information processing systems 30, 2017
5692017
Global Optimality of Local Search for Low Rank Matrix Recovery
S Bhojanapalli, B Neyshabur, N Srebro
Advances in Neural Information Processing Systems, 2016
4602016
Predicting protein–protein interactions through sequence-based deep learning
S Hashemifar, B Neyshabur, AA Khan, J Xu
Bioinformatics 34 (17), i802-i810, 2018
3652018
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
B Neyshabur, RR Salakhutdinov, N Srebro
Advances in Neural Information Processing Systems, 2413-2421, 2015
3452015
Gemma 2: Improving open language models at a practical size
G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ...
arXiv preprint arXiv:2408.00118, 2024
3102024
Long range language modeling via gated state spaces
H Mehta, A Gupta, A Cutkosky, B Neyshabur
International Conference on Learning Representations, 2023
2272023
On Symmetric and Asymmetric LSHs for Inner Product Search
B Neyshabur, N Srebro
The 32nd International Conference on Machine Learning, 1926–1934, 2015
2272015
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20