ติดตาม
Daniel Hesslow
Daniel Hesslow
Adaptive ML
ยืนยันอีเมลแล้วที่ adaptive-ml.com
ชื่อ
อ้างโดย
อ้างโดย
ปี
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
18012023
The refinedweb dataset for falcon llm: Outperforming curated corpora with web data only
G Penedo, Q Malartic, D Hesslow, R Cojocaru, H Alobeidli, A Cappelli, ...
Advances in Neural Information Processing Systems 36, 79155-79172, 2023
873*2023
The falcon series of open language models
E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ...
arXiv preprint arXiv:2311.16867, 2023
4852023
Falcon-40B: an open large language model with state-of-the-art performance
E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ...
2512023
What language model architecture and pretraining objective works best for zero-shot generalization?
T Wang, A Roberts, D Hesslow, T Le Scao, HW Chung, I Beltagy, ...
International Conference on Machine Learning, 22964-22984, 2022
1832022
What language model to train if you have one million GPU hours?
TL Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...
arXiv preprint arXiv:2210.15424, 2022
1202022
Rita: a study on scaling up generative protein sequence models
D Hesslow, N Zanichelli, P Notin, I Poli, D Marks
arXiv preprint arXiv:2205.05789, 2022
922022
BLOOM: A 176b-parameter open-access multilingual language model. CoRR, abs/2211.05100, 2022. doi: 10.48550
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilic, D Hesslow, R Castagné, ...
arXiv preprint arXiv.2211.05100 10, 0
23
Lighton optical processing unit: Scaling-up AI and HPC with a non von neumann co-processor
C Brossollet, A Cappelli, I Carron, C Chaintoutis, A Chatelain, L Daudet, ...
arXiv preprint arXiv:2107.11814, 2021
112021
Photonic co-processors in HPC: using LightOn OPUs for randomized numerical linear algebra
D Hesslow, A Cappelli, I Carron, L Daudet, R Lafargue, K Müller, ...
arXiv preprint arXiv:2104.14429, 2021
92021
Is the number of trainable parameters all that actually matters?
A Chatelain, A Djeghri, D Hesslow, J Launay
I (Still) Can't Believe It's Not Better! Workshop at NeurIPS 2021, 27-32, 2022
72022
Contrastive embeddings for neural architectures
D Hesslow, I Poli
arXiv preprint arXiv:2102.04208, 2021
72021
Building a Swedish question-answering model
H von Essen, D Hesslow
Proceedings of the Probability and Meaning Conference (PaM 2020), 117-127, 2020
62020
Falcon-40B: an open large language model with state-ofthe-art performance (2023)
E Almazrouei, H Alobeidli, A Alshamsi, A Cappelli, R Cojocaru, M Debbah, ...
S. Kundu, S. Johnston, S. Kravec, SE Showk, S. Fort, T. Telleen-Lawton, T …, 2022
52022
Linear optical random projections without holography
R Ohana, D Hesslow, D Brunner, S Gigan, K Müller
Optics Express 31 (16), 25881-25888, 2023
32023
Scaling laws beyond backpropagation
MJ Filipovich, A Cappelli, D Hesslow, J Launay
arXiv preprint arXiv:2210.14593, 2022
32022
Method and system for machine learning using optical data
I Poli, J Launay, K Müller, G Pariente, I Carron, L Daudet, R Ohana, ...
US Patent 11,574,178, 2023
22023
Artificial neural network training on an optical processor via direct feedback alignment
K Müller, J Launay, I Poli, M Filipovich, A Capelli, D Hesslow, I Carron, ...
The European Conference on Lasers and Electro-Optics, jsiii_3_3, 2023
12023
Photonic co-processors in HPC
D Hesslow, A Cappelli, I Carron, L Daudet, R Lafargue, K Müller, ...
arXiv preprint arXiv:2104.14429, 2021
2021
Real-Time Global Illumination in Web-Browsers
M Bertilsson, D Hesslow, N Jonsson, S Moos, O Persson, H von Essen
2018
ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง
บทความ 1–20