Saleh Ashkboos
Title
Cited by
Year
OPTQ: Accurate post-training quantization for generative pre-trained transformers
E Frantar, S Ashkboos, T Hoefler, DA Alistarh
11th International Conference on Learning Representations, 2023
Cited by 1067, 2023
SpQR: A sparse-quantized representation for near-lossless LLM weight compression
T Dettmers, R Svirschevski, V Egiazarian, D Kuznedelev, E Frantar, ...
arXiv preprint arXiv:2306.03078, 2023
Cited by 212, 2023
SparCML: High-performance sparse communication for machine learning
C Renggli, S Ashkboos, M Aghagolzadeh, D Alistarh, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2019
Cited by 153, 2019
SliceGPT: Compress large language models by deleting rows and columns
S Ashkboos, ML Croci, MG Nascimento, T Hoefler, J Hensman
arXiv preprint arXiv:2401.15024, 2024
Cited by 131, 2024
QuaRot: Outlier-free 4-bit inference in rotated LLMs
S Ashkboos, A Mohtashami, M Croci, B Li, P Cameron, M Jaggi, D Alistarh, ...
Advances in Neural Information Processing Systems 37, 100213-100240, 2025
Cited by 91, 2025
Flare: Flexible in-network allreduce
D De Sensi, S Di Girolamo, S Ashkboos, S Li, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2021
Cited by 49, 2021
Motif prediction with graph neural networks
M Besta, R Grob, C Miglioli, N Bernold, G Kwasniewski, G Gjini, ...
Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and …, 2022
Cited by 39, 2022
New bounds for distributed mean estimation and variance reduction
P Davies, V Gurunathan, N Moshrefi, S Ashkboos, D Alistarh
arXiv preprint arXiv:2002.09268, 2020
Cited by 34*, 2020
QUIK: Towards end-to-end 4-bit inference on generative large language models
S Ashkboos, I Markov, E Frantar, T Zhong, X Wang, J Ren, T Hoefler, ...
arXiv preprint arXiv:2310.09259, 2023
Cited by 27, 2023
ENS-10: A dataset for post-processing ensemble weather forecasts
S Ashkboos, L Huang, N Dryden, T Ben-Nun, P Dueben, L Gianinazzi, ...
Advances in Neural Information Processing Systems 35, 21974-21987, 2022
Cited by 26, 2022
ProbGraph: High-performance and high-accuracy graph mining with probabilistic set representations
M Besta, C Miglioli, PS Labini, J Tětek, P Iff, R Kanakagiri, S Ashkboos, ...
SC22: International Conference for High Performance Computing, Networking …, 2022
Cited by 10, 2022
The spatial computer: A model for energy-efficient parallel computation
L Gianinazzi, T Ben-Nun, M Besta, S Ashkboos, Y Baumann, P Luczynski, ...
arXiv preprint arXiv:2205.04934, 2022
Cited by 6, 2022
Multi-way sparsest cut problem on trees with a control on the number of parts and outliers
R Javadi, S Ashkboos
Discrete Applied Mathematics 289, 281-291, 2021
Cited by 5*, 2021
Arrow matrix decomposition: A novel approach for communication-efficient sparse matrix multiplication
L Gianinazzi, AN Ziogas, L Huang, P Luczynski, S Ashkboos, F Scheidl, ...
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
Cited by 4, 2024
STen: Productive and efficient sparsity in PyTorch
A Ivanov, N Dryden, T Ben-Nun, S Ashkboos, T Hoefler
arXiv preprint arXiv:2304.07613, 2023
Cited by 4, 2023
Minimum cuts of distance-regular digraphs
S Ashkboos, G Omidi, F Shafiei, K Tajbakhsh
The Electronic Journal of Combinatorics, P4.2, 2017
Cited by 2, 2017
An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees
R Javadi, S Ashkboos
arXiv preprint arXiv:1702.04739, 2017
Cited by 2, 2017
Computational Bottlenecks of Training Small-scale Large Language Models
S Ashkboos, I Mirzadeh, K Alizadeh, MH Sekhavat, M Nabi, M Farajtabar, ...
arXiv preprint arXiv:2410.19456, 2024
Cited by 1, 2024
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
S Ashkboos, M Nikdan, S Tabesh, RL Castro, T Hoefler, D Alistarh
arXiv preprint arXiv:2501.02625, 2025
2025
EfQAT: An Efficient Framework for Quantization-Aware Training
S Ashkboos, B Verhoef, T Hoefler, E Eleftheriou, M Dazzi
arXiv preprint arXiv:2411.11038, 2024
2024