[HTML][HTML] Data science applications to string theory

F Ruehle - Physics Reports, 2020 - Elsevier
We first introduce various algorithms and techniques for machine learning and data science.
While there is a strong focus on neural network applications in unsupervised, supervised …

Differential privacy and fairness in decisions and learning tasks: A survey

F Fioretto, C Tran, P Van Hentenryck, K Zhu - arxiv preprint arxiv …, 2022 - arxiv.org
This paper surveys recent work in the intersection of differential privacy (DP) and fairness. It
reviews the conditions under which privacy and fairness may have aligned or contrasting …

Why transformers need adam: A hessian perspective

Y Zhang, C Chen, T Ding, Z Li… - Advances in Neural …, 2025 - proceedings.neurips.cc
SGD performs worse than Adam by a significant margin on Transformers, but the reason
remains unclear. In this work, we provide an explanation through the lens of Hessian:(i) …

Characterizing possible failure modes in physics-informed neural networks

A Krishnapriyan, A Gholami, S Zhe… - Advances in neural …, 2021 - proceedings.neurips.cc
Recent work in scientific machine learning has developed so-called physics-informed neural
network (PINN) models. The typical approach is to incorporate physical domain knowledge …

The right to be forgotten in federated learning: An efficient realization with rapid retraining

Y Liu, L Xu, X Yuan, C Wang, B Li - IEEE INFOCOM 2022-IEEE …, 2022 - ieeexplore.ieee.org
In Machine Learning, the emergence of the right to be forgotten gave birth to a paradigm
named machine unlearning, which enables data holders to proactively erase their data from …

Local learning matters: Rethinking data heterogeneity in federated learning

M Mendieta, T Yang, P Wang, M Lee… - Proceedings of the …, 2022 - openaccess.thecvf.com
Federated learning (FL) is a promising strategy for performing privacy-preserving, distributed
learning with a network of clients (ie, edge devices). However, the data distribution among …

Q-bert: Hessian based ultra low precision quantization of bert

S Shen, Z Dong, J Ye, L Ma, Z Yao, A Gholami… - Proceedings of the AAAI …, 2020 - aaai.org
Transformer based architectures have become de-facto models used for a range of Natural
Language Processing tasks. In particular, the BERT based models achieved significant …

Hawq: Hessian aware quantization of neural networks with mixed-precision

Z Dong, Z Yao, A Gholami… - Proceedings of the …, 2019 - openaccess.thecvf.com
Abstract Model size and inference speed/power have become a major challenge in the
deployment of neural networks for many applications. A promising approach to address …

Understanding and robustifying differentiable architecture search

A Zela, T Elsken, T Saikia, Y Marrakchi, T Brox… - arxiv preprint arxiv …, 2019 - arxiv.org
Differentiable Architecture Search (DARTS) has attracted a lot of attention due to its
simplicity and small search costs achieved by a continuous relaxation and an approximation …

Adahessian: An adaptive second order optimizer for machine learning

Z Yao, A Gholami, S Shen, M Mustafa… - proceedings of the …, 2021 - ojs.aaai.org
Incorporating second-order curvature information into machine learning optimization
algorithms can be subtle, and doing so naïvely can lead to high per-iteration costs …