Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Normalization techniques in training dnns: Methodology, analysis and application
Normalization techniques are essential for accelerating the training and improving the
generalization of deep neural networks (DNNs), and have successfully been used in various …
generalization of deep neural networks (DNNs), and have successfully been used in various …
Machine learning in process systems engineering: Challenges and opportunities
This “white paper” is a concise perspective of the potential of machine learning in the
process systems engineering (PSE) domain, based on a session during FIPSE 5, held in …
process systems engineering (PSE) domain, based on a session during FIPSE 5, held in …
Fine-tuning language models with just forward passes
Fine-tuning language models (LMs) has yielded success on diverse downstream tasks, but
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …
Squeezellm: Dense-and-sparse quantization
Generative Large Language Models (LLMs) have demonstrated remarkable results for a
wide range of tasks. However, deploying these models for inference has been a significant …
wide range of tasks. However, deploying these models for inference has been a significant …
Why transformers need adam: A hessian perspective
SGD performs worse than Adam by a significant margin on Transformers, but the reason
remains unclear. In this work, we provide an explanation through the lens of Hessian:(i) …
remains unclear. In this work, we provide an explanation through the lens of Hessian:(i) …
Characterizing possible failure modes in physics-informed neural networks
Recent work in scientific machine learning has developed so-called physics-informed neural
network (PINN) models. The typical approach is to incorporate physical domain knowledge …
network (PINN) models. The typical approach is to incorporate physical domain knowledge …
Sophia: A scalable stochastic second-order optimizer for language model pre-training
Given the massive cost of language model pre-training, a non-trivial improvement of the
optimization algorithm would lead to a material reduction on the time and cost of training …
optimization algorithm would lead to a material reduction on the time and cost of training …
Revisiting weighted aggregation in federated learning with neural networks
In federated learning (FL), weighted aggregation of local models is conducted to generate a
global model, and the aggregation weights are normalized (the sum of weights is 1) and …
global model, and the aggregation weights are normalized (the sum of weights is 1) and …
Diverse weight averaging for out-of-distribution generalization
Standard neural networks struggle to generalize under distribution shifts in computer vision.
Fortunately, combining multiple networks can consistently improve out-of-distribution …
Fortunately, combining multiple networks can consistently improve out-of-distribution …
Full stack optimization of transformer inference: a survey
Recent advances in state-of-the-art DNN architecture design have been moving toward
Transformer models. These models achieve superior accuracy across a wide range of …
Transformer models. These models achieve superior accuracy across a wide range of …