A theory for compressibility of graph transformers for transductive learning
Transductive tasks on graphs differ fundamentally from typical supervised machine learning
tasks, as the independent and identically distributed (iid) assumption does not hold among …
tasks, as the independent and identically distributed (iid) assumption does not hold among …
Normalization Matters for Optimization Performance on Graph Neural Networks
We show that feature normalization has a drastic impact on the performance of optimization
algorithms in the context of graph neural networks. The standard normalization scheme …
algorithms in the context of graph neural networks. The standard normalization scheme …