Resource allocation for simultaneous wireless information and power transfer systems: A tutorial overview

Z Wei, X Yu, DWK Ng, R Schober - Proceedings of the IEEE, 2021 - ieeexplore.ieee.org
Over the last decade, simultaneous wireless information and power transfer (SWIPT) has
become a practical and promising solution for connecting and recharging battery-limited …

H2o: Heavy-hitter oracle for efficient generative inference of large language models

Z Zhang, Y Sheng, T Zhou, T Chen… - Advances in …, 2023 - proceedings.neurips.cc
Abstract Large Language Models (LLMs), despite their recent impressive accomplishments,
are notably cost-prohibitive to deploy, particularly for applications involving long-content …

Optimal dispatch of low-carbon integrated energy system considering nuclear heating and carbon trading

Y Li, F Bu, J Gao, G Li - Journal of Cleaner Production, 2022 - Elsevier
The development of miniaturized nuclear power (NP) units and the improvement of the
carbon trading market provide a new way to realize the low-carbon operation of integrated …

Solving linear programs in the current matrix multiplication time

MB Cohen, YT Lee, Z Song - Journal of the ACM (JACM), 2021 - dl.acm.org
This article shows how to solve linear programs of the form min Ax= b, x≥ 0 c⊤ x with n
variables in time O*((n ω+ n 2.5− α/2+ n 2+ 1/6) log (n/δ)), where ω is the exponent of matrix …

Attention scheme inspired softmax regression

Y Deng, Z Li, Z Song - arxiv preprint arxiv:2304.10411, 2023 - arxiv.org
Large language models (LLMs) have made transformed changes for human society. One of
the key computation in LLMs is the softmax unit. This operation is important in LLMs …

Infoprompt: Information-theoretic soft prompt tuning for natural language understanding

J Wu, T Yu, R Wang, Z Song, R Zhang… - Advances in …, 2023 - proceedings.neurips.cc
Soft prompt tuning achieves superior performances across a wide range of few-shot tasks.
However, the performances of prompt tuning can be highly sensitive to the initialization of …

A faster small treewidth sdp solver

Y Gu, Z Song - arxiv preprint arxiv:2211.06033, 2022 - arxiv.org
Semidefinite programming is a fundamental tool in optimization and theoretical computer
science. It has been extensively used as a black-box for solving many problems, such as …

Training multi-layer over-parametrized neural network in subquadratic time

Z Song, L Zhang, R Zhang - arxiv preprint arxiv:2112.07628, 2021 - arxiv.org
We consider the problem of training a multi-layer over-parametrized neural network to
minimize the empirical risk induced by a loss function. In the typical setting of over …

A tighter complexity analysis of sparsegpt

X Li, Y Liang, Z Shi, Z Song - arxiv preprint arxiv:2408.12151, 2024 - arxiv.org
In this work, we improved the analysis of the running time of SparseGPT [Frantar, Alistarh
ICML 2023] from $ O (d^{3}) $ to $ O (d^{\omega}+ d^{2+ a+ o (1)}+ d^{1+\omega (1, 1, a)-a}) …

A faster algorithm for solving general LPs

S Jiang, Z Song, O Weinstein, H Zhang - Proceedings of the 53rd Annual …, 2021 - dl.acm.org
The fastest known LP solver for general (dense) linear programs is due to [Cohen, Lee and
Song'19] and runs in O*(n ω+ n 2.5− α/2+ n 2+ 1/6) time. A number of follow-up works [Lee …