FinGPT: Democratizing internet-scale data for financial large language models
Large language models (LLMs) have demonstrated remarkable proficiency in
understanding and generating human-like text, which may revolutionize the …
Differentially private low-rank adaptation of large language model using federated learning
The surge in interest and application of large language models (LLMs) has sparked a drive
to fine-tune these models to suit specific applications, such as finance and medical science …
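As a rough illustration of the idea, not the paper's algorithm: each client communicates only its low-rank LoRA factors, the server clips every client's flattened update, and Gaussian noise is added before averaging. All names here (`dp_aggregate`, `clip_norm`, `noise_mult`) are hypothetical.

```python
import torch

def dp_aggregate(client_updates, clip_norm=1.0, noise_mult=0.5):
    """Clip each client's flattened LoRA update, average, then add Gaussian noise."""
    clipped = []
    for update in client_updates:
        flat = torch.cat([p.flatten() for p in update])
        # Per-client clipping bounds each client's influence (the DP sensitivity).
        scale = torch.clamp(clip_norm / (flat.norm() + 1e-12), max=1.0)
        clipped.append(flat * scale)
    avg = torch.stack(clipped).mean(dim=0)
    # Gaussian mechanism: noise calibrated to the clipping bound.
    noise = torch.randn_like(avg) * (noise_mult * clip_norm / len(client_updates))
    return avg + noise

# Each client only sends the low-rank factors A (r x d) and B (d x r), so far
# fewer parameters are clipped and noised than with full-model fine-tuning.
r, d, n_clients = 8, 64, 4
updates = [[torch.randn(r, d), torch.randn(d, r)] for _ in range(n_clients)]
print(dp_aggregate(updates).shape)  # torch.Size([1024])
```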
One less reason for filter pruning: Gaining free adversarial robustness with structured grouped kernel pruning
Densely structured pruning methods utilizing simple pruning heuristics can deliver
immediate compression and acceleration benefits with acceptable benign performance …
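For intuition only, here is a minimal densely structured pruning sketch that removes whole convolution filters by L1 norm; the paper's grouped kernel pruning works at a finer granularity, and `prune_filters`/`keep_ratio` are illustrative names.

```python
import torch
import torch.nn as nn

def prune_filters(conv: nn.Conv2d, keep_ratio: float = 0.5) -> nn.Conv2d:
    """Keep the output filters with the largest L1 norms and drop the rest."""
    w = conv.weight.data                            # (out_c, in_c, kH, kW)
    scores = w.abs().sum(dim=(1, 2, 3))             # one L1 score per filter
    n_keep = max(1, int(keep_ratio * w.size(0)))
    keep = scores.topk(n_keep).indices.sort().values
    pruned = nn.Conv2d(conv.in_channels, n_keep, conv.kernel_size,
                       stride=conv.stride, padding=conv.padding,
                       bias=conv.bias is not None)
    # Whole filters vanish, so the layer genuinely shrinks (dense speedup,
    # no sparse kernels needed).
    pruned.weight.data = w[keep].clone()
    if conv.bias is not None:
        pruned.bias.data = conv.bias.data[keep].clone()
    return pruned

conv = nn.Conv2d(16, 32, kernel_size=3)
print(prune_filters(conv).weight.shape)  # torch.Size([16, 16, 3, 3])
```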
DSpar: An embarrassingly simple strategy for efficient GNN training and inference via degree-based sparsification
Running Graph Neural Networks (GNNs) on large graphs is notoriously
inefficient. This is attributed to the sparse graph-based operations, which are hard to be …
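A toy version of degree-based sparsification, assuming the score 1/deg(u) + 1/deg(v) as a cheap edge-importance proxy (in the spirit of, though not identical to, the paper's criterion); `sparsify` and its arguments are hypothetical names.

```python
import numpy as np

def sparsify(edges: np.ndarray, num_nodes: int, keep_ratio: float = 0.5):
    """edges: (E, 2) integer array; returns the kept subset of edges."""
    deg = np.bincount(edges.ravel(), minlength=num_nodes)
    # 1/deg(u) + 1/deg(v) needs only node degrees, so scoring is trivial:
    # edges between high-degree hubs score lowest and are dropped first.
    score = 1.0 / deg[edges[:, 0]] + 1.0 / deg[edges[:, 1]]
    n_keep = int(keep_ratio * len(edges))
    keep = np.argsort(-score)[:n_keep]
    return edges[keep]

edges = np.array([[0, 1], [1, 2], [2, 3], [1, 3], [0, 2]])
print(sparsify(edges, num_nodes=4, keep_ratio=0.6))  # 3 surviving edges
```

Because the surviving graph is simply smaller, both training and inference get faster with no change to the GNN itself.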
Intelligent practices of large language models in digital government services
J Han, J Lu, Y Xu, J You, B Wu - IEEE Access, 2024 - ieeexplore.ieee.org
Large language models have been widely used in open-domain tasks with significant
results, and can also answer closed-ended questions zero-shot based on …
Large language models as faithful explainers
Large Language Models (LLMs) have recently become proficient in addressing complex
tasks by utilizing their rich internal knowledge and reasoning ability. Consequently, this …
TBA: Faster Large Language Model Training Using SSD-Based Activation Offloading
The growth rate of GPU memory capacity has not kept up with that of the
size of large language models (LLMs), hindering the model training process. In particular …
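To make the mechanism concrete, the toy `torch.autograd.Function` below (a hypothetical name) writes an activation to disk in the forward pass and reads it back in the backward pass; a real system of the kind the abstract describes would overlap these SSD transfers with compute, which this synchronous sketch omits.

```python
import os
import tempfile
import torch

class OffloadedActivation(torch.autograd.Function):
    """Identity op that parks its input on disk between forward and backward."""

    @staticmethod
    def forward(ctx, x):
        path = os.path.join(tempfile.gettempdir(), f"act_{id(ctx)}.pt")
        torch.save(x.detach().cpu(), path)   # spill the activation to SSD
        ctx.path = path
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_out):
        saved = torch.load(ctx.path)         # fetch it back for the backward pass
        os.remove(ctx.path)
        del saved                            # a real op would consume it here
        return grad_out

x = torch.randn(4, 4, requires_grad=True)
OffloadedActivation.apply(x).sum().backward()
print(x.grad.shape)  # torch.Size([4, 4])
```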
FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference
Z Liu, Q Song, QC Xiao, SK Selvaraj… - arXiv preprint arXiv …, 2024 - arxiv.org
The large number of parameters in Pretrained Language Models enhances their
performance but also makes them resource-intensive, making it challenging to deploy them …
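The splitting idea can be sketched as partitioning an FFN's hidden neurons into a small "heavy" block and the remainder, so that the two sub-networks sum exactly to the original output and the large block can then be compressed separately. The weight-norm importance score below is an illustrative stand-in for the paper's criterion, and `split_ffn` is a hypothetical name.

```python
import torch
import torch.nn as nn

def split_ffn(fc1: nn.Linear, fc2: nn.Linear, heavy_frac: float = 0.25):
    """Partition hidden neurons into a small 'heavy' FFN and the remainder."""
    score = fc1.weight.norm(dim=1) * fc2.weight.norm(dim=0)  # per-neuron norm
    n_heavy = max(1, int(heavy_frac * fc1.out_features))
    order = score.argsort(descending=True)
    heavy, rest = order[:n_heavy], order[n_heavy:]

    def sub_ffn(idx):
        a = nn.Linear(fc1.in_features, len(idx), bias=False)
        b = nn.Linear(len(idx), fc2.out_features, bias=False)
        a.weight.data = fc1.weight.data[idx].clone()
        b.weight.data = fc2.weight.data[:, idx].clone()
        return a, b

    return sub_ffn(heavy), sub_ffn(rest)

fc1, fc2 = nn.Linear(8, 32, bias=False), nn.Linear(32, 8, bias=False)
(h1, h2), (r1, r2) = split_ffn(fc1, fc2)
x = torch.randn(2, 8)
full = fc2(torch.relu(fc1(x)))
split = h2(torch.relu(h1(x))) + r2(torch.relu(r1(x)))  # exact: ReLU is elementwise
print(torch.allclose(full, split, atol=1e-5))          # True
```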
ViTeGNN: Towards Versatile Inference of Temporal Graph Neural Networks on FPGA
Temporal Graph Neural Networks (TGNNs) are powerful models for capturing temporal,
structural, and contextual information on temporal graphs, outperforming other methods in …
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
Zeroth-order optimization (ZO) is a memory-efficient strategy for fine-tuning Large Language
Models using only forward passes. However, the application of ZO fine-tuning in memory …
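A toy example of one sparse zeroth-order step in the MeZO style: estimate a directional derivative from two forward passes and update only a masked subset of parameters, so no backward pass or activation storage is needed. The random mask and all names (`sparse_zo_step`, `eps`, `lr`) are illustrative, not the paper's method.

```python
import torch

def sparse_zo_step(params, loss_fn, mask, eps=1e-3, lr=1e-2, seed=0):
    """One MeZO-style step: two forward passes, update only masked entries."""
    gen = torch.Generator().manual_seed(seed)
    z = torch.randn(params.shape, generator=gen) * mask  # sparse perturbation
    with torch.no_grad():
        loss_plus = loss_fn(params + eps * z)
        loss_minus = loss_fn(params - eps * z)
        grad_est = (loss_plus - loss_minus) / (2 * eps)  # directional derivative
        params -= lr * grad_est * z                      # moves masked entries only
    return params

# Toy quadratic objective; perturb and update only ~10% of the parameters.
target = torch.randn(100)
loss_fn = lambda p: ((p - target) ** 2).sum()
params = torch.zeros(100)
mask = (torch.rand(100) < 0.1).float()
for step in range(200):
    params = sparse_zo_step(params, loss_fn, mask, seed=step)
print(loss_fn(params) < loss_fn(torch.zeros(100)))  # tensor(True)
```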