Memory-efficient fine-tuning of compressed large language models via sub-4-bit integer quantization
Large language models (LLMs) face the challenges in fine-tuning and deployment due to
their high memory demands and computational costs. While parameter-efficient fine-tuning …
their high memory demands and computational costs. While parameter-efficient fine-tuning …
Efficientqat: Efficient quantization-aware training for large language models
Large language models (LLMs) are crucial in modern natural language processing and
artificial intelligence. However, they face challenges in managing their significant memory …
artificial intelligence. However, they face challenges in managing their significant memory …
Nearest is not dearest: Towards practical defense against quantization-conditioned backdoor attacks
Abstract Model quantization is widely used to compress and accelerate deep neural
networks. However recent studies have revealed the feasibility of weaponizing model …
networks. However recent studies have revealed the feasibility of weaponizing model …
Beyond efficiency: A systematic survey of resource-efficient large language models
The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated
models like OpenAI's ChatGPT, represents a significant advancement in artificial …
models like OpenAI's ChatGPT, represents a significant advancement in artificial …
A survey of resource-efficient llm and multimodal foundation models
Large foundation models, including large language models (LLMs), vision transformers
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …
A survey of low-bit large language models: Basics, systems, and algorithms
Large language models (LLMs) have achieved remarkable advancements in natural
language processing, showcasing exceptional performance across various tasks. However …
language processing, showcasing exceptional performance across various tasks. However …
Resource-efficient Algorithms and Systems of Foundation Models: A Survey
Large foundation models, including large language models, vision transformers, diffusion,
and large language model based multimodal models, are revolutionizing the entire machine …
and large language model based multimodal models, are revolutionizing the entire machine …
PTQ4SAM: Post-Training Quantization for Segment Anything
Abstract Segment Anything Model (SAM) has achieved impressive performance in many
computer vision tasks. However as a large-scale model the immense memory and …
computer vision tasks. However as a large-scale model the immense memory and …
Optimize weight rounding via signed gradient descent for the quantization of llms
Large Language Models (LLMs) have proven their exceptional capabilities in performing
language-related tasks. However, their deployment poses significant challenges due to their …
language-related tasks. However, their deployment poses significant challenges due to their …
[PDF][PDF] Resource-efficient algorithms and systems of foundation models: A survey
In the rapidly evolving field of artificial intelligence (AI), a paradigm shift is underway. We are
witnessing the transition from specialized, fragmented deep learning models to versatile …
witnessing the transition from specialized, fragmented deep learning models to versatile …