Efficientqat: Efficient quantization-aware training for large language models

M Chen, W Shao, P Xu, J Wang, P Gao… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) are crucial in modern natural language processing and
artificial intelligence. However, they face challenges in managing their significant memory …

A survey of low-bit large language models: Basics, systems, and algorithms

R Gong, Y Ding, Z Wang, C Lv, X Zheng, J Du… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have achieved remarkable advancements in natural
language processing, showcasing exceptional performance across various tasks. However …

How good are low-bit quantized llama3 models? an empirical study

W Huang, X Ma, H Qin, X Zheng, C Lv, H Chen… - arxiv e …, 2024 - ui.adsabs.harvard.edu
Meta's LLaMA family has become one of the most powerful open-source Large Language
Model (LLM) series. Notably, LLaMA3 models have recently been released and achieve …

An empirical study of llama3 quantization: From llms to mllms

W Huang, X Zheng, X Ma, H Qin, C Lv, H Chen, J Luo… - Visual Intelligence, 2024 - Springer
The LLaMA family, a collection of foundation language models ranging from 7B to 65B
parameters, has become one of the most powerful open-source large language models …

Compressing large language models by joint sparsification and quantization

J Guo, J Wu, Z Wang, J Liu, G Yang, Y Ding… - … on Machine Learning, 2024 - openreview.net
In this paper, we introduce a novel model compression technique named Joint Sparsification
and Quantization (JSQ), explicitly tailored for large language models (LLMs). Traditional …

[HTML][HTML] Enhancing brain tumor segmentation in MRI images: A hybrid approach using UNet, attention mechanisms, and transformers

TB Nguyen-Tat, TQT Nguyen, HN Nguyen… - Egyptian Informatics …, 2024 - Elsevier
Accurate brain tumor segmentation in MRI images is crucial for effective treatment planning
and monitoring. Traditional methods often encounter challenges due to the complexity and …

[HTML][HTML] Enhancing medical image classification via federated learning and pre-trained model

PN Srinivasu, GJ Lakshmi, SC Narahari, J Shafi… - Egyptian Informatics …, 2024 - Elsevier
The precise classification of medical images is crucial in various healthcare applications,
especially in fields like disease diagnosis and treatment planning. In recent times, machine …

Q-snns: Quantized spiking neural networks

W Wei, Y Liang, A Belatreche, Y **ao, H Cao… - Proceedings of the …, 2024 - dl.acm.org
Brain-inspired Spiking Neural Networks (SNNs) leverage sparse spikes to represent
information and process them in an asynchronous event-driven manner, offering an energy …

A Comprehensive Approach Towards Wheat Leaf Disease Identification Leveraging Transformer Models and Federated Learning

M Fahim-Ul-Islam, A Chakrabarty, ST Ahmed… - IEEE …, 2024 - ieeexplore.ieee.org
Wheat is one of the most extensively cultivated crops worldwide that contributes significantly
to global food caloric and protein production and is grown on millions of hectares yearly …

On-Device LLMs for SMEs: Challenges and Opportunities

JSG Yee, PC Ng, Z Wang, I McLoughlin, AB Ng… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper presents a systematic review of the infrastructure requirements for deploying
Large Language Models (LLMs) on-device within the context of small and medium-sized …