A survey on optimization techniques for edge artificial intelligence (AI)
Artificial Intelligence (AI) models are being produced and used to solve a variety of current
and future business and technical problems. Therefore, AI model engineering processes …
An analog-AI chip for energy-efficient speech recognition and transcription
Models of artificial intelligence (AI) that have billions of parameters can achieve
high accuracy across a range of tasks, but they exacerbate the poor energy efficiency of …
Reconfigurable halide perovskite nanocrystal memristors for neuromorphic computing
Many in-memory computing frameworks demand electronic devices with specific switching
characteristics to achieve the desired level of computational complexity. Existing memristive …
Zero-shot text-to-image generation
Text-to-image generation has traditionally focused on finding better modeling assumptions
for training on a fixed dataset. These assumptions might involve complex architectures …
Higher-dimensional processing using a photonic tensor core with continuous-time data
New developments in hardware-based 'accelerators' range from electronic tensor cores and
memristor-based arrays to photonic implementations. The goal of these approaches is to …
Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators
Analog in-memory computing—a promising approach for energy-efficient acceleration of
deep learning workloads—computes matrix-vector multiplications but only approximately …
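The snippet above notes that analog in-memory computing performs matrix-vector multiplications only approximately. A common hardware-aware training idea is to inject weight noise during the forward pass so the model learns to tolerate device non-idealities. The following is a minimal sketch of that idea, not the paper's exact method; the multiplicative-Gaussian noise model and the `noise_std` value are illustrative assumptions.

```python
import numpy as np

def noisy_matvec(W, x, noise_std=0.02, rng=None):
    """Matrix-vector product with multiplicative weight noise, mimicking
    the approximate MVM of an analog in-memory computing crossbar.
    Running forward passes through this during training (hardware-aware
    training) encourages robustness to per-device weight perturbations.
    noise_std=0.02 is an illustrative, not hardware-derived, value."""
    rng = np.random.default_rng() if rng is None else rng
    # Each weight is perturbed independently, as if read from a noisy device.
    W_noisy = W * (1.0 + noise_std * rng.standard_normal(W.shape))
    return W_noisy @ x
```

With `noise_std=0.0` this reduces to an exact matrix-vector product, which makes it easy to drop into an existing training loop as a switchable layer.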
Training transformers with 4-bit integers
Quantizing the activation, weight, and gradient to 4-bit is promising to accelerate neural
network training. However, existing 4-bit training methods require custom numerical formats …
Resource-efficient convolutional networks: A survey on model-, arithmetic-, and implementation-level techniques
Convolutional neural networks (CNNs) are used in our daily life, including self-driving cars,
virtual assistants, social network services, healthcare services, and face recognition, among …
FP8 quantization: The power of the exponent
When quantizing neural networks for efficient inference, low-bit integers are the go-to format
for efficiency. However, low-bit floating point numbers have an extra degree of freedom …
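The "extra degree of freedom" of low-bit floating point mentioned above is the per-value exponent: the quantization step scales with magnitude, so small values keep relative precision that a single fixed INT8 scale would destroy. A toy comparison, assuming an E4M3-like format with 3 mantissa bits (a sketch of the general idea, not the paper's scheme):

```python
import numpy as np

def quantize_int8(x, scale):
    """Symmetric INT8 quantization: round to an integer in [-127, 127],
    then dequantize. One fixed step size `scale` for the whole tensor."""
    q = np.clip(np.round(x / scale), -127, 127)
    return q * scale

def quantize_fp8_e4m3(x):
    """Toy FP8 (E4M3-like) rounding: 3 mantissa bits at each value's own
    binary exponent, so the step size shrinks with the value's magnitude.
    Ignores exponent-range clipping and subnormals for simplicity."""
    x = np.asarray(x, dtype=np.float64)
    out = np.zeros_like(x)
    nonzero = x != 0
    e = np.floor(np.log2(np.abs(x[nonzero])))  # per-value exponent
    step = 2.0 ** (e - 3)                      # 3 fractional mantissa bits
    out[nonzero] = np.round(x[nonzero] / step) * step
    return out
```

For a tensor whose maximum is 100, the INT8 step is about 0.79, so a value of 0.01 rounds to exactly zero, while the toy FP8 rounding keeps it within a few percent of its true value.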
Understanding int4 quantization for language models: latency speedup, composability, and failure cases
Improving the deployment efficiency of transformer-based language models has been
challenging given their high computation and memory cost. While INT8 quantization has …
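The entries above on INT8 and INT4 quantization both build on the same basic weight-only scheme: store low-bit integer codes plus a floating-point scale per output channel. A minimal sketch of symmetric per-row INT4 weight quantization, assuming the common [-7, 7] symmetric range (not any one paper's exact recipe):

```python
import numpy as np

def quantize_weights_int4(W):
    """Per-output-channel symmetric INT4 weight quantization.
    Each row gets its own scale so the row maximum maps to code 7;
    codes are stored in int8 here, though real kernels pack two per byte."""
    scale = np.abs(W).max(axis=1, keepdims=True) / 7.0
    scale = np.where(scale == 0, 1.0, scale)  # guard all-zero rows
    q = np.clip(np.round(W / scale), -7, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights: code times per-row scale."""
    return q.astype(np.float32) * scale
```

Because rounding moves each weight by at most half a step, the reconstruction error of every element is bounded by half the row's scale, which is why per-channel scales matter for rows with very different magnitudes.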