محقق Google

H Wu, X Chen, YC Lin, K Chang, HL Chung… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Neural audio codecs are initially introduced to compress audio data into compact codes to
reduce transmission latency. Researchers recently discovered the potential of codecs as …‏

ذخیره ارجاع بیان شده در 28 یافته مقاله‌های مربوط تمام نسخه‌های 3 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Low-resource languages jailbreak gpt-4‏

ZX Yong, C Menghini, SH Bach - arxiv preprint arxiv:2310.02446, 2023‏ - arxiv.org‏

AI safety training and red-teaming of large language models (LLMs) are measures to
mitigate the generation of unsafe content. Our work exposes the inherent cross-lingual …‏

ذخیره ارجاع بیان شده در 179 یافته مقاله‌های مربوط تمام نسخه‌های 5 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Language model tokenizers introduce unfairness between languages‏

A Petrov, E La Malfa, P Torr… - Advances in neural …, 2023‏ - proceedings.neurips.cc‏

Recent language models have shown impressive multilingual performance, even when not
explicitly trained for it. Despite this, there are concerns about the quality of their outputs …‏

ذخیره ارجاع بیان شده در 97 یافته مقاله‌های مربوط تمام نسخه‌های 8 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Uniaudio: An audio foundation model toward universal audio generation‏

D Yang, J Tian, X Tan, R Huang, S Liu, X Chang… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Large Language models (LLM) have demonstrated the capability to handle a variety of
generative tasks. This paper presents the UniAudio system, which, unlike prior task-specific …‏

ذخیره ارجاع بیان شده در 108 یافته مقاله‌های مربوط تمام نسخه‌های 3 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Aya dataset: An open-access collection for multilingual instruction tuning‏

S Singh, F Vargus, D Dsouza, BF Karlsson… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Datasets are foundational to many breakthroughs in modern artificial intelligence. Many
recent achievements in the space of natural language processing (NLP) can be attributed to …‏

ذخیره ارجاع بیان شده در 80 یافته مقاله‌های مربوط تمام نسخه‌های 7 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fairness in large language models: A taxonomic survey‏

Z Chu, Z Wang, W Zhang - ACM SIGKDD explorations newsletter, 2024‏ - dl.acm.org‏

Large Language Models (LLMs) have demonstrated remarkable success across various
domains. However, despite their promising performance in numerous real-world …‏

ذخیره ارجاع بیان شده در 27 یافته مقاله‌های مربوط تمام نسخه‌های 10

[Free GPT-4]
[DeepSeek]

[PDF] mit.edu

SpiRit-LM: Interleaved Spoken and Written Language Model‏

TA Nguyen, B Muller, B Yu, MR Costa-Jussa… - Transactions of the …, 2025‏ - direct.mit.edu‏

We introduce SpiRit-lm, a foundation multimodal language model that freely mixes text and
speech. Our model is based on a 7B pretrained text language model that we extend to the …‏

ذخیره ارجاع بیان شده در 28 یافته مقاله‌های مربوط تمام نسخه‌های 6

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Uniaudio: Towards universal audio generation with large language models‏

D Yang, J Tian, X Tan, R Huang, S Liu… - … on Machine Learning, 2024‏ - openreview.net‏

Audio generation is a major branch of generative AI research. Compared with prior works in
this area that are commonly task-specific with heavy domain knowledge, this paper …‏

ذخیره ارجاع بیان شده در 11 یافته مقاله‌های مربوط تمام نسخه‌های 6 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study‏

X Chang, B Yan, K Choi, JW Jung, Y Lu… - ICASSP 2024-2024 …, 2024‏ - ieeexplore.ieee.org‏

Speech signals, typically sampled at rates in the tens of thousands per second, contain
redundancies, evoking inefficiencies in sequence modeling. High-dimensional speech …‏

ذخیره ارجاع بیان شده در 37 یافته مقاله‌های مربوط تمام نسخه‌های 5

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Salm: Speech-augmented language model with in-context learning for speech recognition and translation‏

Z Chen, H Huang, A Andrusenko… - ICASSP 2024-2024 …, 2024‏ - ieeexplore.ieee.org‏

We present a novel Speech Augmented Language Model (SALM) with multitask and in-
context learning capabilities. SALM comprises a frozen text LLM, a audio encoder, a …‏

ذخیره ارجاع بیان شده در 37 یافته مقاله‌های مربوط تمام نسخه‌های 5

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

Towards audio language modeling--an overview‏

Low-resource languages jailbreak gpt-4‏

Language model tokenizers introduce unfairness between languages‏

Uniaudio: An audio foundation model toward universal audio generation‏

Aya dataset: An open-access collection for multilingual instruction tuning‏

Fairness in large language models: A taxonomic survey‏

SpiRit-LM: Interleaved Spoken and Written Language Model‏

Uniaudio: Towards universal audio generation with large language models‏

Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study‏

Salm: Speech-augmented language model with in-context learning for speech recognition and translation‏