Accelerating transformer inference for translation via parallel decoding

A Santilli, S Severino, E Postolache, V Maiorca… - arxiv preprint arxiv …, 2023 - arxiv.org
Autoregressive decoding limits the efficiency of transformers for Machine Translation (MT).
The community proposed specific network architectures and learning-based methods to …

Multi-source diffusion models for simultaneous music generation and separation

G Mariani, I Tallini, E Postolache, M Mancusi… - arxiv preprint arxiv …, 2023 - arxiv.org
In this work, we define a diffusion-based generative model capable of both music synthesis
and source separation by learning the score of the joint probability density of sources …

Camoscio: An italian instruction-tuned llama

A Santilli, E Rodolà - 2023 - books.google.com
Abstract In recent years Large Language Models have improved the state of the art on
several natural language processing tasks. However, their availability is frequently restricted …

Cocola: Coherence-oriented contrastive learning of musical audio representations

R Ciranni, G Mariani, M Mancusi, E Postolache… - arxiv preprint arxiv …, 2024 - arxiv.org
We present COCOLA (Coherence-Oriented Contrastive Learning for Audio), a contrastive
learning method for musical audio representations that captures the harmonic and rhythmic …

Unsupervised composable representations for audio

G Bindi, P Esling - arxiv preprint arxiv:2408.09792, 2024 - arxiv.org
Current generative models are able to generate high-quality artefacts but have been shown
to struggle with compositional reasoning, which can be defined as the ability to generate …

Exploring the Frontier: Generative AI Applications in Online Consumer Behavior Analytics

T Kimura - Cuadernos de Gestión, 2024 - ojs.ehu.eus
enpresa Management Letters / Cuadernos de Gestión Page 1 This article is distributed under the
terms of the Creative Commons Atribution 4.0 Internacional License institutua enpresa Instituto …

Improving Source Extraction with Diffusion and Consistency Models

T Karchkhadze, MR Izadi, S Zhang - arxiv preprint arxiv:2412.06965, 2024 - arxiv.org
In this work, we demonstrate the integration of a score-matching diffusion model into a
deterministic architecture for time-domain musical source extraction, resulting in enhanced …

Towards the evaluation of marine acoustic biodiversity through data-driven audio source separation

M Mancusi, N Zonca, E Rodolà… - 2023 Immersive and 3D …, 2023 - ieeexplore.ieee.org
The marine ecosystem faces alarming changes, including biodiversity loss and the migration
of tropical species to temperate regions. Monitoring underwater environments and their …

Source Separation of Multi-source Raw Music using a Residual Quantized Variational Autoencoder

L Berti - arxiv preprint arxiv:2408.07020, 2024 - arxiv.org
I developed a neural audio codec model based on the residual quantized variational
autoencoder architecture. I train the model on the Slakh2100 dataset, a standard dataset for …

Harnessing the capabilities of Generative Models

G Mariani - 2024 - tesidottorato.depositolegale.it
Generative models have experienced significant advancements in recent years, driven by
the introduction of architectures such as Stable Diffusion, GPT-3, ChatGPT, and many …