Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Codec-superb@ slt 2024: A lightweight benchmark for neural audio codec models
Neural audio codec models are becoming increasingly important as they serve as
tokenizers for audio, enabling efficient transmission or facilitating speech language …
tokenizers for audio, enabling efficient transmission or facilitating speech language …
Wavchat: A survey of spoken dialogue models
Recent advancements in spoken dialogue models, exemplified by systems like GPT-4o,
have captured significant attention in the speech domain. Compared to traditional three-tier …
have captured significant attention in the speech domain. Compared to traditional three-tier …
Bigcodec: Pushing the limits of low-bitrate neural speech codec
We present BigCodec, a low-bitrate neural speech codec. While recent neural speech
codecs have shown impressive progress, their performance significantly deteriorates at low …
codecs have shown impressive progress, their performance significantly deteriorates at low …
Audio-Language Models for Audio-Centric Tasks: A survey
Audio-Language Models (ALMs), which are trained on audio-text data, focus on the
processing, understanding, and reasoning of sounds. Unlike traditional supervised learning …
processing, understanding, and reasoning of sounds. Unlike traditional supervised learning …
Analyzing and Mitigating Inconsistency in Discrete Audio Tokens for Neural Codec Language Models
Building upon advancements in Large Language Models (LLMs), the field of audio
processing has seen increased interest in training audio generation tasks with discrete …
processing has seen increased interest in training audio generation tasks with discrete …
LLMs are one-shot URL classifiers and explainers
Malicious URL classification represents a crucial aspect of cybersecurity. Although existing
work comprises numerous machine learning and deep learning-based URL classification …
work comprises numerous machine learning and deep learning-based URL classification …
Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding
The goal of this paper is to accelerate codec-based speech synthesis systems with minimum
sacrifice to speech quality. We propose an enhanced inference method that allows for …
sacrifice to speech quality. We propose an enhanced inference method that allows for …
CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset
With the rapid advancement of codec-based speech generation (CoSG) systems, creating
fake speech that mimics an individual's identity and spreads misinformation has become …
fake speech that mimics an individual's identity and spreads misinformation has become …
Artificial Intelligence in Creative Industries: Advances Prior to 2025
The rapid advancements in artificial intelligence (AI), particularly in generative AI and large
language models (LLMs), have profoundly impacted the creative industries by enabling …
language models (LLMs), have profoundly impacted the creative industries by enabling …
Recent Advances in Discrete Speech Tokens: A Review
The rapid advancement of speech generation technologies in the era of large language
models (LLMs) has established discrete speech tokens as a foundational paradigm for …
models (LLMs) has established discrete speech tokens as a foundational paradigm for …