- Academic Search

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Cited by 235 - All 6 versions

[PDF] arxiv.org

Deep reinforcement learning in computer vision: a comprehensive survey

N Le, VS Rathour, K Yamazaki, K Luu… - Artificial Intelligence …, 2022 - Springer

Deep reinforcement learning augments the reinforcement learning framework and utilizes
the powerful representation of deep neural networks. Recent works have demonstrated the …

Cited by 212 - All 10 versions

[PDF] neurips.cc

StyleTTS 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models

YA Li, C Han, V Raghavan… - Advances in Neural …, 2023 - proceedings.neurips.cc

In this paper, we present StyleTTS 2, a text-to-speech (TTS) model that leverages style
diffusion and adversarial training with large speech language models (SLMs) to achieve …

Cited by 100 - All 6 versions

[PDF] arxiv.org

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Cited by 467 - All 2 versions

[PDF] ieee.org

A systematic literature review on phishing email detection using natural language processing techniques

S Salloum, T Gaber, S Vadera, K Shaalan - IEEE Access, 2022 - ieeexplore.ieee.org

Every year, phishing results in losses of billions of dollars and is a major threat to the Internet
economy. Phishing attacks are now most often carried out by email. To better comprehend …

Cited by 151 - All 6 versions

[PDF] arxiv.org

Open problems in cooperative AI

A Dafoe, E Hughes, Y Bachrach, T Collins… - arXiv preprint arXiv …, 2020 - arxiv.org

Problems of cooperation--in which agents seek ways to jointly improve their welfare--are
ubiquitous and important. They can be found at scales ranging from our daily routines--such …

Cited by 263 - All 3 versions

[PDF] academia.edu

Consumer engagement via interactive artificial intelligence and mixed reality

EC Sung, S Bae, DID Han, O Kwon - International journal of information …, 2021 - Elsevier

The use of immersive technologies has changed the consumption environment in which
retailers provide services. We present findings from a study designed to investigate …

Cited by 192 - All 8 versions

[PDF] arxiv.org

A survey on audio diffusion models: Text to speech synthesis and enhancement in generative AI

C Zhang, C Zhang, S Zheng, M Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Generative AI has demonstrated impressive performance in various fields, among which
speech synthesis is an interesting direction. With the diffusion model as the most popular …

Cited by 83 - All 4 versions

[PDF] mdpi.com

A review of modern audio deepfake detection methods: challenges and future directions

Z Almutairi, H Elgibreen - Algorithms, 2022 - mdpi.com

A number of AI-generated tools are used today to clone human voices, leading to a new
technology known as Audio Deepfakes (ADs). Despite being introduced to enhance human …

Cited by 116 - All 4 versions

Conventional and contemporary approaches used in text to speech synthesis: A review

N Kaur, P Singh - Artificial Intelligence Review, 2023 - Springer

Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human
like natural sounding voice from the written text, is gaining popularity in the field of speech …

Cited by 60 - All 3 versions

A review of deep learning based speech synthesis

A review of deep learning techniques for speech processing
