A survey on generative adversarial networks for imbalance problems in computer vision tasks

V Sampath, I Maurtua, JJ Aguilar Martin, A Gutierrez - Journal of big Data, 2021 - Springer
Any computer vision application development starts off by acquiring images and data, then
preprocessing and pattern recognition steps to perform a task. When the acquired images …

Learning transactional behavioral representations for credit card fraud detection

Y **e, G Liu, C Yan, C Jiang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Credit card fraud detection is a challenging task since fraudulent actions are hidden in
massive legitimate behaviors. This work aims to learn a new representation for each …

Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis

Y Lei, S Yang, X Wang, L **e - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Expressive synthetic speech is essential for many human-computer interaction and audio
broadcast scenarios, and thus synthesizing expressive speech has attracted much attention …

Time-aware attention-based gated network for credit card fraud detection by extracting transactional behaviors

Y **e, G Liu, C Yan, C Jiang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the popularity of credit cards worldwide, timely and accurate fraud detection has
become critically important to ensure the safety of their user accounts. Existing models …

Electrical load forecasting based on variable T-distribution and dual attention mechanism

J Wang, L Han, X Zhang, Y Wang, S Zhang - Energy, 2023 - Elsevier
Accurate load forecasting is essential for power system stability and grid dispatch
optimization. However, this task is challenging due to the inherent instability and volatility of …

Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis

Y Lei, S Yang, L **e - 2021 IEEE Spoken Language …, 2021 - ieeexplore.ieee.org
This paper proposes a unified model to conduct emotion transfer, control and prediction for
sequence-to-sequence based fine-grained emotional speech synthesis. Conventional …

InSAR time-series deformation forecasting surrounding Salt Lake using deep transformer models

J Wang, C Li, L Li, Z Huang, C Wang, H Zhang… - Science of the Total …, 2023 - Elsevier
The free and open data policy of Sentinel-1 SAR images enables Radar interferometry
(InSAR) to perform time series surface deformation monitoring over large areas. InSAR …

A study of text vectorization method combining topic model and transfer learning

X Yang, K Yang, T Cui, M Chen, L He - Processes, 2022 - mdpi.com
With the development of Internet cloud technology, the scale of data is expanding.
Traditional processing methods find it difficult to deal with the problem of information …

[HTML][HTML] Hiddensinger: High-quality singing voice synthesis via neural audio codec and latent diffusion models

JS Hwang, SH Lee, SW Lee - Neural Networks, 2025 - Elsevier
Recently, denoising diffusion models have demonstrated remarkable performance among
generative models in various domains. However, in the speech domain, there are limitations …

DECN: Dialogical emotion correction network for conversational emotion recognition

Z Lian, B Liu, J Tao - Neurocomputing, 2021 - Elsevier
Emotion recognition in conversation (ERC) is an important research topic in artificial
intelligence. Different from the emotion estimation in individual utterances, ERC requires …