Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions

A Rahate, R Walambe, S Ramanna, K Kotecha - Information Fusion, 2022 - Elsevier
Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …

[HTML][HTML] Machine learning of Raman spectroscopy data for classifying cancers: a review of the recent literature

N Blake, R Gaifulina, LD Griffin, IM Bell, GMH Thomas - Diagnostics, 2022 - mdpi.com
Raman Spectroscopy has long been anticipated to augment clinical decision making, such
as classifying oncological samples. Unfortunately, the complexity of Raman data has thus far …

Image classification with deep learning in the presence of noisy labels: A survey

G Algan, I Ulusoy - Knowledge-Based Systems, 2021 - Elsevier
Image classification systems recently made a giant leap with the advancement of deep
neural networks. However, these systems require an excessive amount of labeled data to be …

Gradient descent with early stop** is provably robust to label noise for overparameterized neural networks

M Li, M Soltanolkotabi, S Oymak - … conference on artificial …, 2020 - proceedings.mlr.press
Modern neural networks are typically trained in an over-parameterized regime where the
parameters of the model far exceed the size of the training data. Such neural networks in …

Training deep neural-networks using a noise adaptation layer

J Goldberger, E Ben-Reuven - International conference on learning …, 2017 - openreview.net
The availability of large datsets has enabled neural networks to achieve impressive
recognition results. However, the presence of inaccurate class labels is known to deteriorate …

Learning with bad training data via iterative trimmed loss minimization

Y Shen, S Sanghavi - International conference on machine …, 2019 - proceedings.mlr.press
In this paper, we study a simple and generic framework to tackle the problem of learning
model parameters when a fraction of the training samples are corrupted. Our approach is …

Pushing on text readability assessment: A transformer meets handcrafted linguistic features

BW Lee, YS Jang, JHJ Lee - ar** attacks on malware detection systems
R Taheri, R Javidan, M Shojafar, Z Pooranian… - Neural Computing and …, 2020 - Springer
Label manipulation attacks are a subclass of data poisoning attacks in adversarial machine
learning used against different applications, such as malware detection. These types of …

[HTML][HTML] Data augmentation for audio-visual emotion recognition with an efficient multimodal conditional GAN

F Ma, Y Li, S Ni, SL Huang, L Zhang - Applied Sciences, 2022 - mdpi.com
Audio-visual emotion recognition is the research of identifying human emotional states by
combining the audio modality and the visual modality simultaneously, which plays an …

Deep learning with noisy labels in medical prediction problems: a sco** review

Y Wei, Y Deng, C Sun, M Lin, H Jiang… - Journal of the American …, 2024 - academic.oup.com
Objectives Medical research faces substantial challenges from noisy labels attributed to
factors like inter-expert variability and machine-extracted labels. Despite this, the adoption of …