Deepfakes generation and detection: State-of-the-art, open challenges, countermeasures, and way forward

M Masood, M Nawaz, KM Malik, A Javed, A Irtaza… - Applied …, 2023 - Springer
Easy access to audio-visual content on social media, combined with the availability of
modern tools such as Tensorflow or Keras, and open-source trained models, along with …

Using transformers for multimodal emotion recognition: Taxonomies and state of the art review

S Hazmoune, F Bougamouza - Engineering Applications of Artificial …, 2024 - Elsevier
Emotion recognition is an aspect of human-computer interaction, affective computing, and
social robotics. Conventional unimodal approaches for emotion recognition, depending on …

Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering

JH Cho, U Mall, K Bala… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We present a new framework for semantic segmentation without annotations via clustering.
Off-the-shelf clustering methods are limited to curated, single-label, and object-centric …

Revisiting self-supervised visual representation learning

A Kolesnikov, X Zhai, L Beyer - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Unsupervised visual representation learning remains a largely unsolved problem in
computer vision research. Among a big body of recently proposed approaches for …

Memory-augmented dense predictive coding for video representation learning

T Han, W **e, A Zisserman - European conference on computer vision, 2020 - Springer
The objective of this paper is self-supervised learning from video, in particular for
representations for action recognition. We make the following contributions:(i) We propose a …

Detecting deep-fake videos from appearance and behavior

S Agarwal, H Farid, T El-Gaaly… - 2020 IEEE international …, 2020 - ieeexplore.ieee.org
Synthetically-generated audios and videos-so-called deep fakes-continue to capture the
imagination of the computer-graphics and computer-vision communities. At the same time …

Id-reveal: Identity-aware deepfake video detection

D Cozzolino, A Rössler, J Thies… - Proceedings of the …, 2021 - openaccess.thecvf.com
A major challenge in DeepFake forgery detection is that state-of-the-art algorithms are
mostly trained to detect a specific fake method. As a result, these approaches show poor …

Emotion recognition in speech using cross-modal transfer in the wild

S Albanie, A Nagrani, A Vedaldi… - Proceedings of the 26th …, 2018 - dl.acm.org
Obtaining large, human labelled speech datasets to train models for emotion recognition is a
notoriously challenging task, hindered by annotation cost and label ambiguity. In this work …

Mast: A memory-augmented self-supervised tracker

Z Lai, E Lu, W **e - … of the IEEE/CVF Conference on …, 2020 - openaccess.thecvf.com
Recent interest in self-supervised dense tracking has yielded rapid progress, but
performance still remains far from supervised methods. We propose a dense tracking model …

Unsupervised learning of object landmarks through conditional image generation

T Jakab, A Gupta, H Bilen… - Advances in neural …, 2018 - proceedings.neurips.cc
We propose a method for learning landmark detectors for visual objects (such as the eyes
and the nose in a face) without any manual supervision. We cast this as the problem of …