Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

A survey on integrated sensing, communication, and computation

D Wen, Y Zhou, X Li, Y Shi, K Huang… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
The forthcoming generation of wireless technology, 6G, promises a revolutionary leap
beyond traditional data-centric services. It aims to usher in an era of ubiquitous intelligent …

Drop-dtw: Aligning common signal between sequences while drop** outliers

M Dvornik, I Hadji, KG Derpanis… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this work, we consider the problem of sequence-to-sequence alignment for signals
containing outliers. Assuming the absence of outliers, the standard Dynamic Time War** …

The multimodal sentiment analysis in car reviews (muse-car) dataset: Collection, insights and improvements

L Stappen, A Baird, L Schumann… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Truly real-life data presents a strong, but exciting challenge for sentiment and emotion
research. The high variety of possible 'in-the-wild'properties makes large datasets such as …

DeepSimulator: a deep simulator for Nanopore sequencing

Y Li, R Han, C Bi, M Li, S Wang, X Gao - Bioinformatics, 2018 - academic.oup.com
Abstract Motivation Oxford Nanopore sequencing is a rapidly developed sequencing
technology in recent years. To keep pace with the explosion of the downstream data …

Temporal transformer networks: Joint learning of invariant and discriminative time war**

S Lohit, Q Wang, P Turaga - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Many time-series classification problems involve develo** metrics that are invariant to
temporal misalignment. In human activity analysis, temporal misalignment arises due to …

Deep canonical time war** for simultaneous alignment and representation learning of sequences

G Trigeorgis, MA Nicolaou, BW Schuller… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Machine learning algorithms for the analysis of time-series often depend on the assumption
that utilised data are temporally aligned. Any temporal discrepancies arising in the data is …

Aligning accumulative representations for sign language recognition

AA Kındıroglu, O Özdemir, L Akarun - Machine Vision and Applications, 2023 - Springer
Accumulative representations provide a method for representing variable-length videos with
constant length features. In this study, we present aligned temporal accumulative features …

Alignnet: A unifying approach to audio-visual alignment

J Wang, Z Fang, H Zhao - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
We present AlignNet, a model that synchronizes videos with reference audios under non-
uniform and irregular misalignments. AlignNet learns the end-to-end dense correspondence …

Low-latency speculative inference on distributed multi-modal data streams

T Li, J Huang, E Risinger, D Ganesan - Proceedings of the 19th Annual …, 2021 - dl.acm.org
While multi-modal deep learning is useful in distributed sensing tasks like human tracking,
activity recognition, and audio and video analysis, deploying state-of-the-art multi-modal …