[HTML][HTML] Smartphone-based portable bio-chemical sensors: exploring recent advancements

TH Bui, B Thangavel, M Sharipov, K Chen, JH Shin - Chemosensors, 2023 - mdpi.com
Traditionally, analytical chemistry and diagnosis relied on wet laboratories and skilled
professionals utilizing sophisticated instruments for sample handling and analysis. However …

Acoustic scene classification: A comprehensive survey

B Ding, T Zhang, C Wang, G Liu, J Liang, R Hu… - Expert Systems with …, 2024 - Elsevier
Acoustic scene classification (ASC) has gained significant interest recently due to its diverse
applications. Various audio signal processing and machine learning methods have been …

Panns: Large-scale pretrained audio neural networks for audio pattern recognition

Q Kong, Y Cao, T Iqbal, Y Wang… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
Audio pattern recognition is an important research topic in the machine learning area, and
includes several tasks such as audio tagging, acoustic scene classification, music …

Self-supervised visual feature learning with deep neural networks: A survey

L **g, Y Tian - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org
Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …

Self-supervised learning by cross-modal audio-video clustering

H Alwassel, D Mahajan, B Korbar… - Advances in …, 2020 - proceedings.neurips.cc
Visual and audio modalities are highly correlated, yet they contain different information.
Their strong correlation makes it possible to predict the semantics of one from the other with …

Audio-visual instance discrimination with cross-modal agreement

P Morgado, N Vasconcelos… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We present a self-supervised learning approach to learn audio-visual representations from
video and audio. Our method uses contrastive learning for cross-modal discrimination of …

Audiocaps: Generating captions for audios in the wild

CD Kim, B Kim, H Lee, G Kim - … of the 2019 Conference of the …, 2019 - aclanthology.org
We explore the problem of Audio Captioning: generating natural language description for
any kind of audio in the wild, which has been surprisingly unexplored in previous research …

The internet of sounds: Convergent trends, insights, and future directions

L Turchet, M Lagrange, C Rottondi… - IEEE Internet of …, 2023 - ieeexplore.ieee.org
Current sound-based practices and systems developed in both academia and industry point
to convergent research trends that bring together the field of sound and music Computing …

Audio set: An ontology and human-labeled dataset for audio events

JF Gemmeke, DPW Ellis, D Freedman… - … on acoustics, speech …, 2017 - ieeexplore.ieee.org
Audio event recognition, the human-like ability to identify and relate sounds from audio, is a
nascent problem in machine perception. Comparable problems such as object detection in …

Look, listen, and learn more: Design choices for deep audio embeddings

AL Cramer, HH Wu, J Salamon… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
A considerable challenge in applying deep learning to audio classification is the scarcity of
labeled data. An increasingly popular solution is to learn deep audio embeddings from large …