Machine learning methods for small data challenges in molecular science

B Dou, Z Zhu, E Merkurjev, L Ke, L Chen… - Chemical …, 2023 - ACS Publications
Small data are often used in scientific and engineering research due to the presence of
various constraints, such as time, cost, ethics, privacy, security, and technical limitations in …

Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

Storytelling with image data: A systematic review and comparative analysis of methods and tools

F Lotfi, A Beheshti, H Farhood, M Pooshideh… - Algorithms, 2023 - mdpi.com
In our digital age, data are generated constantly from public and private sources, social
media platforms, and the Internet of Things. A significant portion of this information comes in …

Large scale visual food recognition

W Min, Z Wang, Y Liu, M Luo, L Kang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Food recognition plays an important role in food choice and intake, which is essential to the
health and well‐being of humans. It is thus of importance to the computer vision community …

[HTML][HTML] Transformer-based decoder designs for semantic segmentation on remotely sensed images

T Panboonyuen, K Jitkajornwanich, S Lawawirojwong… - Remote Sensing, 2021 - mdpi.com
Transformers have demonstrated remarkable accomplishments in several natural language
processing (NLP) tasks as well as image processing tasks. Herein, we present a deep …

Learning program representations for food images and cooking recipes

DP Papadopoulos, E Mora… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we are interested in modeling a how-to instructional procedure, such as a
cooking recipe, with a meaningful and rich high-level representation. Specifically, we …

All in one: Exploring unified vision-language tracking with multi-modal alignment

C Zhang, X Sun, Y Yang, L Liu, Q Liu, X Zhou… - Proceedings of the 31st …, 2023 - dl.acm.org
Current mainstream vision-language (VL) tracking framework consists of three parts, ie, a
visual feature extractor, a language feature extractor, and a fusion model. To pursue better …

enhanced hierarchical contrastive learning for recommendation

K Wang, Y Zhu, T Zang, C Wang, M **g - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Designed to establish potential relations and distill high-order representations, graph-based
recommendation systems continue to reveal promising results by jointly modeling ratings …

Self-supervised calorie-aware heterogeneous graph networks for food recommendation

Y Song, X Yang, C Xu - ACM Transactions on Multimedia Computing …, 2023 - dl.acm.org
With the rapid development of online recipe sharing platforms, food recommendation is
emerging as an important application. Although recent studies have made great progress on …

Multi-aspect graph contrastive learning for review-enhanced recommendation

K Wang, Y Zhu, T Zang, C Wang, K Liu… - ACM Transactions on …, 2023 - dl.acm.org
Review-based recommender systems explore semantic aspects of users' preferences by
incorporating user-generated reviews into rating-based models. Recent works have …