Enhancing recipe retrieval with foundation models: A data augmentation perspective

F Song, B Zhu, Y Hao, S Wang - European Conference on Computer …, 2024 - Springer
Learning recipe and food image representation in common embedding space is non-trivial
but crucial for cross-modal recipe retrieval. In this paper, we propose a new perspective for …

Rode: Linear rectified mixture of diverse experts for food large multi-modal models

P Jiao, X Wu, B Zhu, J Chen, CW Ngo… - ar** Image-Recipe Cross-Modal Retrieval with Dual Cross Attention Encoders
W Liu, S Yuan, Z Wang, X Chang, L Gao, Z Zhang - Mathematics, 2024 - mdpi.com
The image-recipe cross-modal retrieval task, which retrieves the relevant recipes according
to food images and vice versa, is now attracting widespread attention. There are two main …

Nutrition Estimation for Dietary Management: A Transformer Approach with Depth Sensing

Z Kwan, W Zhang, Z Wang, AB Ng, S See - arxiv preprint arxiv …, 2024 - arxiv.org
Nutrition estimation is crucial for effective dietary management and overall health and well-
being. Existing methods often struggle with sub-optimal accuracy and can be time …