ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

T Souček, P Gatti, M Wray, I Laptev, D Damen… - arXiv preprint arXiv …, 2024 - arxiv.org
The goal of this work is to generate step-by-step visual instructions in the form of a sequence
of images, given an input image that provides the scene context and the sequence of textual …

Large Language Models for Ingredient Substitution in Food Recipes using Supervised Fine-tuning and Direct Preference Optimization

T Senath, K Athukorala, R Costa, S Ranathunga… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we address the challenge of recipe personalization through ingredient
substitution. We make use of Large Language Models (LLMs) to build an ingredient …