These days, large language models can handle increasingly complex tasks, writing complex code and engaging in sophisticated ...
Abstract: Automatic dietary assessment based on food images remains a challenge, requiring precise food detection, segmentation, and classification. Vision-Language Models (VLMs) offer new ...