Today, artificial intelligence can describe images, recognize objects, and explain complex relationships. The pace of development is remarkable: So-called vision-language models (VLMs) combine text ...