Tools
Image Analysis (Vision)
A few models are capable of processing images and taking them into account for their answer generation. This can be used to extract text, describe or analyse images.
The more context and details you add, the better your response because the model understands precisely what you expect. Do not miss our Prompt Engineering Guide to learn how to write great prompts.
Apart from uploading text files, you can also upload images (JPG, PNG) to the chat and let the model analyze them. This capability is called “vision”. The following models support it:
- GPT-4o
- GPT-4o mini
- Claude 3.5 Sonnet
- Claude 3 Haiku
- Claude 3 Sonnet
- Claude 3 Opus
- Gemini 1.5 Pro
- Gemini 1.5 Flash
Image analysis is limited to images uploaded in the chat and not available in uploaded PDFs or presentations yet.
Was this page helpful?