Processing images

Some Language Models support generating images, or looking at images, or both. The following are examples for how to use the capabilities of these types of models in your application.

For a list of models capable of image input or output, please refer to the multimodal models on the Models page. On that page, you will can also lookup parameters which are model specific.

Generating images

Generating an image can be fun, but can also be used in your product.

The following is a basic way of generating an image based on a prompt, and saving the image to a file.

Using images as input

Processing images by using them as input to your models can unlock a variety of applications, such as image captioning, classification, or generating detailed descriptions based on the image content.

Common denominator

In this example, we are passing in multiple images and asking the LLM to find the most common denominator between the images.