Analyze and understand image content using vision-enabled models
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
model | string | ✅ Yes | gpt-41-nano-2025-04-14 | The model to use for the request. Must be a vision-enabled model. |
input | array | ✅ Yes | - | Array of message objects with role and content. Each message contains: - role: “user” or “assistant” - content: Array of content objects supporting the following types: - input_text: Text prompt with text field - input_image: Image input with image_url field, supports: - Base64 encoded images: data:image/jpeg;base64,{base64_string} - Supported formats: JPG, PNG Example with base64 encoded image: json [ { "role": "user", "content": [ { "type": "input_text", "text": "What is in this image?" }, { "type": "input_image", "image_url": "data:image/jpeg;base64,${base64Image}" } ] } ] |
stream | boolean | ❌ No | false | Whether to stream the response |