Skip to main content
Everything you build runs through one gateway: a single endpoint and key in front of 300+ models, compatible with the OpenAI, Anthropic, and Google AI APIs. Pick the surface that matches what you’re building — the routing, governance, and tracing are the same underneath.

Where to start

Gateway

Text generation through the SDK you already use. Messages, tools, structured output, streaming, and web search.

Multimodality

Images, audio, and video. Vision input, image generation, speech, transcription, and clips.

Realtime

Two-way voice over WebSocket. OpenAI, xAI, and Gemini behind one protocol.

Roundtable

Ask several models the same question in parallel, then consolidate or compare.

Models

The full catalog, with EU-hosted and multimodal models marked.

Pick by what you’re building

Use the gateway. It handles multi-turn conversations, tool calling, and image inputs out of the box, and you can use the OpenAI, Anthropic, or Google AI SDK you already know.
Use structured output. Attach the file, describe the fields you want with a JSON Schema, and get a clean, validated object back.
Use structured output. Describe the output shape with response_format and the model fills it in.
Point your SDK at the gateway. Change the base URL and the API key, and your code keeps working — now with Opper’s observability and Control Plane rules. See Drop-in SDKs.
Use Realtime. Open a websocket, stream microphone audio in, and stream voice out.
Use the Video endpoint. Submit a prompt (and optionally a reference image), then poll for the result.
Use Roundtable. Send one question to a panel of models and get back a consolidated answer or a side-by-side comparison.

What’s the same everywhere

Whichever surface you use:
  • You can call any of the 300+ models Opper supports.
  • Your Control Plane rules apply automatically.
  • Every call shows up in your traces and generation history.
  • Your API key is the same.
You can mix them in one app. A common setup has the gateway powering the user-facing conversation while a background structured-output call parses uploads or summarizes context.