Getting a single AI call to work is easy. Making AI reliable in production is not. Teams that ship AI into real products quickly run into the same set of problems: you’re locked into one provider’s SDK, switching models means rewriting integration code, there’s no visibility into what the AI is actually doing, and quality is unpredictable. As you add more models and more use cases, the complexity compounds — separate credentials, different APIs, no shared observability, and guardrails bolted on as an afterthought. Opper exists to solve this. It sits between your application and the models it uses, giving you a single place to connect, control, and observe all your AI interactions.Documentation Index
Fetch the complete documentation index at: https://docs.opper.ai/llms.txt
Use this file to discover all available pages before exploring further.
Gateway — multi-model, multimodal generation
The gateway is your single connection to AI models. One API, one key, 200+ models across all major providers — OpenAI, Anthropic, Google, Mistral, and more.- Any model, same API. Route to any supported model without provider-specific SDKs or credentials. Switch models with a config change, not a code rewrite.
- Multimodal by default. Text, images, audio, embeddings — the same API handles all modalities. Build applications that work across media types without juggling separate integrations.
- Model independence. Models change fast. New ones launch, pricing shifts, capabilities improve. Your code shouldn’t have to change with them. With Opper, the model is a parameter — everything else stays the same.
- Drop-in compatibility. Use the Opper Multimodal API, or connect through OpenAI, Anthropic, and Gemini compatible endpoints. Migrate incrementally without rewriting your stack.
Control Plane — making AI reliable
Control Plane features are in early access and need to be turned on per account. Contact support@opper.ai if you’re interested.
- Observe. Score every generation against a configurable judge. Pick the rigor (Fast / Balanced / Thorough), the sample strategy, and either a 0–1 score or a binary verdict. Results land on the span next to latency and cost.
- Route. Pin a default model for a scope. Calls that omit
modeluse it; calls that pass an explicitmodelare still subject to the Comply allowlist. - Guard. Run guardrails at the gateway, before data reaches the model and before responses return. LLM guards use a custom prompt to classify content; Regex guards match known patterns. Each guard can Flag, Block, or Redact.
- Comply. Constrain which models calls can reach (provider / region / country / model allowlist with parent-intersection across scopes), trace retention, monthly budget with email alerts, and Zero Data Retention.
- Steer. Coming soon — Steer will use Observe scores to select few-shot examples and optimize prompts. The SDK feedback flow is available today and surfaces on the span.
- Memory. Store and retrieve knowledge through semantic indexes. Give your AI access to custom context without managing vector databases yourself.
Security
Opper is a secure intermediary between your application and external models. All traffic is encrypted and authenticated, and the platform is designed so that your data stays under your control.- Infrastructure. Opper deploys regionally on AWS. All data except model calls is contained within the deployment — vector databases, routing, tracing, and caching stay in-region. Currently available in Stockholm, Sweden.
- Authentication. SSO is supported with major providers (Google, GitHub). All API calls are authenticated with API keys. Users get their own account and can be part of a multi-user organization.
- Compliance. Opper follows European data protection directives and security standards, including GDPR. For compliance details, contact support@opper.ai.