One API. 17 providers. 31 endpoints. Automatic model selection.
| TIME | KEY | STRATEGY | ROUTED TO | TPS | MS |
|---|
Simulated routing decisions. Request content is illustrative only.
No SDK sprawl
One format for every provider. Format translation, tool schemas, and vision payloads handled automatically.
No maintenance
Provider APIs change. Models deprecate. Cailos absorbs every breaking change so your integration doesn't.
No lock-in
Switch providers in seconds. Circuit breakers auto-failover when a provider goes down. Your on-call never wakes up.
Live evals
Every endpoint is evaluated on intelligence, tool calling, and vision. Routing always reflects the current model landscape.
Drop Cailos into any OpenAI-compatible agent framework. The SDK doesn't change — just the endpoint. Model selection becomes automatic.
from agents import Agent
triage = Agent(
name="triage",
model="gpt-4o-mini",
instructions="Classify: billing, technical, or escalate.",
)
resolver = Agent(
name="resolver",
model="gpt-4o",
instructions="Draft resolution from KB.",
tools=[search_kb, lookup_customer],
)
from agents import Agent
from openai import AsyncOpenAI
cailos = AsyncOpenAI(base_url="https://cailos.com/v1", api_key="cai_...")
triage = Agent(
name="triage",
model="auto", # fastest cheap model
instructions="Classify: billing, technical, or escalate.",
)
resolver = Agent(
name="resolver",
model="auto", # best tool-calling model
instructions="Draft resolution from KB.",
tools=[search_kb, lookup_customer],
)
Stack capability constraints. Cailos narrows to one endpoint. Integrate with two lines.
Capability filtering
Every request flows through the filtering pipeline. Cailos narrows 17 endpoints down to the best candidates for your constraints.
from openai import OpenAI
client = OpenAI(
base_url="https://cailos.com/v1",
api_key="cai_...",
)
response = client.chat.completions.create(
model="auto",
messages=[{"role": "user", "content": "..."}],
extra_body={
"cailos": {
"optimise": "quality",
"require_vision": True,
"speed": "fast",
}
},
)
Standard OpenAI SDK. Change base_url and api_key. Add cailos for routing hints.
cailos.com/v1
Drop-in replacement for any OpenAI SDK.
Change two lines. Access 31 endpoints across 17 providers.