LIVE$5* free credits on signup. Try Kimi K2.6, FLUX-2 Klein, Whisper Turbo, Aura 2, and our curated edge tier.Claim your key →
85+ labs · one schema · zero lock-in

One API.
Every model.
Every modality.

OpenAI-compatible endpoints to every model and every modality — text, image, video, voice, audio, music, embeddings. Plus eval-driven routing, sub-account keys for your end users, and replay across any model.

Get your key Read the docs
99.99% uptime 47ms p50 overhead SOC 2 in progress
python
typescript
curl
aigateway-py
aigateway-cli
quickstart.py
# one-line swap: point your SDK at us.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.openai.com/v1"
             "https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

stream = client.chat.completions.create(
    model="moonshot/kimi-k2.6",
    messages=msgs, stream=True,
)
STREAMINGmodel: kimi-k2.6tok/s: 0ttft: via edge / iad1
openaianthropicgooglemoonshotmetamistraldeepseekqwenblack-forest-labsstabilityaideepgram+ 27 more
MODELS
1000+
across 85+ upstream labs
LABS
85+
normalized behind one schema
EDGE OVERHEAD
47ms
p50 over 30 days · global edge
CACHE HIT
34%
median across paying accounts
$5* FREE ON SIGNUP

Start with $5.
Try the free tier.*

Your free credit covers a curated edge-tier shortlist — Kimi K2.6 for chat, FLUX-2 Klein for image, Whisper Turbo + Aura 2 for voice, BGE-M3 for embeddings, Gemma 4 for vision, Llama Guard for moderation. Top up to unlock the full catalog; topups never expire.

$5
signup_creditfree7 curated models7-day window
$5* free credit · no card preauth
Use it on the 7-model edge tier — chat, image, voice, transcription, vision, embeddings, moderation.
KIMI K2.6
~2M tok
on your free $5
FLUX KLEIN 9B
~2 images
on your free $5
WHISPER TURBO
~14 hours
on your free $5
AURA 2
~416K chars
on your free $5
Claim your $5 →See the 7 free-tier modelstopups never expire
python
first_call.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

r = client.chat.completions.create(
    model="anthropic/claude-opus-4.7",
    messages=[{"role": "user",
               "content": "Plan a"
                          " research agent."}],
    stream=True,
)

# $5 free credit on signup (7-day window, curated models).
# Top up to unlock the full 1000+-model catalog.

* Applicable only on selected models — see the model list for details. Free credit expires 7 days after signup; topups never expire.

Catalog

Every model. Every modality.
One key.

Drop-in compatible with the OpenAI SDK. Rotate models mid-conversation without changing any other line of code.

showing 12 of 1000+ models
Rankings →Providers →See all 1000+ models →
Popular comparisons:Opus 4.7 vs GPT-5.4Sonnet 4.6 vs GPT-5.4-miniGemini 3.1 Pro vs Opus 4.7Kimi K2.6 vs Opus 4.7Haiku 4.5 vs GPT-5.4-mini
Honest comparison

Why swap in
one line of code.

Anonymized competitor labels — same public pricing and feature matrix, minus the trash talk. We'll let you Google who's who.

 AIgatewayCompetitor A
breadth aggregator
Competitor B
platform-native gateway
Competitor C
enterprise governance
Models / modalities1000+ · text · image · video · music · voice · audio · embeddings · vision~300 · text only~40 · text + image~80 · image, video, audio
Open-weight served at the edgeyesno · pass-throughno · pass-throughno · queue
Edge latency (p50)47ms · global edge~180ms · single region~60ms · one platform~190ms · single region
Eval-driven routing (SLO on your data)yes
Sub-account / per-user key APIyes · programmaticworkspace only
Replay + shadow A/B across modelsyes
OpenAI-compatibledrop-in, zero changesdrop-indrop-in

Sourced from each competitor's public pricing page and docs, April 2026. If something is wrong, email hello@aigateway.sh and we'll fix it.

One catalog

Every model, every modality.
One endpoint, one schema.

1000+ models across 85+ labs — text, image, video, music, voice, audio, embeddings, vision. Same OpenAI-compatible body shape for every one. Rotate between Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Kimi K2.6, Llama 4, Flux 1 Schnell, Stable Diffusion XL, Veo 3.1, MiniMax Hailuo by changing a single string.

FRONTIER LABS
Anthropic, OpenAI, Google
Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Claude Haiku 4.5
OPEN-WEIGHT
Moonshot, Meta, Mistral, DeepSeek, IBM, Nvidia
Kimi K2.6, Llama 4 Scout, Mistral Small 3.1, DeepSeek-R1, Granite 4
MULTIMODAL
Black Forest Labs, Stability, MyShell, Deepgram
Flux 1 Schnell, Stable Diffusion XL, MeloTTS, Aura 2, Whisper Large v3
Primitives

Four things only the
aggregator can ship.

Things single-provider SDKs physically cannot do — because they only see one provider. We see all of them.

EVAL-DRIVEN ROUTING
Let your own data pick the model.
Upload a dataset, we grade every candidate model on it, then hand you an alias that always routes to the current winner. When a new frontier model lands, rerun the eval — your prod code doesn't change.
curl -X POST https://api.aigateway.sh/v1/evals \
  -H "Authorization: Bearer $KEY" \
  -d '{
    "name": "prod-summarize",
    "candidate_models": [
      "anthropic/claude-opus-4.7",
      "openai/gpt-5.4",
      "moonshot/kimi-k2.6"
    ],
    "dataset": [...],
    "metric": "quality"
  }'

# then just use it
model = "eval:prod-summarize"
SUB-ACCOUNT API
Scoped keys for your end users.
One API call mints a key for each of your customers, with its own spend cap, rate limit, and isolated analytics. No more "I'll build a billing layer on top." We are the billing layer.
POST /v1/sub-accounts
{
  "name": "acme-corp",
  "spend_cap_cents": 50000,     // $500 / mo hard cap
  "rate_limit_rpm": 300,
  "default_tag": "acme"
}

=> { "key": "sk-aig-...",       // hand to customer
     "spend_cap_cents": 50000,
     "id": "sa_9f3k..." }
REPLAY + SHADOW A/B
Test a new model on real traffic.
Replay any past request against a different model and see output, cost, and latency side-by-side. Shadow mode fires the new model in parallel without ever reaching the user — a safe cutover switch.
POST /v1/replays
{
  "source_request_id": "req_abc123",
  "target_model": "anthropic/claude-opus-4.7",
  "shadow": true
}

=> { "source_output": "...",
     "target_output": "...",
     "cost_source_cents": 1.2,
     "cost_target_cents": 4.7,
     "score_delta": 0.82 }
COST-ATTRIBUTION TAGS
Know what every feature costs.
Tag any request with a free-form string — feature, user, tenant — via the x-aig-tag header. Query monthly spend grouped by tag. Pair with hard budget caps per tag and no feature can silently eat your margin again.
# tag each request with the feature it powers
curl https://api.aigateway.sh/v1/chat/completions \
  -H "x-aig-tag: summarize" \
  ...

GET /v1/usage/by-tag?month=2026-04
=> [{ "tag": "summarize", "cost_cents": 4210 },
    { "tag": "chat",      "cost_cents": 9830 },
    { "tag": "rerank",    "cost_cents":  118 }]
For coding agents

Point Claude Code at us.
It configures itself.

A capable coding agent reads /llms.txt, /openapi.json, or /agents.md once — and from then on it can call every modality, mint sub-accounts, tag cost, and replay requests without a human editing config.

You can call any AI model through AIgateway, a universal AI API.

- Base URL: https://api.aigateway.sh/v1
- Auth: Authorization: Bearer sk-aig-...
- SDK: drop-in for the OpenAI SDK — only change base_url
- OpenAPI spec: https://api.aigateway.sh/openapi.json
- Capability map: https://aigateway.sh/llms.txt
- Live models: https://api.aigateway.sh/v1/models
- MCP server: https://api.aigateway.sh/mcp

Install (only when the OpenAI SDK isn't enough):
  pip install aigateway-py        # Python — async jobs, sub-accounts, evals
  pnpm add aigateway-js           # Node   — same surface in TypeScript
  npm i -g aigateway-cli          # CLI    — `aig init` walks through everything

Primitives nobody else has:
  POST /v1/sub-accounts — scoped keys + spend caps per end user
  POST /v1/evals        — grade candidate models on your data
  POST /v1/replays      — re-run a past request on a new model
  GET  /v1/usage/by-tag — per-feature cost via x-aig-tag header
LLMS.TXT
Agent-readable capability map
/llms.txt
OPENAPI.JSON
Typed 3.1 spec for code-gen
https://api.aigateway.sh/openapi.json
AGENTS.MD
Integration patterns + error remediation
/agents.md
INSTALL
pip install aigateway-py · pnpm add aigateway-js · npm i -g aigateway-cli
/integrations
Full agent guide →
Public beta · April 2026

We don't fake logos.
We'd rather ship to you.

Sign up in 30 seconds — no waitlist, no sales call. Cline, Aider, and a handful of agent startups are already on it. Want a personal onboard? Drop your email and I'll reply.

we'll email you once, from a real human. no drip sequence.