AIgateway is a universal AI API that aggregates 1000+ frontier and open-weight models (text, image, audio, video, embeddings, moderation) behind a single OpenAI-compatible endpoint. Change base_url on the OpenAI SDK and every existing integration works.

How is AIgateway priced?

Pay the underlying model cost plus a 5% platform fee on every API call — that's our entire revenue model. No monthly fee, no seat fee, no per-model surcharge. Cached requests get a 50% discount. Every new account gets $5 in free credits redeemable on a curated set of edge-tier models (Kimi K2.6, BGE-M3, FLUX-2 Klein 9B, Gemma 4 vision, Aura 2, Whisper Turbo, Llama Guard 3); the free credit expires 7 days after signup. Top up to unlock the full catalog — topups never expire. Card is required at signup so auto-topup can keep your service running. Payment-processor fees (~5%) apply once at top-up — that's Stripe's standard rate, passed through.

Is AIgateway OpenAI-compatible?

Yes. The endpoint paths, request bodies, and streaming SSE format match OpenAI's API. Point the official OpenAI SDK (Python, Node, etc.) at https://api.aigateway.sh/v1 with an AIgateway key and everything works unchanged.

Which models are supported?

Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Kimi K2.6, Grok 4, Llama 4, FLUX 2, Imagen 4, Veo 3.1, Deepgram Nova 3, BGE-M3, and 1000+ more across every modality. The full live catalog is at /models and /v1/models.

Do I need separate keys for each provider?

No. One AIgateway key routes to every provider. Billing is unified on a single line. Bring-your-own-keys is also supported for providers where you have direct contracts.

Does AIgateway support tool calling, vision, streaming, and JSON mode?

Yes, across every model that the underlying provider supports. Capabilities are published per-model in /v1/models and rendered on each /models/ page.

How do I integrate with agent frameworks like Claude Code, Cursor, or LangChain?

Any OpenAI-compatible SDK or base_url field works — LangChain, LlamaIndex, Vercel AI SDK, Cursor, Claude Code, Continue, Cline. Autoconfigurable via /llms.txt and MCP at https://api.aigateway.sh/mcp.

85+ labs · one schema · zero lock-in

One API.
Every model.
Every modality.

OpenAI-compatible endpoints to every model and every modality — text, image, video, voice, audio, music, embeddings. Plus eval-driven routing, sub-account keys for your end users, and replay across any model.

Get your key →Read the docs

● 99.99% uptime● 47ms p50 overhead● SOC 2 in progress

python

typescript

curl

aigateway-py

aigateway-cli

quickstart.py

# one-line swap: point your SDK at us.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.openai.com/v1"
             "https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

stream = client.chat.completions.create(
    model="moonshot/kimi-k2.6",
    messages=msgs, stream=True,
)

STREAMINGmodel: kimi-k2.6tok/s: 0ttft: —via edge / iad1

openaianthropicgooglemoonshotmetamistraldeepseekqwenblack-forest-labsstabilityaideepgram+ 27 more

MODELS

1000+

across 85+ upstream labs

LABS

85+

normalized behind one schema

EDGE OVERHEAD

47ms

p50 over 30 days · global edge

CACHE HIT

34%

median across paying accounts

$5* FREE ON SIGNUP

Start with $5.
Try the free tier.*

Your free credit covers a curated edge-tier shortlist — Kimi K2.6 for chat, FLUX-2 Klein for image, Whisper Turbo + Aura 2 for voice, BGE-M3 for embeddings, Gemma 4 for vision, Llama Guard for moderation. Top up to unlock the full catalog; topups never expire.

signup_creditfree7 curated models7-day window

$5* free credit · no card preauth

Use it on the 7-model edge tier — chat, image, voice, transcription, vision, embeddings, moderation.

KIMI K2.6

~2M tok

on your free $5

FLUX KLEIN 9B

~2 images

on your free $5

WHISPER TURBO

~14 hours

on your free $5

AURA 2

~416K chars

on your free $5

Claim your $5 →See the 7 free-tier modelstopups never expire

python

first_call.py

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

r = client.chat.completions.create(
    model="anthropic/claude-opus-4.7",
    messages=[{"role": "user",
               "content": "Plan a"
                          " research agent."}],
    stream=True,
)

# $5 free credit on signup (7-day window, curated models).
# Top up to unlock the full 1000+-model catalog.

* Applicable only on selected models — see the model list for details. Free credit expires 7 days after signup; topups never expire.

Catalog

Every model. Every modality.
One key.

Drop-in compatible with the OpenAI SDK. Rotate models mid-conversation without changing any other line of code.

showing 12 of 1000+ models

Rankings →Providers →See all 1000+ models →

Honest comparison

Why swap in
one line of code.

Anonymized competitor labels — same public pricing and feature matrix, minus the trash talk. We'll let you Google who's who.

	AIgateway	Competitor A breadth aggregator	Competitor B platform-native gateway	Competitor C enterprise governance
Models / modalities	1000+ · text · image · video · music · voice · audio · embeddings · vision	~300 · text only	~40 · text + image	~80 · image, video, audio
Open-weight served at the edge	yes	no · pass-through	no · pass-through	no · queue
Edge latency (p50)	47ms · global edge	~180ms · single region	~60ms · one platform	~190ms · single region
Eval-driven routing (SLO on your data)	yes	—	—	—
Sub-account / per-user key API	yes · programmatic	—	workspace only	—
Replay + shadow A/B across models	yes	—	—	—
OpenAI-compatible	drop-in, zero changes	drop-in	drop-in	—

Sourced from each competitor's public pricing page and docs, April 2026. If something is wrong, email hello@aigateway.sh and we'll fix it.

One catalog

Every model, every modality.
One endpoint, one schema.

1000+ models across 85+ labs — text, image, video, music, voice, audio, embeddings, vision. Same OpenAI-compatible body shape for every one. Rotate between Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Kimi K2.6, Llama 4, Flux 1 Schnell, Stable Diffusion XL, Veo 3.1, MiniMax Hailuo by changing a single string.

FRONTIER LABS

Anthropic, OpenAI, Google

Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Claude Haiku 4.5

OPEN-WEIGHT

Moonshot, Meta, Mistral, DeepSeek, IBM, Nvidia

Kimi K2.6, Llama 4 Scout, Mistral Small 3.1, DeepSeek-R1, Granite 4

MULTIMODAL

Black Forest Labs, Stability, MyShell, Deepgram

Flux 1 Schnell, Stable Diffusion XL, MeloTTS, Aura 2, Whisper Large v3

Primitives

Four things only the
aggregator can ship.

Things single-provider SDKs physically cannot do — because they only see one provider. We see all of them.

EVAL-DRIVEN ROUTING

Let your own data pick the model.

Upload a dataset, we grade every candidate model on it, then hand you an alias that always routes to the current winner. When a new frontier model lands, rerun the eval — your prod code doesn't change.

curl -X POST https://api.aigateway.sh/v1/evals \
  -H "Authorization: Bearer $KEY" \
  -d '{
    "name": "prod-summarize",
    "candidate_models": [
      "anthropic/claude-opus-4.7",
      "openai/gpt-5.4",
      "moonshot/kimi-k2.6"
    ],
    "dataset": [...],
    "metric": "quality"
  }'

# then just use it
model = "eval:prod-summarize"

SUB-ACCOUNT API

Scoped keys for your end users.

One API call mints a key for each of your customers, with its own spend cap, rate limit, and isolated analytics. No more "I'll build a billing layer on top." We are the billing layer.

POST /v1/sub-accounts
{
  "name": "acme-corp",
  "spend_cap_cents": 50000,     // $500 / mo hard cap
  "rate_limit_rpm": 300,
  "default_tag": "acme"
}

=> { "key": "sk-aig-...",       // hand to customer
     "spend_cap_cents": 50000,
     "id": "sa_9f3k..." }

REPLAY + SHADOW A/B

Test a new model on real traffic.

Replay any past request against a different model and see output, cost, and latency side-by-side. Shadow mode fires the new model in parallel without ever reaching the user — a safe cutover switch.

POST /v1/replays
{
  "source_request_id": "req_abc123",
  "target_model": "anthropic/claude-opus-4.7",
  "shadow": true
}

=> { "source_output": "...",
     "target_output": "...",
     "cost_source_cents": 1.2,
     "cost_target_cents": 4.7,
     "score_delta": 0.82 }

COST-ATTRIBUTION TAGS

Know what every feature costs.

Tag any request with a free-form string — feature, user, tenant — via the x-aig-tag header. Query monthly spend grouped by tag. Pair with hard budget caps per tag and no feature can silently eat your margin again.

# tag each request with the feature it powers
curl https://api.aigateway.sh/v1/chat/completions \
  -H "x-aig-tag: summarize" \
  ...

GET /v1/usage/by-tag?month=2026-04
=> [{ "tag": "summarize", "cost_cents": 4210 },
    { "tag": "chat",      "cost_cents": 9830 },
    { "tag": "rerank",    "cost_cents":  118 }]

For coding agents

Point Claude Code at us.
It configures itself.

A capable coding agent reads /llms.txt, /openapi.json, or /agents.md once — and from then on it can call every modality, mint sub-accounts, tag cost, and replay requests without a human editing config.

You can call any AI model through AIgateway, a universal AI API.

- Base URL: https://api.aigateway.sh/v1
- Auth: Authorization: Bearer sk-aig-...
- SDK: drop-in for the OpenAI SDK — only change base_url
- OpenAPI spec: https://api.aigateway.sh/openapi.json
- Capability map: https://aigateway.sh/llms.txt
- Live models: https://api.aigateway.sh/v1/models
- MCP server: https://api.aigateway.sh/mcp

Install (only when the OpenAI SDK isn't enough):
  pip install aigateway-py        # Python — async jobs, sub-accounts, evals
  pnpm add aigateway-js           # Node   — same surface in TypeScript
  npm i -g aigateway-cli          # CLI    — `aig init` walks through everything

Primitives nobody else has:
  POST /v1/sub-accounts — scoped keys + spend caps per end user
  POST /v1/evals        — grade candidate models on your data
  POST /v1/replays      — re-run a past request on a new model
  GET  /v1/usage/by-tag — per-feature cost via x-aig-tag header

LLMS.TXT

Agent-readable capability map

/llms.txt

OPENAPI.JSON

Typed 3.1 spec for code-gen

https://api.aigateway.sh/openapi.json

AGENTS.MD

Integration patterns + error remediation

/agents.md

INSTALL

pip install aigateway-py · pnpm add aigateway-js · npm i -g aigateway-cli

/integrations