New BaseModel is now available on the AWS Marketplace — deploy in your own VPC. Learn more →
Built on AWS · SOC 2 Type II

One gateway for
every LLM.

BaseModel is the unified API for large language models. Point your SDK at a single endpoint and reach OpenAI, Anthropic, Google, and leading open & Chinese models — with smart routing, load balancing, automatic failover, and cost optimization built in.

100+models, one API
99.99%uptime SLA
<15msadded latency
app.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.basemodel.pro/v1",
    api_key="bm-•••••••••••••",
)

# Same code — now routed across 100+ models
resp = client.chat.completions.create(
    model="auto",        # smart routing
    messages=[{"role": "user",
               "content": "Hello, world"}],
)
Drop-in replacement — change one line, keep your existing SDK.

One endpoint. Every major provider.

Live routing

Watch a request find its fastest, cheapest path

Every call is scored on price, latency, and health — then routed, load-balanced, and failed over automatically. Zero changes to your code.

gateway · request trace
● live
Provider health auto-balanced
  • Anthropic · Claude112ms
  • OpenAI · GPT-4o140ms
  • Google · Gemini128ms
  • DeepSeek · V3156ms
  • Alibaba · Qwen163ms
38%cost saved
0dropped requests
3regions active
Platform

Everything you need to run LLMs in production

Stop wiring up six SDKs, juggling keys, and guessing at spend. BaseModel is the control plane for all your models.

Unified API

One OpenAI-compatible endpoint for every provider. Swap models with a string — no rewrites, no new SDKs.

Smart routing & load balancing

Route by cost, latency, or quality. Spread traffic across providers and regions to stay under rate limits.

Automatic failover

A provider returns a 429 or 500? We retry on a healthy fallback in milliseconds. Your users never see it.

Cost optimization

Semantic caching, prompt dedup, and price-aware routing cut spend 30–60% without touching quality.

Full observability

Per-request logs, latency, tokens, and cost across every model — in one dashboard, exportable to your stack.

Virtual keys & guardrails

Issue scoped keys per team or app, set budgets and rate limits, and enforce PII & content policies centrally.

Why teams switch

Multi-model sprawl is a tax on every team

Without BaseModel
  • Six SDKs, six auth schemes, six billing portals
  • One provider outage takes down your app
  • No idea which team is burning the budget
  • Switching models means a refactor
  • Rate limits hit in production at the worst time
With BaseModel
  • One endpoint, one key, one invoice
  • Automatic failover keeps you online
  • Cost & usage broken down per key in real time
  • Change model="…" and ship
  • Load-balanced across providers & regions
How it works

Live in under five minutes

No infrastructure to manage. Keep your code, change the base URL, and you're routing through BaseModel.

1

Point your SDK

Set base_url to the BaseModel endpoint and use your bm- key. Works with the OpenAI SDK in any language.

2

Pick a routing policy

Choose auto, pin a model, or define rules by cost, latency, and fallbacks in the dashboard.

3

Ship & observe

Every request is routed, cached, and logged. Watch spend and latency per model, key, and region — live.

Enterprise-ready

Secure & scalable, on AWS

BaseModel runs on AWS with multi-region redundancy and is available on the AWS Marketplace — deploy in your own VPC so prompts and data never leave your account.

Built on AWSMulti-region, auto-scaling
SOC 2 Type IIAudited controls
VPC / PrivateLinkData stays in your account
GDPR readyEU & APAC regions
SSO / SAMLOkta, Entra, Google
99.99% uptimeSLA-backed
Pricing

Start free. Scale when you do.

No markup games — you pay providers at cost, plus a simple platform fee. Every plan includes routing & observability.

Developer

For side projects and early prototypes

$0/ mo
Start free
  • 10K requests / month
  • All 100+ models
  • Smart routing & failover
  • Basic analytics
  • Community support

Enterprise

For regulated & high-scale teams

Custom
Talk to sales
  • Unlimited requests
  • VPC / PrivateLink deployment
  • SSO/SAML & audit logs
  • Dedicated SLA & support
  • On AWS Marketplace
  • Solutions architect

Ship on every model by tonight.

Create a free key, change one line, and route your first request through BaseModel in minutes.

Free to start · No credit card required.