Capability · Chat completions
Chat and reasoning, region-pinned.
One OpenAI-compatible chat endpoint over every model family — open-weight and frontier — with the data-residency and PHI gate enforced on every call.
POST /v1/chat/completions
What you get
Built for regulated workloads
Point any OpenAI SDK at the gateway and call chat the way you already do. The model can be a concrete row (claude-sonnet-4-6, gemini-2-5-flash) or a provider-agnostic canonical alias (gemma-4, qwen3.5, glm-5.2) that resolves to whichever provider hosts it in the region you pinned. Set X-Region and X-PHI per request; everything else is standard.
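As a rough illustration of "two headers on a standard request", the sketch below builds (but does not send) a chat request using only the Python standard library. The URL is copied from the curl example on this page. The helper name `build_chat_request`, the placeholder key and the `"false"` spelling for non-PHI traffic are assumptions made for illustration, not documented gateway behaviour.

```python
import json
import urllib.request

# Base URL taken from the curl example on this page; adjust if yours differs.
GATEWAY_URL = "https://phi-cloud.com/api/v1/chat/completions"


def build_chat_request(api_key, model, messages, region, phi):
    """Build (but do not send) a chat completion request for the gateway."""
    body = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        GATEWAY_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-Region": region,                   # e.g. "CH", "EU", "US"
            "X-PHI": "true" if phi else "false",  # "false" is an assumed spelling
        },
    )


req = build_chat_request(
    api_key="sk-example",  # placeholder, not a real key
    model="gemma-4",       # canonical alias from the text above
    messages=[{"role": "user", "content": "Summarise this discharge note..."}],
    region="CH",
    phi=True,
)
# urllib.request.urlopen(req) would send it; omitted so the sketch runs offline.
```

With an OpenAI SDK the same two headers would normally be supplied through the client's extra or default header options, with the base URL pointed at the gateway.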
SSE streaming, billed correctly
In-region failover
Canonical model aliases
Hard PHI gate
Sandboxed code execution
Availability & pricing
Where it runs, what it costs
Every route is region-resident and the PHI gate is enforced per call. Prices include the flat +10% gateway margin and mirror the live /v1/pricing rate card.
| Region | Provider | Model | Tier | Price |
|---|---|---|---|---|
| CH | Infomaniak | gemma-4 (31B IT) | PHI | $0.24 / $0.48 |
| CH | Infomaniak | qwen3.5 (397B) | PHI | $0.97 / $4.36 |
| EU | Scaleway | gemma-4 (26B IT) | PHI | $0.30 / $0.59 |
| EU | Scaleway | qwen3.6 (35B) | PHI | $0.47 / $1.78 |
| US | Together | qwen3.5 (397B) | General | $0.77 / $3.85 |
| US | Together | glm-5.2 | General | $0.66 / $2.41 |
| US / EU | Anthropic | claude-sonnet-4-6 | General | $3.30 / $16.50 |
| WORLD | | gemini-2-5-flash | General | $0.33 / $2.75 |
Token prices are per 1M tokens (input / output), gateway margin included. US chat on protected health information returns phi_blocked today — no US chat provider is PHI-verified yet (Azure OpenAI is staged). Full rate card at /pricing.
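To make the per-1M-token arithmetic concrete, here is a small sketch that estimates the cost of one call from two rows of the table and backs out the pre-margin price. The function names and the token counts are illustrative. The pre-margin calculation assumes the listed price is exactly the base price multiplied by 1.10.

```python
from decimal import Decimal

# (input, output) USD per 1M tokens, copied from the table above.
PRICES = {
    ("CH", "gemma-4"): (Decimal("0.24"), Decimal("0.48")),
    ("US / EU", "claude-sonnet-4-6"): (Decimal("3.30"), Decimal("16.50")),
}
MILLION = Decimal(1_000_000)
MARGIN = Decimal("1.10")  # flat +10% gateway margin


def estimate_cost(region, model, input_tokens, output_tokens):
    """Return the estimated USD cost of one call at the listed rates."""
    price_in, price_out = PRICES[(region, model)]
    return (price_in * input_tokens + price_out * output_tokens) / MILLION


def pre_margin(price):
    """Back out the price before the margin (assumes price = base * 1.10)."""
    return price / MARGIN


# 1,000 input tokens and 500 output tokens on gemma-4 in CH:
# (0.24 * 1000 + 0.48 * 500) / 1,000,000 = 0.00048 USD
cost = estimate_cost("CH", "gemma-4", input_tokens=1_000, output_tokens=500)

# 3.30 / 1.10 = 3 USD per 1M input tokens before the margin
base_in = pre_margin(Decimal("3.30"))
```

`Decimal` is used instead of floats so that small per-call amounts do not pick up binary rounding noise.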
Try it
A real call, end to end
A PHI chat call in Switzerland is two headers on a standard request. The response tells you exactly where it ran.
curl https://phi-cloud.com/api/v1/chat/completions \
-H "Authorization: Bearer $PHI_API_KEY" \
-H "X-Region: CH" \
-H "X-PHI: true" \
-H "Content-Type: application/json" \
-d '{
"model": "gemma-4",
"messages": [
{"role": "user", "content": "Summarise this discharge note..."}
]
}'
x-phi-routed: infomaniak-gemma-4-31b/CH/phi
x-phi-tier: phi
x-phi-attempts: 1
x-phi-usage: verified
x-phi-cost-micro: 412
Residency & gating
- CH PHI chat resolves to Infomaniak (Swiss-resident, nFADP/GDPR DPA verified).
- EU PHI chat resolves to Scaleway (EU-resident, GDPR Art. 28 DPA, zero-retention, no-train).
- US, UK and every other region carry general traffic on the resident cloud; US PHI chat is phi_blocked until a US chat provider is verified.
- No prompt or completion is ever logged or persisted by the proxy.
- The n parameter is dropped (multiple choices would break single-response billing).
- Streaming is disabled on the /math sandbox route, which must inspect tool calls.
- Vision rides this same endpoint as image_url content parts (see the Vision capability).
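A client can check the residency claims above against the x-phi-* response headers shown in the example call. The sketch below parses those headers and raises if the call ran somewhere other than the pinned region. It assumes x-phi-routed has the layout `<route>/<region>/<tier>`, inferred from the single sample value on this page. Both function names are hypothetical, and the unit of x-phi-cost-micro is not interpreted here.

```python
def parse_phi_headers(headers):
    """Split the x-phi-* response headers into typed fields."""
    h = {k.lower(): v.strip() for k, v in headers.items()}
    # Assumed layout of x-phi-routed: "<route>/<region>/<tier>"
    route, region, tier = h["x-phi-routed"].rsplit("/", 2)
    return {
        "route": route,
        "region": region,
        "tier": tier,
        "attempts": int(h["x-phi-attempts"]),
        "usage": h["x-phi-usage"],
        "cost_micro": int(h["x-phi-cost-micro"]),
    }


def check_residency(parsed, requested_region, phi):
    """Raise if the call did not run where, and how, it was pinned."""
    if parsed["region"] != requested_region:
        raise RuntimeError(
            f"ran in {parsed['region']}, expected {requested_region}"
        )
    if phi and parsed["tier"] != "phi":
        raise RuntimeError(f"PHI call served on tier {parsed['tier']!r}")


# Header values copied from the example response on this page.
sample = {
    "x-phi-routed": "infomaniak-gemma-4-31b/CH/phi",
    "x-phi-tier": "phi",
    "x-phi-attempts": "1",
    "x-phi-usage": "verified",
    "x-phi-cost-micro": "412",
}
info = parse_phi_headers(sample)
check_residency(info, requested_region="CH", phi=True)  # passes silently
```

The gateway enforces the gate server-side. A check like this is only a belt-and-braces assertion for audit logs or tests.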
Ready when you are
Put chat & reasoning in production — without giving up your data.
Spin up a key in minutes. The residency and PHI posture above arrives unchanged.
Free to test · Prepaid credits, no subscription · No data retained