Capability · Embeddings
Embeddings in one shared vector space.
A single 4096-dimension embedding model in every region — for general and PHI traffic alike — so your vectors stay comparable across Switzerland, the EU and the US.
POST /v1/embeddings
What you get
Built for regulated workloads
Embeddings resolve to the canonical qwen3-embedding-8b (4096-d) in every region, for both general and PHI traffic: CH on Infomaniak, EU on Scaleway, and US on the Medishift worker (general traffic only). Because the model is the same everywhere, a vector embedded in Zurich is directly comparable to one embedded in Paris or Virginia, so you can build one index that spans jurisdictions.
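As a rough illustration of what "directly comparable" means in practice, the sketch below scores two vectors with cosine similarity and refuses to compare vectors of different dimensions. The names `cosine_similarity`, `in_canonical_space` and `EXPECTED_DIM` are illustrative assumptions, not part of the API; only the 4096-d figure comes from this page.

```python
import math

# Dimension of the canonical space described on this page (qwen3-embedding-8b).
EXPECTED_DIM = 4096


def cosine_similarity(a, b):
    """Cosine similarity of two vectors; raises if their dimensions differ."""
    if len(a) != len(b):
        raise ValueError(f"dimension mismatch: {len(a)} vs {len(b)}")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def in_canonical_space(vector):
    """True if the vector has the canonical 4096 components."""
    return len(vector) == EXPECTED_DIM
```

A single index could run `in_canonical_space` on every vector before insertion, so that a vector from a pinned 3584-d or 384-d model is never mixed with the 4096-d default.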
Dimension-safe routing
PHI embeddings, resident
Pin a different space
Availability & pricing
Where it runs, what it costs
Every route is region-resident and the PHI gate is enforced per call. Prices include the flat +10% gateway margin and mirror the live /v1/pricing rate card.
| Region | Provider | Model | Tier | Price |
|---|---|---|---|---|
| CH | Infomaniak | qwen3-embedding-8b · 4096-d | PHI | $0.088 / 1M |
| EU | Scaleway | qwen3-embedding-8b · 4096-d | PHI | $0.121 / 1M |
| US / WORLD | Medishift worker | qwen3-embedding-8b · 4096-d | General | Free tier |
| US | Fireworks | qwen3-embedding-8b · 4096-d | Staged | $0.018 / 1M |
| CH | Infomaniak | bge-multilingual-gemma2 · 3584-d | PHI | $0.055 / 1M |
| CH | Infomaniak | all-MiniLM-L12-v2 · 384-d | PHI | $0.022 / 1M |
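Prices are per 1M input tokens, gateway margin included. The canonical qwen3-embedding-8b is the default everywhere; bge-multilingual-gemma2 (3584-d) and all-MiniLM-L12-v2 (384-d) are pin-only CH options, each in its own dimension, so their vectors are not comparable with the 4096-d default.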
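For budgeting, the arithmetic is simply price times tokens divided by one million. The sketch below hard-codes the paid PHI rows from the table above; the `RATE_CARD` dictionary and the `estimate_cost_usd` helper are hypothetical names for illustration. The free-tier and staged rows are left out, and live prices should be read from the /v1/pricing rate card, not from this snapshot.

```python
# USD per 1M input tokens, copied from the table above.
# The gateway margin is already included, so no further markup is applied.
RATE_CARD = {
    ("CH", "qwen3-embedding-8b"): 0.088,
    ("EU", "qwen3-embedding-8b"): 0.121,
    ("CH", "bge-multilingual-gemma2"): 0.055,
    ("CH", "all-MiniLM-L12-v2"): 0.022,
}


def estimate_cost_usd(region, model, input_tokens):
    """Estimate the cost in USD of embedding `input_tokens` tokens on a route."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    price_per_million = RATE_CARD[(region, model)]  # KeyError for unknown routes
    return price_per_million * input_tokens / 1_000_000
```

For example, `estimate_cost_usd("EU", "qwen3-embedding-8b", 2_500_000)` comes to roughly $0.30.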
Try it
A real call, end to end
Embed a batch in the EU. The response is a standard OpenAI embedding object; the headers confirm the route and space.
curl https://phi-cloud.com/api/v1/embeddings \
-H "Authorization: Bearer $PHI_API_KEY" \
-H "X-Region: EU" \
-H "X-PHI: true" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen3-embedding-8b",
"input": ["patient presents with...", "follow-up in two weeks"]
}'

x-phi-routed: scaleway-qwen3-embedding-8b/EU/phi
x-phi-tier: phi
x-phi-usage: verified
# data[].embedding -> 4096 floats

Residency & vectors
- CH embeddings stay on Infomaniak via the Swiss /2/ AI surface.
- EU embeddings run on Scaleway; general EU traffic can fail over to the worker in the same 4096-d space.
- US and other regions embed on the worker (general only) until Fireworks PHI is enabled.
- Build one cross-jurisdiction index; the shared space keeps similarity scores meaningful.
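Since the response headers are what confirm the route, a client may want to check them before storing a vector. The sketch below assumes the `x-phi-routed` value always has the form `<backend>/<region>/<tier>`. That layout is inferred only from the single example response on this page and is not a documented contract. `Route`, `parse_routed_header` and `assert_resident` are hypothetical helper names.

```python
from typing import NamedTuple


class Route(NamedTuple):
    backend: str
    region: str
    tier: str


def parse_routed_header(value):
    """Split an x-phi-routed value of the ASSUMED form '<backend>/<region>/<tier>'."""
    parts = value.strip().split("/")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"unexpected x-phi-routed value: {value!r}")
    return Route(*parts)


def assert_resident(headers, expected_region, expected_tier):
    """Raise if the response headers do not show the expected region and tier."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): val for name, val in headers.items()}
    route = parse_routed_header(lowered["x-phi-routed"])
    if route.region != expected_region:
        raise RuntimeError(f"routed to {route.region}, expected {expected_region}")
    if route.tier != expected_tier or lowered.get("x-phi-tier") != expected_tier:
        raise RuntimeError(f"tier mismatch, expected {expected_tier}")
    return route
```

Called with the headers from the example response, `assert_resident(headers, "EU", "phi")` returns a `Route` whose `backend` is `scaleway-qwen3-embedding-8b`. A response routed elsewhere raises before the vector reaches the index.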
Ready when you are
Put embeddings & search in production — without giving up your data.
Spin up a key in minutes. The residency and PHI posture above arrives unchanged.
Free to test · Prepaid credits, no subscription · No data retained