siati.ai™ — Developer Wiki#
Swiss Infrastructure for Autonomous Trusted Intelligence.
OpenAI-compatible API, open-weight models served on hardware we own in Swiss datacenters. No CLOUD Act, no hyperscalers, no training on your data.
Get started in 30 seconds#
```python from openai import OpenAI
client = OpenAI( base_url="https://api.siati.ai/v1", api_key="siati_...", # from /dashboard/api-keys )
resp = client.chat.completions.create( model="siati/llama-3.1-405b", messages=[{"role": "user", "content": "Grüezi, siati!"}], ) print(resp.choices[0].message.content) ```
```typescript import OpenAI from "openai";
const client = new OpenAI({ baseURL: "https://api.siati.ai/v1", apiKey: process.env.SIATI_API_KEY, });
const resp = await client.chat.completions.create({ model: "siati/llama-3.1-405b", messages: [{ role: "user", content: "Grüezi, siati!" }], }); console.log(resp.choices[0].message.content); ```
bash
curl https://api.siati.ai/v1/chat/completions \
-H "Authorization: Bearer siati_..." \
-H "Content-Type: application/json" \
-d '{
"model": "siati/llama-3.1-405b",
"messages": [{"role": "user", "content": "Grüezi, siati!"}]
}'
What's in here#
- Quickstart — from sign-up to first call in under a minute.
- Models — catalog, tiers, hardware, pricing.
- Chat completions — the main interface.
- Embeddings — sovereign RAG over your PDFs.
- Streaming — token-by-token via SSE.
- Cookbook RAG — end-to-end example with bge-m3 + 405B.
- Pricing — CHF, VAT included, invoices from C41.ch SAGL.
- Errors — codes, retry strategy, headers.
Drop-in OpenAI
Swap base_url and api_key, leave the rest untouched. Same
chat.completions, embeddings, models schemas. Python SDK, JS SDK,
LangChain, LlamaIndex all work without code changes.
nFADP + NIS2 compliant
Your prompts and completions never leave Switzerland. Ever. No training on your data. Contracts under Swiss law, jurisdiction Mendrisio. DPA available within 1 business day.