
siati.ai™ — Developer Wiki

Swiss Infrastructure for Autonomous Trusted Intelligence.

OpenAI-compatible API, open-weight models served on hardware we own in Swiss datacenters. No CLOUD Act, no hyperscalers, no training on your data.

Get started in 30 seconds

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.siati.ai/v1",
    api_key="siati_...",  # from /dashboard/api-keys
)

resp = client.chat.completions.create(
    model="siati/llama-3.1-405b",
    messages=[{"role": "user", "content": "Grüezi, siati!"}],
)
print(resp.choices[0].message.content)
```

```typescript
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.siati.ai/v1",
  apiKey: process.env.SIATI_API_KEY,
});

const resp = await client.chat.completions.create({
  model: "siati/llama-3.1-405b",
  messages: [{ role: "user", content: "Grüezi, siati!" }],
});
console.log(resp.choices[0].message.content);
```

```bash
curl https://api.siati.ai/v1/chat/completions \
  -H "Authorization: Bearer siati_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "siati/llama-3.1-405b",
    "messages": [{"role": "user", "content": "Grüezi, siati!"}]
  }'
```

What's in here

  • Quickstart — from sign-up to first call in under a minute.
  • Models — catalog, tiers, hardware, pricing.
  • Chat completions — the main interface.
  • Embeddings — sovereign RAG over your PDFs.
  • Streaming — token-by-token via SSE.
  • Cookbook RAG — end-to-end example with bge-m3 + 405B.
  • Pricing — CHF, VAT included, invoices from C41.ch SAGL.
  • Errors — codes, retry strategy, headers.

Drop-in OpenAI

Swap `base_url` and `api_key` and leave the rest untouched. The `chat.completions`, `embeddings`, and `models` schemas are the same. The Python SDK, JS SDK, LangChain, and LlamaIndex all work without further code changes.
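As a rough illustration of that swap, the sketch below builds the same chat-completions request for two providers using only the Python standard library, and never sends it. The `PROVIDERS` table and the `build_chat_request` helper are hypothetical names made up for this example, not part of any SDK. The `SIATI_API_KEY` variable follows the TypeScript snippet above, and the OpenAI entry is assumed purely for comparison.

```python
import json
import os
import urllib.request

# Hypothetical provider table: only the base URL and the key's
# environment variable differ between the two entries.
PROVIDERS = {
    "openai": {"base_url": "https://api.openai.com/v1", "key_env": "OPENAI_API_KEY"},
    "siati": {"base_url": "https://api.siati.ai/v1", "key_env": "SIATI_API_KEY"},
}


def build_chat_request(provider: str, model: str, messages: list[dict]) -> urllib.request.Request:
    """Build (but do not send) a chat-completions request for one provider."""
    cfg = PROVIDERS[provider]
    api_key = os.environ.get(cfg["key_env"], "")
    # The JSON body is identical regardless of provider.
    body = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        url=f"{cfg['base_url']}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    messages = [{"role": "user", "content": "Grüezi, siati!"}]
    for name in PROVIDERS:
        req = build_chat_request(name, "siati/llama-3.1-405b", messages)
        print(name, req.get_method(), req.full_url)
```

The request body and method stay the same across entries; only the URL prefix and the bearer token change, which is why an existing integration should need no changes beyond those two settings.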

nFADP + NIS2 compliant

Your prompts and completions never leave Switzerland. Ever. No training on your data. Contracts under Swiss law, jurisdiction Mendrisio. DPA available within 1 business day.