Merius

Introduction

One OpenAI-compatible API for open models running on Merius's own GPUs. Authenticate, point your client at the base URL, and call any model we serve.

The Merius API speaks the OpenAI HTTP schema end to end: the same endpoints, the same request and response bodies, the same streaming format. You do not install a Merius SDK or learn a new client — any OpenAI-compatible library works once you point it at our base URL and pass a Merius API key.
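Because the streaming format matches OpenAI's, a streamed chat completion should arrive as server-sent events: a series of `data: {json}` lines closed by a literal `data: [DONE]`. The sketch below shows one way to collect the generated text from such a stream. It assumes the standard OpenAI chunk shape (`choices[].delta.content`) and a single choice per chunk. The helper name `collect_stream_text` and the sample events are illustrative, written by hand rather than captured from the Merius API.

```python
import json
from typing import Iterable


def collect_stream_text(lines: Iterable[str]) -> str:
    """Join the content deltas from OpenAI-style SSE lines into one string."""
    parts = []
    for raw in lines:
        line = raw.strip()
        # Skip blank separator lines and anything that is not a data event.
        if not line.startswith("data:"):
            continue
        payload = line[len("data:"):].strip()
        # The OpenAI streaming format ends the stream with a literal [DONE].
        if payload == "[DONE]":
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            content = choice.get("delta", {}).get("content")
            if content:
                parts.append(content)
    return "".join(parts)


# Hand-written sample events in the OpenAI chunk shape (not real API output).
sample = [
    'data: {"choices": [{"index": 0, "delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"index": 0, "delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"index": 0, "delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
print(collect_stream_text(sample))
```

In practice an OpenAI-compatible client library does this parsing for you; the sketch only makes the wire format concrete.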

Under that familiar surface, every request runs on hardware we own and operate: B200/B300 (NVIDIA Blackwell) GPUs in EU and US datacenters. You get transparent per-token pricing, EU data residency on request, and no prompt or completion retention.

Already using the OpenAI SDK?

Skip ahead to Authentication and Quickstart — the only lines that change are your base URL and key.

Make your first request

Three steps and you are calling a model:

  1. Create a Merius account and generate an API key in the dashboard.
  2. Point your OpenAI client at the Merius base URL (the one line that changes).
  3. Call /chat/completions with any model slug we serve.

A complete request:

curl https://api.merius.ai/v1/chat/completions \
  -H "Authorization: Bearer $MERIUS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-5.1",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
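
For readers who want to see the same request without curl or an SDK, here is a minimal sketch using only the Python standard library. The URL, headers, model slug, and message body come from the curl example above. The function names `build_chat_request` and `send` are hypothetical, and reading the reply from `choices[0].message.content` assumes the response follows the OpenAI chat completion schema.

```python
import json
import os
import urllib.request

# Base URL taken from the curl example above.
BASE_URL = "https://api.merius.ai/v1"


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a POST request to /chat/completions."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)


if __name__ == "__main__":
    key = os.environ.get("MERIUS_API_KEY")
    # Only call the API when a key is configured in the environment.
    if key:
        reply = send(build_chat_request(key, "z-ai/glm-5.1", "Hello"))
        # Assumes an OpenAI-style response body.
        print(reply["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes it easy to inspect the URL, headers, and body before any network call is made.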

The Quickstart shows the same call in Python and TypeScript. Authentication covers creating and storing keys.
