← API Root

API Documentation

AI API Gateway — OpenAI-compatible interface to 4 providers · 5,990+ models

No API Key Required

All providers use internal keys. Just send your request — no Authorization header needed.

For AI Agents

You are an AI agent. Here is everything you need:

Base URL:

Text/Chat: POST /v1/chat/completions with "provider": "ollama" (fastest) or "provider": "opencode-go" (best QA)

Vision QA: POST /v1/chat/completions with "provider": "opencode-go", "model": "kimi-k2.5", "max_tokens": 200000. Send image as base64 in content array.

Image Gen: POST /v1/images/generations with "provider": "arli"

Models: GET /v1/models?provider=opencode-go|ollama|featherless|arli|arli-image

Analytics: GET /v1/analytics — usage stats, model performance, call history (auto-logged)

Model Recommendations

TaskModelProviderWhy
Visual QA / Screenshot Reviewkimi-k2.5opencode-go200K tokens, best analysis quality, handles thinking
Visual QA (fast)kimi-k2.5ollama2.4x faster, same model, /api/generate endpoint
Fastest Textgemma3:4bollama0.9s response time
Best Quality Textgpt-oss:120bollama111.8 t/s throughput, 2.2s
Best Reasoningdeepseek-v3.2ollama688B params, 5.9s
Image GenerationFLUX.2-klein-4Barli16.6s, high quality
Image EditingFLUX.2-klein-4BarliBest quality edits
Image Upscaling4x-UltraSharparli4x upscaling
Video GenerationWan 2.1 1.3Bhf-inferenceFree, fast video generation
Codingglm-5opencode-goOptimized for code
Fallback TextLlama-3.3-70B-Instructarli6.7s, reliable

Endpoints

MethodPathDescription
POST/v1/chat/completionsChat/text/vision generation (all providers)
POST/v1/completionsText completion
POST/v1/images/generationsImage generation (Arli)
POST/v1/images/editsImage editing (Arli)
POST/v1/images/upscaleImage upscaling (Arli)
GET/v1/modelsList all models (filter by ?provider=)
POST/v1/tokenizeToken counting (Featherless)
GET/v1/healthHealth check
GET/v1/statusLive provider status & model counts
GET/v1/routesAll available endpoints
GET/v1/recommendationsBest model per task (filter ?category=)
GET/v1/analyticsAPI usage analytics and model performance dashboard

Examples

Visual QA Review (best: Kimi K2.5 via OpenCode Go)

Send a screenshot, get a detailed QA report with issues and fixes.

curl -X POST /v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kimi-k2.5",
    "provider": "opencode-go",
    "max_tokens": 4000,
    "messages": [
      {
        "role": "user",
        "content": [
          {"type": "image_url", "image_url": {"url": "data:image/png;base64,YOUR_IMAGE_BASE64"}},
          {"type": "text", "text": "Perform a visual QA review. List Critical/Moderate/Minor issues with fixes."}
        ]
      }
    ]
  }'

Vision QA via Ollama (faster)

curl -X POST /v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kimi-k2.5",
    "provider": "ollama",
    "max_tokens": 4000,
    "messages": [
      {
        "role": "user",
        "content": [
          {"type": "image_url", "image_url": {"url": "data:image/png;base64,YOUR_IMAGE_BASE64"}},
          {"type": "text", "text": "Describe this image."}
        ]
      }
    ]
  }'

Generate an Image

curl -X POST /v1/images/generations \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "A serene mountain landscape at sunset",
    "model": "FLUX.2-klein-4B",
    "n": 1,
    "size": "1024x1024"
  }'

Quick Text Generation

curl -X POST /v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemma3:4b",
    "provider": "ollama",
    "messages": [{"role": "user", "content": "Explain quantum computing in 2 sentences."}],
    "max_tokens": 100
  }'

List Models by Provider

# All providers
curl /v1/models

# OpenCode Go only (Kimi K2.5, GLM-5, MiniMax)
curl /v1/models?provider=opencode-go

# Ollama Cloud (34 models)
curl /v1/models?provider=ollama

# Arli AI image models
curl /v1/models?provider=arli-image

Check System Status

curl /v1/status | python3 -m json.tool

Providers

OpenCode GoOnlineKimi K2.5, GLM-5, MiniMax M2.5/M2.7?provider=opencode-go
Ollama CloudOnline34 models, vision + reasoning, 3 concurrent?provider=ollama
Featherless AIOnline5,757 text models, 15 categories?provider=featherless (default)
Arli AIOnline117 text + 80 image models?provider=arli

Related Services