Overview

switchAILocal supports multiple AI providers through three authentication methods. Choose the method that works best for your use case.

Authentication Methods

Most users should start with CLI Wrappers (Option A) for the fastest setup with zero configuration.

Option A: CLI Wrappers

If you already have the gemini, claude, codex, or vibe CLI tools installed and authenticated, switchAILocal uses them automatically.
1. Verify CLI Installation

Check that your CLI tools are installed and working:
gemini --version
claude --version
codex --version
2. Use CLI Prefix

Reference the CLI provider by putting its CLI prefix (for example, geminicli:) in front of the model name:
curl http://localhost:18080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-test-123" \
  -d '{
    "model": "geminicli:gemini-2.5-pro",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
3. Supported Providers

Available CLI providers:
| Provider | CLI Tool | Prefix | Example Model |
| --- | --- | --- | --- |
| Google Gemini | gemini | geminicli: | geminicli:gemini-2.5-pro |
| Anthropic Claude | claude | claudecli: | claudecli:claude-sonnet-4 |
| OpenAI Codex | codex | codex: | codex:gpt-4 |
| Mistral Vibe | vibe | vibe: | vibe:mistral-large |
| OpenCode | opencode | opencode: | opencode:build |
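Routing is driven by the prefix before the first colon; everything after it is passed to the provider as the model id. A minimal sketch of that split (the helper name is illustrative, not part of switchAILocal):

```python
def split_model(model: str) -> tuple[str, str]:
    """Split 'prefix:model-id' into (prefix, model-id).

    The prefix selects the provider; the remainder (which may itself
    contain colons, e.g. Ollama tags) is passed through unchanged.
    """
    prefix, _, model_id = model.partition(":")
    return prefix, model_id

print(split_model("geminicli:gemini-2.5-pro"))  # ('geminicli', 'gemini-2.5-pro')
```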

Option B: API Keys (Standard)

For cloud API providers, add API keys directly to your config.yaml.
1. Copy Example Config

cp config.example.yaml config.yaml
2. Add Provider Credentials

Edit config.yaml and add your API keys:
gemini-api-key:
  - api-key: "AIzaSy..."
    prefix: "google"
    base-url: "https://generativelanguage.googleapis.com"
3. Use Without CLI Suffix

curl http://localhost:18080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-test-123" \
  -d '{"model": "gemini:gemini-2.5-pro", "messages": [...]}'
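The same call can be scripted; a standard-library sketch that only builds the request object (the URL, bearer token, and model are the placeholder values from the example above, so adjust them to your setup):

```python
import json
import urllib.request

def build_chat_request(model: str, messages: list[dict],
                       base_url: str = "http://localhost:18080",
                       api_key: str = "sk-test-123") -> urllib.request.Request:
    """Build a chat-completions request for the local switchAILocal endpoint."""
    body = json.dumps({"model": model, "messages": messages}).encode()
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

req = build_chat_request("gemini:gemini-2.5-pro",
                         [{"role": "user", "content": "Hello!"}])
# Send with urllib.request.urlopen(req) once the server is running.
```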

Option C: OAuth Login (Advanced)

For users who want switchAILocal to manage OAuth tokens directly without CLI tools.
This method requires GEMINI_CLIENT_ID and GEMINI_CLIENT_SECRET environment variables. Most users should use Option A or Option B instead.
1. Set Environment Variables

export GEMINI_CLIENT_ID="your-client-id"
export GEMINI_CLIENT_SECRET="your-client-secret"
2. Run OAuth Login

./switchAILocal --login
3. Complete Browser Authentication

A browser window will open for you to authorize switchAILocal. After approval, tokens are stored in ~/.switchailocal/.

Local Model Providers

Ollama

Connect to locally running Ollama models.
1. Enable Ollama in Config

config.yaml
ollama:
  enabled: true
  base-url: "http://localhost:11434"
  auto-discover: true  # Automatically fetch available models
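With auto-discover enabled, models installed in Ollama appear under the ollama: prefix without being listed individually. A sketch of that mapping, assuming Ollama's /api/tags response shape ({"models": [{"name": ...}]}); the function name is illustrative:

```python
def discovered_model_ids(tags_response: dict) -> list[str]:
    """Map Ollama's /api/tags payload to prefixed switchAILocal model ids."""
    return [f"ollama:{m['name']}" for m in tags_response.get("models", [])]

sample = {"models": [{"name": "llama3.2"}, {"name": "qwen:0.5b"}]}
print(discovered_model_ids(sample))  # ['ollama:llama3.2', 'ollama:qwen:0.5b']
```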
2. Start Ollama

ollama serve
3. Pull Models

ollama pull llama3.2
ollama pull qwen:0.5b
4. Use Ollama Models

curl http://localhost:18080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-test-123" \
  -d '{"model": "ollama:llama3.2", "messages": [...]}'

LM Studio

Connect to LM Studio for local model hosting.
config.yaml
lmstudio:
  enabled: true
  base-url: "http://localhost:1234/v1"
  auto-discover: true

OpenCode

Integrate with OpenCode for specialized development tasks.
config.yaml
opencode:
  enabled: true
  base-url: "http://localhost:4096"
  default-agent: "build"

OpenAI-Compatible Providers

Connect any OpenAI-compatible API endpoint.
config.yaml
openai-compatibility:
  - name: "groq"
    prefix: "groq"
    base-url: "https://api.groq.com/openai/v1"
    api-key-entries:
      - api-key: "gsk_..."
  
  - name: "openrouter"
    prefix: "or"
    base-url: "https://openrouter.ai/api/v1"
    api-key-entries:
      - api-key: "sk-or-v1-..."
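Each entry's prefix maps incoming model names to its base-url, with the prefix stripped before the request is forwarded upstream. A sketch of that lookup using the two providers from the config above (the function name is illustrative):

```python
# Prefix-to-endpoint table derived from the openai-compatibility entries above.
PROVIDERS = {
    "groq": "https://api.groq.com/openai/v1",
    "or": "https://openrouter.ai/api/v1",
}

def route(model: str) -> tuple[str, str]:
    """Resolve 'prefix:model' to (base-url, upstream model name)."""
    prefix, _, upstream = model.partition(":")
    if prefix not in PROVIDERS:
        raise KeyError(f"no provider configured for prefix {prefix!r}")
    return PROVIDERS[prefix], upstream
```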

Load Balancing

Configure multiple credentials per provider for automatic load balancing.
config.yaml
gemini-api-key:
  - api-key: "AIzaSy...account1"
  - api-key: "AIzaSy...account2"
  - api-key: "AIzaSy...account3"

routing:
  strategy: "round-robin"  # or "fill-first"

- round-robin: Distributes requests evenly across all credentials
- fill-first: Uses the first credential until its quota is exhausted, then moves to the next
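The two strategies differ only in how the next credential is picked. A sketch of both, assuming quota exhaustion is signaled per credential (class and method names are illustrative, not switchAILocal internals):

```python
import itertools

class CredentialPool:
    """Pick the next API key under round-robin or fill-first routing."""

    def __init__(self, keys: list[str], strategy: str = "round-robin"):
        self.keys = keys
        self.strategy = strategy
        self.exhausted: set[str] = set()     # keys whose quota ran out
        self._cycle = itertools.cycle(keys)  # round-robin iterator

    def mark_exhausted(self, key: str) -> None:
        self.exhausted.add(key)

    def next_key(self) -> str:
        if self.strategy == "fill-first":
            # Use the first credential with remaining quota, in config order.
            for key in self.keys:
                if key not in self.exhausted:
                    return key
        else:
            # round-robin: advance the cycle, skipping exhausted keys.
            for _ in range(len(self.keys)):
                key = next(self._cycle)
                if key not in self.exhausted:
                    return key
        raise RuntimeError("all credentials exhausted")
```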

Model Aliasing

Create friendly aliases for frequently used models.
config.yaml
switchai-api-key:
  - api-key: "sk-lf-..."
    models:
      - name: "openai/gpt-oss-120b"
        alias: "fast"
      - name: "deepseek-reasoner"
        alias: "reasoner"
Use aliases in requests:
curl http://localhost:18080/v1/chat/completions \
  -d '{"model": "fast", "messages": [...]}'
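Alias lookup happens before provider routing: if the requested model matches a configured alias, it is replaced by the full model name, and anything else passes through untouched. A sketch with the aliases from the config above (the helper name is illustrative):

```python
# Alias table derived from the switchai-api-key models list above.
ALIASES = {
    "fast": "openai/gpt-oss-120b",
    "reasoner": "deepseek-reasoner",
}

def resolve_model(requested: str) -> str:
    """Replace an alias with its full model name; pass others through."""
    return ALIASES.get(requested, requested)

print(resolve_model("fast"))             # openai/gpt-oss-120b
print(resolve_model("ollama:llama3.2"))  # ollama:llama3.2 (unchanged)
```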

Verification

List all available models to verify provider setup:
curl http://localhost:18080/v1/models \
  -H "Authorization: Bearer sk-test-123"
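The /v1/models endpoint returns the OpenAI-style list shape ({"object": "list", "data": [{"id": ...}]}), so each provider's models can be checked by id. A sketch against an illustrative payload:

```python
def model_ids(models_response: dict) -> list[str]:
    """Collect model ids from an OpenAI-style /v1/models payload."""
    return [m["id"] for m in models_response.get("data", [])]

# Illustrative response; the real list depends on your configured providers.
sample = {
    "object": "list",
    "data": [{"id": "geminicli:gemini-2.5-pro"}, {"id": "ollama:llama3.2"}],
}
print(model_ids(sample))  # ['geminicli:gemini-2.5-pro', 'ollama:llama3.2']
```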
Check provider health status:
curl http://localhost:18080/v0/management/heartbeat/status \
  -H "X-Management-Key: your-secret-key"

Next Steps

- Docker Deployment: Deploy switchAILocal with Docker for production use
- Management Dashboard: Use the web UI to configure providers visually