Open Source · v0.5.0 · Real Provider Expansion

One Endpoint.
12 Providers.
Zero Downtime.

Connect your AI tools, apps, and agents to one OpenAI-compatible endpoint. 8Router handles provider routing, quota tracking, circuit breaker, and fallback so your tools stay connected when keys expire or models go down.

Get Started Open Dashboard Setup Guide

Install & Run

$ npm install -g 8router
$ 8router

# API live at http://localhost:8080/v1
# Dashboard at http://localhost:8080/8router/dashboard

Providers

Model Aliases

3-Tier

Fallback

Tests Passed

Built for messy real-world AI workflows.

Provider keys expire. Quotas reset. Models go down. 8Router keeps your tools connected through one local endpoint.

Chat / LLM

Route chat requests across connected providers with health-aware fallback.

Active

OpenAI-Compatible API

Use one /v1 endpoint with tools that support custom OpenAI base URLs.

Active

Provider Routing

Pick the right provider based on quota, health, latency, and route mode.

Active

Quota Tracker

Track key usage, limits, cooldowns, and exhausted providers in one place.

Active

Smart Model Picker

Use aliases like 8router/cheap, 8router/fast, and 8router/coding.

Beta

Tool Calling

Route tool-capable requests only to models that support function calling.

Beta

Streaming Fallback

Fallback before streaming starts, then keep response chunks stable.

Beta

Embeddings

Vector embeddings endpoint with multi-provider fallback.

Coming Soon

Image Generation

Image generation endpoint with provider routing.

Coming Soon

12 Providers. 3 Tiers. One Endpoint.

Route requests across premium, efficient, and local providers with health checks, quota awareness, circuit breaker, and fallback chains.

Premium

OpenAIActive

AnthropicActive

GeminiActive

xAISoon

Efficient

GroqActive

OpenRouterActive

MistralActive

DeepSeekBeta

Together AIBeta

CohereSoon

PerplexitySoon

Local

OllamaLocal

LM StudioLocal

vLLMLocal

Model Aliases

Use simple aliases instead of memorizing provider-specific model names.

8router/auto

Best available route based on health, quota, latency, and fallback rules.

8router/cheap

Chooses the cheapest healthy model available.

8router/fast

Chooses the lowest-latency healthy model.

8router/smart

Routes to the strongest available model for reasoning-heavy tasks.

8router/coding

Optimized for coding, debugging, tool use, and long context tasks.

8router/local

Local-only route. Never falls back to cloud providers.

Local mode requires Ollama, LM Studio, or another configured local provider.

How It Works

Install once, connect your providers, and never think about rate limits again.

Install

npm install -g 8router

Connect

Add API keys or local providers from the dashboard.

Point Your Tool

Set base URL in Cursor, Cline, or OpenWebUI:

http://localhost:8080/v1

Use Aliases

Use 8router/auto, 8router/cheap, 8router/coding, or 8router/local.

Monitor

Track quota, key health, fallback paths, and latency from the dashboard.

Built for Production

Every feature is tested and production-ready.

Real Provider Adapters

12 providers with dedicated adapters for request normalization, response parsing, and health checks.

Model Capability Map

30+ models tracked for vision, tools, streaming, embeddings, context length, and cost.

Smart Model Picker

Score-based model selection using cost, speed, quality, and latency data.

Cost Table per Model

Per-model pricing with input/output token costs, currency, and free quota tracking.

Latency Benchmark

p50/p95/p99 latency tracking per provider with automatic health monitoring.

Circuit Breaker

Automatic provider cooldown after 5 consecutive failures. 3-minute reset window.

Key Pool Health

Track key status: healthy, rate_limited, exhausted, invalid, cooldown. Auto-skip bad keys.

Quota Tracker

Per-provider quota tracking across 5h, daily, weekly, and monthly periods.

Secret Masking

API keys masked in all responses, logs, dashboard, and error messages.

Error Sanitization

Error messages sanitized to remove sensitive data before logging or returning.

OpenAI-Compatible API

Drop-in replacement for /v1/chat/completions, /v1/models, /v1/embeddings.

Setup Guide

Ready-to-copy configs for Cursor, Cline, Continue, OpenWebUI, Claude Code, and Codex.

Compatible With Your Tools

Drop-in compatible with tools that support custom OpenAI base URLs.

Cursor

Tested

Cline

Tested

Continue

Tested

Roo Code

Tested

Open WebUI

Tested

Claude Code

Compat

Codex CLI

Compat

Hermes Agent

Tested

Quick Config

Base URL: http://localhost:8080/v1
API Key:  ***
Model:    8router/auto

Start Routing in Under a Minute

Install

npm install -g 8router

Run

8router

Configure Your Tool

Base URL: http://localhost:8080/v1
Model:    8router/auto

Open Dashboard Setup Guide

One Endpoint.
12 Providers.
Zero Downtime.

Built for messy real-world AI workflows.

Chat / LLM

OpenAI-Compatible API

Provider Routing

Quota Tracker

Smart Model Picker

Tool Calling

Streaming Fallback

Embeddings

Image Generation

12 Providers. 3 Tiers. One Endpoint.

Model Aliases

How It Works

Install

Connect

Point Your Tool

Use Aliases

Monitor

Built for Production

Real Provider Adapters

Model Capability Map

Smart Model Picker

Cost Table per Model

Latency Benchmark

Circuit Breaker

Key Pool Health

Quota Tracker

Secret Masking

Error Sanitization

OpenAI-Compatible API

Setup Guide

Local-First Dashboard

Compatible With Your Tools

Production Safety

Tested Before Release

Start Routing in Under a Minute

One Endpoint. 12 Providers. Zero Downtime.

Built for messy real-world AI workflows.

Chat / LLM

OpenAI-Compatible API

Provider Routing

Quota Tracker

Smart Model Picker

Tool Calling

Streaming Fallback

Embeddings

Image Generation

12 Providers. 3 Tiers. One Endpoint.

Model Aliases

How It Works

Install

Connect

Point Your Tool

Use Aliases

Monitor

Built for Production

Real Provider Adapters

Model Capability Map

Smart Model Picker

Cost Table per Model

Latency Benchmark

Circuit Breaker

Key Pool Health

Quota Tracker

Secret Masking

Error Sanitization

OpenAI-Compatible API

Setup Guide

Local-First Dashboard

Compatible With Your Tools

Production Safety

Tested Before Release

Start Routing in Under a Minute

One Endpoint.
12 Providers.
Zero Downtime.