Open Source · v0.5.0 · Real Provider Expansion

One Endpoint.
12 Providers.
Zero Downtime.

Connect your AI tools, apps, and agents to one OpenAI-compatible endpoint. 8Router handles provider routing, quota tracking, circuit breaker, and fallback so your tools stay connected when keys expire or models go down.

Install & Run
$ npm install -g 8router
$ 8router

# API live at http://localhost:8080/v1
# Dashboard at http://localhost:8080/8router/dashboard
12
Providers
6
Model Aliases
3-Tier
Fallback
41
Tests Passed

Built for messy real-world AI workflows.

Provider keys expire. Quotas reset. Models go down. 8Router keeps your tools connected through one local endpoint.

Chat / LLM

Route chat requests across connected providers with health-aware fallback.

Active

OpenAI-Compatible API

Use one /v1 endpoint with tools that support custom OpenAI base URLs.

Active

Provider Routing

Pick the right provider based on quota, health, latency, and route mode.

Active

Quota Tracker

Track key usage, limits, cooldowns, and exhausted providers in one place.

Active

Smart Model Picker

Use aliases like 8router/cheap, 8router/fast, and 8router/coding.

Beta

Tool Calling

Route tool-capable requests only to models that support function calling.

Beta

Streaming Fallback

Fallback before streaming starts, then keep response chunks stable.

Beta

Embeddings

Vector embeddings endpoint with multi-provider fallback.

Coming Soon

Image Generation

Image generation endpoint with provider routing.

Coming Soon

12 Providers. 3 Tiers. One Endpoint.

Route requests across premium, efficient, and local providers with health checks, quota awareness, circuit breaker, and fallback chains.
Premium
OpenAIActive
AnthropicActive
GeminiActive
xAISoon
Efficient
GroqActive
OpenRouterActive
MistralActive
DeepSeekBeta
Together AIBeta
CohereSoon
PerplexitySoon
Local
OllamaLocal
LM StudioLocal
vLLMLocal

Model Aliases

Use simple aliases instead of memorizing provider-specific model names.
8router/auto

Best available route based on health, quota, latency, and fallback rules.

8router/cheap

Chooses the cheapest healthy model available.

8router/fast

Chooses the lowest-latency healthy model.

8router/smart

Routes to the strongest available model for reasoning-heavy tasks.

8router/coding

Optimized for coding, debugging, tool use, and long context tasks.

8router/local

Local-only route. Never falls back to cloud providers.

Local mode requires Ollama, LM Studio, or another configured local provider.

How It Works

Install once, connect your providers, and never think about rate limits again.
1

Install

npm install -g 8router
2

Connect

Add API keys or local providers from the dashboard.

3

Point Your Tool

Set base URL in Cursor, Cline, or OpenWebUI:

http://localhost:8080/v1
4

Use Aliases

Use 8router/auto, 8router/cheap, 8router/coding, or 8router/local.

5

Monitor

Track quota, key health, fallback paths, and latency from the dashboard.

Built for Production

Every feature is tested and production-ready.

Real Provider Adapters

12 providers with dedicated adapters for request normalization, response parsing, and health checks.

Model Capability Map

30+ models tracked for vision, tools, streaming, embeddings, context length, and cost.

Smart Model Picker

Score-based model selection using cost, speed, quality, and latency data.

Cost Table per Model

Per-model pricing with input/output token costs, currency, and free quota tracking.

Latency Benchmark

p50/p95/p99 latency tracking per provider with automatic health monitoring.

Circuit Breaker

Automatic provider cooldown after 5 consecutive failures. 3-minute reset window.

Key Pool Health

Track key status: healthy, rate_limited, exhausted, invalid, cooldown. Auto-skip bad keys.

Quota Tracker

Per-provider quota tracking across 5h, daily, weekly, and monthly periods.

Secret Masking

API keys masked in all responses, logs, dashboard, and error messages.

Error Sanitization

Error messages sanitized to remove sensitive data before logging or returning.

OpenAI-Compatible API

Drop-in replacement for /v1/chat/completions, /v1/models, /v1/embeddings.

Setup Guide

Ready-to-copy configs for Cursor, Cline, Continue, OpenWebUI, Claude Code, and Codex.

Local-First Dashboard

Monitor requests, provider health, fallback paths, and key pool status locally.
8Router Dashboard localhost:8080/8router/dashboard
Total Requests
24,891
Tokens Used
1.2M
Avg Latency
142ms
groq42%
openrouter28%
mistral18%
8Router

Compatible With Your Tools

Drop-in compatible with tools that support custom OpenAI base URLs.
Cursor
Cursor
Tested
Cline
Cline
Tested
Continue
Continue
Tested
Roo Code
Roo Code
Tested
Open WebUI
Open WebUI
Tested
Claude Code
Claude Code
Compat
Codex CLI
Codex CLI
Compat
Hermes Agent
Hermes Agent
Tested
Quick Config
Base URL: http://localhost:8080/v1
API Key:  ***
Model:    8router/auto

Production Safety

Secret masking across logs and UI
Error sanitization before logging
Local-only admin endpoints
Circuit breaker for unstable providers
Quota and key health tracking
Backup and config export
If you expose 8Router publicly, enable authentication and never expose raw provider keys.

Tested Before Release

Every release is validated with router tests, OpenAI-compatible smoke tests, doctor diagnostics, and admin endpoint checks.
18/18
Router Tests
12/12
OpenAI Compat
13/13
Doctor Checks
43
Total Checks

Start Routing in Under a Minute

1
Install
npm install -g 8router
2
Run
8router
3
Configure Your Tool
Base URL: http://localhost:8080/v1
Model:    8router/auto