# A3M Router — AI-Gateway & LLM Router

## One sentence
Open-source LLM router with parallel multi-LLM execution, 47+ providers, 99.5% routing accuracy, 62% cost savings. 19.5 KB, zero ML dependencies.

## Core features
- Parallel multi-LLM execution (unique, no competitor does this)
- RouteLLM-style routing: 12 signals → difficulty tier → cheapest capable model
- 47+ providers: OpenAI, Anthropic, Groq, DeepSeek, NVIDIA, Together, OpenRouter, Gemini, Mistral, Cohere, and more
- Semantic cache (30%+ hit rate)
- Budget enforcement with per-query cost tracking
- Circuit breaker with auto failover
- Prompt injection guardrails (17 patterns)
- Persistent episodic memory (JSON file, auto-save)
- Query-type presets: fast, creative, deep thinking, code

## Quick start
```bash
npm install adaptive-memory-multi-model-router
npx a3m-router route "Write a Python sort function"
npx a3m-router serve  # OpenAI-compatible proxy at :8787
```

## Key files
- `src/index.ts` — Main entry, all exports
- `src/routing/ensembleVoting.ts` — P0: Parallel ensemble voting
- `src/routing/queryTypePresets.ts` — P1: Query-type presets
- `src/memory/episodicMemory.ts` — P3: Persistent memory
- `src/providers/providerConfig.ts` — 47 provider configurations
- `src/routing/advancedRouter.ts` — RouteLLM-style routing with 12 signals
- `docs/BENCHMARK.md` — Independent benchmark data (third-party via llm-gateway-bench)
- `docs/CORE_VISION_PRD.md` — Product requirements document

## Benchmark (independent, third-party)
- Direct to Groq: 138ms TTFT
- Through A3M (forced): 234ms (+96ms overhead)
- Through A3M (auto route): 374ms (+236ms overhead for full intelligence)
- 62% cost savings vs all-premium routing
- Tool: llm-gateway-bench (not our own scripts)

## Unique differentiator
Parallel ensemble execution: runs N providers simultaneously, scores results on specificity/structure/relevance, returns best answer with reasoning. No other router (litellm, one-api, LibreChat, gpt-researcher) does this.

## Competitive landscape
- litellm (48K★): Sequential fallback only, Python-only
- one-api (34K★): API key management, Chinese-focused
- LibreChat (20K★): UI-focused, single-provider
- gpt-researcher (20K★): Deep research, single-provider per query
- A3M Router: Parallel ensemble + confidence scoring (unique)

## Growth
- 10,024 npm downloads in 14 days
- 72 versions published
- Zero marketing budget
