Auto-route tasks to the cheapest Claude model that works correctly. Three-tier progression: Haiku → Sonnet → Opus. Classify before responding. HAIKU (default): factual Q&A, greetings, reminders, status checks, lookups, simple file ops, heartbeats, casual chat, 1-2 sentence tasks. ESCALATE TO SONNET: code >10 lines, analysis, comparisons, planning, reports, multi-step reasoning, tables, long writing >3 paragraphs, summarization, research synthesis, most user conversations. ESCALATE TO OPUS: architecture decisions, complex debugging, multi-file refactoring, strategic planning, nuanced judgment, deep research, critical production decisions. Rule: If a human needs >30 seconds of focused thinking, escalate. If Sonnet struggles with complexity, go to Opus. Save 50-90% on API costs by starting cheap and escalating only when needed.
Install
Documentation
Smart Model Switching
Three-tier Claude routing: Haiku → Sonnet → OpusStart with the cheapest model. Escalate only when needed. Save 50-90% on API costs.
The Golden Rule
> If a human would need more than 30 seconds of focused thinking, escalate from Haiku to Sonnet.
> If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Opus.
Cost Reality
| Model | Input | Output | Relative Cost |
|-------|-------|--------|---------------|
| Haiku | \$0.25/M | \$1.25/M | 1x (baseline) |
| Sonnet | \$3.00/M | \$15.00/M | 12x |
| Opus | \$15.00/M | \$75.00/M | 60x |
Bottom line: Wrong model selection wastes money OR time. Haiku for simple, Sonnet for standard, Opus for complex.---
💚 HAIKU — Default for Simple Tasks
Stay on Haiku for:- -Factual Q&A — "what is X", "who is Y", "when did Z"
- -Quick lookups — definitions, unit conversions, short translations
- -Status checks — calendar, file reads, session monitoring
- -Heartbeats — periodic checks, HEARTBEAT_OK responses
- -Memory & reminders — "remember this", "remind me to..."
- -Casual conversation — greetings, small talk, acknowledgments
- -Simple file ops — read, list, basic writes
- -One-liner tasks — anything answerable in 1-2 sentences
NEVER do these on Haiku
- -❌ Write code longer than 10 lines
- -❌ Create comparison tables
- -❌ Write more than 3 paragraphs
- -❌ Do multi-step analysis
- -❌ Write reports or proposals
---
💛 SONNET — Standard Work (The Workhorse)
Escalate to Sonnet for:Code & Technical
- -Code generation — write functions, build features, scripts
- -Code review — PR reviews, quality checks
- -Debugging — standard bug investigation
- -Documentation — README, comments, user guides
Analysis & Planning
- -Analysis & evaluation — compare options, assess trade-offs
- -Planning — project plans, roadmaps, task breakdowns
- -Research synthesis — combining multiple sources
- -Multi-step reasoning — "first... then... finally"
Writing & Content
- -Long-form writing — reports, proposals, articles (>3 paragraphs)
- -Creative writing — blog posts, descriptions, copy
- -Summarization — long documents, transcripts
- -Structured output — tables, outlines, formatted docs
---
❤️ OPUS — Complex Reasoning Only
Escalate to Opus for:Architecture & Design
- -System architecture decisions
- -Major codebase refactoring
- -Design pattern selection with tradeoffs
- -Database schema design
Deep Analysis
- -Complex debugging (multi-file, race conditions)
- -Security reviews
- -Performance optimization strategy
- -Root cause analysis of subtle bugs
Strategic & Creative
- -Strategic planning — business decisions, roadmaps
- -Nuanced judgment — ethics, ambiguity, competing values
- -Deep research — comprehensive multi-source analysis
---
🔄 Implementation
For Subagents
\\\javascript
// Routine monitoring
sessions_spawn(task="Check backup status", model="haiku")
// Standard code work
sessions_spawn(task="Build the REST API endpoint", model="sonnet")
// Architecture decisions
sessions_spawn(task="Design the database schema for multi-tenancy", model="opus")
\\\
For Cron Jobs
\\\json
{
"payload": {
"kind": "agentTurn",
"model": "haiku"
}
}
\\\
Always use Haiku for cron unless the task genuinely needs reasoning.
---
📊 Quick Decision Tree
\\\
Is it a greeting, lookup, status check, or 1-2 sentence answer?
YES → HAIKU
NO ↓
Is it code, analysis, planning, writing, or multi-step?
YES → SONNET
NO ↓
Is it architecture, deep reasoning, or critical decision?
YES → OPUS
NO → Default to SONNET, escalate if struggling
\\\
---
📋 Quick Reference Card
\\\
┌─────────────────────────────────────────────────────────────┐
│ SMART MODEL SWITCHING │
│ Haiku → Sonnet → Opus │
├─────────────────────────────────────────────────────────────┤
│ 💚 HAIKU (cheapest) │
│ • Greetings, status checks, quick lookups │
│ • Factual Q&A, definitions, reminders │
│ • Simple file ops, 1-2 sentence answers │
├─────────────────────────────────────────────────────────────┤
│ 💛 SONNET (standard) │
│ • Code > 10 lines, debugging │
│ • Analysis, comparisons, planning │
│ • Reports, proposals, long writing │
├─────────────────────────────────────────────────────────────┤
│ ❤️ OPUS (complex) │
│ • Architecture decisions │
│ • Complex debugging, multi-file refactoring │
│ • Strategic planning, deep research │
├─────────────────────────────────────────────────────────────┤
│ 💡 RULE: If a human needs > 30 sec thinking → escalate │
│ 💰 COST: Haiku 1x → Sonnet 12x → Opus 60x │
└─────────────────────────────────────────────────────────────┘
\\\
---
*Built for Claude-only setups with Haiku, Sonnet, and Opus.*
*Inspired by save-money skill, extended with three-tier progression.*
Launch an agent with Smart Model Switching on Termo.