User Guides

These guides cover common patterns and best practices for working with SandBase in production.

Available Guides

Guide	Description
Model Routing	How SandBase routes requests across providers — fallback chains, priority-based routing, and capability-aware selection
Streaming	SSE protocol details, OpenAI vs Anthropic streaming formats, and client-side handling
Error Handling	Retry strategies, exponential backoff, circuit breaker patterns, and idempotency
Rate Limiting	Understanding rate limit headers, handling 429 responses, and high-throughput best practices
Billing & Pricing	Credit system, top-up flow, per-model pricing, usage tracking, and cost optimization

Use streaming for chat interfaces: the first tokens render as soon as they are generated, which cuts perceived latency.
Implement retries with backoff: transient errors from upstream providers are normal and usually clear on a later attempt.
Monitor your usage: the dashboard shows real-time token consumption and costs.
Choose models wisely: smaller models (GPT-4o Mini, Claude 3.5 Haiku) are 10-50x cheaper for simple tasks.
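
As an illustration of the retry tip, here is a minimal sketch of exponential backoff with full jitter. It is not SandBase's client code: `TransientError`, `backoff_delays`, and `call_with_retries` are hypothetical names introduced for this example, and the default values (4 retries, 0.5 s base, 30 s cap) are assumptions to tune for your workload. Wrap whatever function issues your request and raise `TransientError` for the failures you consider retryable, such as a 429 or a 5xx response.

```python
import random
import time


class TransientError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. a 429 or 5xx)."""


def backoff_delays(max_retries, base=0.5, cap=30.0):
    """Return the delay ceiling before each retry: base * 2**attempt, capped."""
    return [min(cap, base * 2 ** attempt) for attempt in range(max_retries)]


def call_with_retries(fn, max_retries=4, base=0.5, cap=30.0, sleep=time.sleep):
    """Call fn(); on TransientError, wait with full jitter and try again."""
    delays = backoff_delays(max_retries, base, cap)
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except TransientError:
            if attempt == max_retries:
                raise  # out of retries: surface the last error to the caller
            # Full jitter: sleep a random time up to the exponential ceiling,
            # so many clients failing together do not retry in lockstep.
            sleep(random.uniform(0, delays[attempt]))
```

Only retry requests that are safe to repeat. The `sleep` parameter exists so the wait can be swapped out in tests.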