Sonnet 4.6 was still hitting max_tokens on ambitious prompts like "WorldWeather MCP for any location" because the implementation bodies ballooned with defensive scaffolding.

Two changes:

1. SYSTEM_PROMPT now imposes hard limits the model can self-enforce:
   - at most 6 tools (combine related capabilities with a mode param)
   - implementation body <= 40 lines, no comments, no overengineering
   - descriptions <= 100 chars

   These keep a typical preview under ~7k output tokens.

2. team/enterprise maxTokens 8192 -> 12288. At ~130 tok/s that fits in ~94s, still under Cloudflare's 100s edge cap.

Hobby (GLM) and pro (Haiku) keep their existing limits; they were not hitting the ceiling.

SpecTruncatedError still fires + surfaces 422 spec_too_large when even 12288 isn't enough, so the user gets actionable feedback instead of an opaque zod error.
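The budget arithmetic and the truncation path described in this commit can be sketched roughly as follows. This is an illustrative sketch, not the repository's actual code: only `SpecTruncatedError`, the `spec_too_large` code, the 422 status, and the numeric figures come from the commit message. The helper names (`worstCaseSeconds`, `assertNotTruncated`, `toHttpError`) and the `stopReason` string are assumptions made for the example.

```typescript
// Figures quoted in the commit message.
const EDGE_CAP_SECONDS = 100;   // Cloudflare edge cap
const TOKENS_PER_SECOND = 130;  // approximate generation throughput
const TEAM_MAX_TOKENS = 12288;  // new team/enterprise ceiling

// Hypothetical helper: worst-case generation time if the model
// uses the entire output budget.
function worstCaseSeconds(
  maxTokens: number,
  tokensPerSecond: number = TOKENS_PER_SECOND,
): number {
  return maxTokens / tokensPerSecond;
}

// The class name comes from the commit; its shape here is assumed.
class SpecTruncatedError extends Error {
  readonly maxTokens: number;

  constructor(maxTokens: number) {
    super("spec truncated at " + maxTokens + " output tokens");
    this.name = "SpecTruncatedError";
    this.maxTokens = maxTokens;
  }
}

// Hypothetical guard: assumes the provider reports "max_tokens" as the
// stop reason when the output budget is exhausted.
function assertNotTruncated(stopReason: string, maxTokens: number): void {
  if (stopReason === "max_tokens") {
    throw new SpecTruncatedError(maxTokens);
  }
}

// Hypothetical mapping from the error to the HTTP response described in
// the commit. Returns null for errors this sketch does not handle.
function toHttpError(err: unknown): { status: number; code: string } | null {
  if (err instanceof Error && err.name === "SpecTruncatedError") {
    return { status: 422, code: "spec_too_large" };
  }
  return null;
}
```

Checking the name rather than relying on `instanceof SpecTruncatedError` keeps the sketch working even when the class is compiled to an older JavaScript target, where subclasses of `Error` lose their prototype chain.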
||
|---|---|---|
| auth | ||
| db | ||
| llm | ||
| types | ||