Troubleshooting

Agent Burning Too Many Tokens?

Fix runaway token usage with a fast proof-first workflow: smaller turns, tighter outputs, and no-repeat context handoffs.

Community pattern: users hit limits fast, assume billing is broken, and keep retrying giant prompts. Most of the time, this is a prompt-shape and workflow issue—not a model bug.

Reality check: one oversized turn can cost more than ten small turns. The fastest way to save tokens is to force short, staged responses.

2-minute diagnosis checklist

  1. Are you pasting huge chat history repeatedly?
  2. Are you asking for giant outputs in one message ("full strategy", "all code", "everything")?
  3. Are you retrying the same failed prompt without reducing scope?
  4. Are you mixing multiple tasks in one request?
  5. Did you ask the agent to be concise and return only final output?

Known-good low-token prompt pattern

Token-aware mode ON. Goal: [one concrete task only] Context (max 5 bullets): - ... - ... Constraints: - Keep response under 180 words - No markdown tables - Ask at most one clarifying question (only if blocking) - If not blocked, execute immediately Output format: 1) Result 2) Evidence (paths/links/commands used) 3) Next smallest step

High-friction mistakes that spike token burn

Advanced edge case: external-tool retry spiral

Recurring pattern from community chat: users try to connect external generators (like Midjourney/social tools), then keep retrying the same broad prompt while setup or capability is still unresolved.

Canary check (copy/paste): 1) In one sentence, confirm whether you can directly perform [specific external action] in this session. 2) If NO, return only: - exact manual handoff step I should do - exact output artifact you can still produce now 3) Keep response under 120 words.

Do not continue with long production prompts until the canary returns a clear capability answer.

Loop-break rule: after two failed retries, stop. Split into smaller steps or start a fresh session with a compact handoff instead of trying the same giant prompt again.

Compact handoff when starting fresh

New session handoff: - Objective: [single outcome] - Completed so far: [3 bullets max] - Current blocker: [1 sentence] - Required output: [exact format] - Hard limits: <= 180 words, no tables, no repeated recap

When to escalate to #help

Bottom line: small scoped turns + hard output limits + compact handoffs solve most token-burn loops quickly.