Troubleshooting

Your Agent Said It Did Something… But Nothing Changed?

A fast way to diagnose when an agent sounds confident but the action never actually happened.

This is a trust-breaker that shows up often: the agent says “done,” but no file was created, no message was sent, or no change appears in your workspace.

Important: polished language is not proof of execution. Always verify outcomes, not wording.

2-minute reality check

  1. Ask the agent to show specific evidence (path, command output, commit hash, or screenshot).
  2. Confirm the result yourself in the target place (file exists, message posted, task completed).
  3. If there is no evidence, rerun with a deterministic prompt (template below).

Common reasons this happens

High-friction case: “Approval Required” looked like completion

One of the most common false-"done" scenarios is an approval pause. The agent may describe the intended action, but execution is waiting for you to approve the exact command.

  1. Check whether the run shows an Approval Required step.
  2. Approve the exact full command (including &&, pipes, semicolons, or multiline script content).
  3. After approval, ask for fresh execution evidence (not the pre-approval plan).

Gotcha: approving a partial command often causes repeat loops. Approve the complete command string exactly as shown, then verify output again.

Use this “show your work” prompt

Do the task now, then return ONLY: 1) What changed 2) Exact evidence (file path, command output, or link) 3) A one-line verification step I can run

For file edits: require before/after proof

For external actions: require destination proof

Best practice: ask for “done + evidence + verification” every time you care about reliability. This reduces support loops dramatically.

Escalation packet for #help

Execution mismatch report Task requested: What the agent claimed: Expected result location: Actual result: Evidence provided by agent (if any): Your verification step + result: