Troubleshooting

Your Agent Said It Did Something… But Nothing Changed?

A fast way to diagnose when an agent sounds confident but the action never actually happened.

This is a trust-breaker that shows up often: the agent says “done,” but no file was created, no message was sent, or no change appears in your workspace.

Important: polished language is not proof of execution. Always verify outcomes, not wording.

2-minute reality check

Ask the agent to show specific evidence (path, command output, commit hash, or screenshot).
Confirm the result yourself in the target place (file exists, message posted, task completed).
If there is no evidence, rerun with a deterministic prompt (template below).

Common reasons this happens

Planning language confusion: “I’ll do X” was interpreted as completion.
Wrong workspace/path: action happened somewhere else.
Permission/approval stop: the tool call was blocked and never executed.
Context drift: the agent responded from chat context without re-checking current files.
Surface mismatch: output landed in a different channel/thread/session.

High-friction case: “Approval Required” looked like completion

One of the most common false-"done" scenarios is an approval pause. The agent may describe the intended action, but execution is waiting for you to approve the exact command.

Check whether the run shows an Approval Required step.
Approve the exact full command (including &&, pipes, semicolons, or multiline script content).
After approval, ask for fresh execution evidence (not the pre-approval plan).

Gotcha: approving a partial command often causes repeat loops. Approve the complete command string exactly as shown, then verify output again.

Use this “show your work” prompt

Do the task now, then return ONLY: 1) What changed 2) Exact evidence (file path, command output, or link) 3) A one-line verification step I can run

For file edits: require before/after proof

Request the exact file path.
Ask for the changed section (or line range) before and after.
Ask for one verification command, e.g. grep or git diff -- file.

For external actions: require destination proof

GitHub: PR/commit URL.
Discord: channel/thread reference and message confirmation.
Cron/reminders: job ID and schedule details.

Best practice: ask for “done + evidence + verification” every time you care about reliability. This reduces support loops dramatically.

If it “thinks forever” and then does nothing

Another common pattern: the agent shows thinking for several minutes, then no visible action lands anywhere.

Run a tiny canary task first: "Create a file named canary.txt with one line: test".
If the canary fails, treat this as an execution-layer issue (not a prompt-quality issue).
Start a fresh session and rerun the same tiny canary task.
If it still fails, restart the gateway/runtime and retry once.

Do not keep sending bigger prompts while tiny tasks are failing. You will only add confusion and context noise.

Quick reset prompt (when runs stall)

Before doing anything else: 1) Confirm your current workspace path. 2) Do this tiny task now: create canary.txt containing exactly "test". 3) Return proof: absolute file path + first line of file. If this fails, stop and report the exact failure point.

Escalation packet for #help

Execution mismatch report Task requested: What the agent claimed: Expected result location: Actual result: Evidence provided by agent (if any): Your verification step + result: