Concepts

API Keys & Models

Choosing your agent's brain — what models are, what they cost, and how to set them up.

⏱ 5 minute read

What Are Models and Why Do They Matter?

Your agent needs an AI "brain" to think with — that's the model. When your agent reads a message and writes a reply, it's the model doing the heavy lifting.

Different models have different strengths. Some are faster, some are smarter, some are cheaper. Think of it like choosing between a sports car, an SUV, and a commuter bike — they all get you there, but the experience (and the price) is different.

The good news: heyron lets you switch models anytime. You're never locked in. Try one, and if it doesn't fit, swap it in seconds.

Do I need an API key?

Not to get started — your heyron subscription includes AI model usage. You can chat with your agent right away without any API key. However, if you want to use premium models heavily (like GPT-5.4 or Claude Opus), or if you run into usage limits, adding your own API key from the provider gives you more capacity. Think of it like this: the subscription gets you in the door, and your own API key is the upgrade if you need more horsepower.


The Main Providers

OpenAI

GPT-5.4 · GPT-4o-mini · GPT-5.3-codex

Best for: General tasks, coding, creative writing

GPT-5.4 — flagship, great value GPT-4o-mini — fast & cheap GPT-5.3-codex — best for coding

GPT-5.4 is the current flagship — performs on par with Sonnet and Opus for many tasks at a better price point. GPT-4o-mini is the budget option that still punches above its weight. GPT-5.3-codex is purpose-built for coding. Note: heavy use of premium models may require adding your own OpenAI API key.

Get API key → platform.openai.com

Anthropic

Claude Sonnet · Claude Opus

Best for: Long documents, careful reasoning, following complex instructions

Claude Sonnet — excellent all-rounder Claude Opus — most capable, premium price

Claude models are known for being thorough and precise. If your agent handles long documents or nuanced requests, Claude shines.

Get API key → console.anthropic.com

Google

Gemini 2.5 Flash · Gemini 2.5 Pro

Best for: Research, analysis, multimodal (images + text)

Gemini 2.5 Flash — fast & affordable Gemini 2.5 Pro — very capable

Google's Gemini models handle text and images natively. Great value, especially the Flash tier for everyday use.

Get API key → aistudio.google.com

OpenRouter

Many models · One API key

Best for: Experimenting, accessing free/cheap models, variety

One key, many providers Some models are free

OpenRouter is a gateway to dozens of models from different providers. Great if you want to try things without committing to one provider. Some models are free or nearly free.

Get API key → openrouter.ai

How to Set Up Your API Key

1

Create a Provider Account

Go to the provider's website (links above) and sign up. Most offer free credits to start with.

2

Generate an API Key

Find the API keys section in your provider's dashboard and create a new key. It'll look something like sk-proj-abc123...xyz — a long string of characters.

3

Add It to heyron

Go to your heyron dashboard and configure the model in your agent's settings. Or just tell your agent:

"Set my OpenAI API key to sk-proj-..."

⚠️ Security

Never share your API key publicly. Never paste it in Discord, a public channel, or a GitHub repo. Treat it like a password. If you accidentally leak it, revoke it immediately on the provider's site and generate a new one.


What Do Models Cost?

Models charge per token — roughly 1 token ≈ 0.75 words. A typical back-and-forth conversation uses 1,000–5,000 tokens.

💬 Casual use (chatting, quick questions) — $5–20/month

💻 Heavy use (coding, long documents, frequent conversations) — $20–50/month

🧪 Experimenting (free models via OpenRouter) — $0

Set spending limits!

All major providers let you set a monthly spending cap so you don't get surprised. Do this before you start using the API.
OpenAI: Settings → Limits · Anthropic: Settings → Spend Limits · Google: Billing → Budgets

💡 Pro tip

Start with a cheaper model like GPT-4o-mini or Gemini Flash. They're surprisingly good for most tasks. When you're ready to upgrade, GPT-5.4 is a great value sweet spot before jumping to Opus.


How to Switch Models

Switching is instant. You have two options:

Talk to your agent: Just say "Switch to Claude Sonnet" or "Use GPT-5.4". Your agent will handle the rest.

Use the dashboard: Go to your agent's settings and change the model in the Model section.

You can switch anytime — even mid-conversation. Different tasks might work better with different models, so don't be afraid to experiment.


Which Model Should I Pick?

🚀
"I just want it to work"
GPT-4o-mini or Gemini Flash — cheap, fast, good enough for most things
🏆
"I need the best quality"
GPT-5.4 or Claude Opus — premium models, premium results (may need your own API key for heavy use)
💰
"I'm on a tight budget"
Gemini Flash or OpenRouter free models — great quality at $0–5/month
👨‍💻
"I do a lot of coding"
GPT-5.3-codex or Claude Sonnet — purpose-built for code generation and review
📄
"I work with long documents"
Claude Sonnet — great context window, handles long inputs gracefully

Quick Troubleshooting (Billing & Access)

Seeing errors? Start here.

Most model errors are account limits, missing provider credits, or key/provider mismatch. Use this checklist before opening a support ticket.

"Insufficient credits" / "quota exceeded"

Add funds (or free credits) in your model provider account and confirm a spending limit is set above $0. If you're using your own API key, this error comes from the provider account attached to that key.

"Invalid API key"

Regenerate the key in the provider dashboard, then re-add it in heyron. Watch for copy/paste issues (leading/trailing spaces are common). Make sure the key matches the provider (OpenAI key with OpenAI model, Anthropic key with Claude, etc.).

Model works for a while, then starts failing

You likely hit rate limits or monthly caps. Either lower traffic, switch to a cheaper/faster model for routine tasks, or raise your provider spend/rate limits. Keep a fallback model configured so your agent stays responsive.

"Do I need an API key or not?"

For most users: no, not at first. heyron subscription usage covers getting started. Add your own key when you need higher-volume premium usage, custom provider routing, or you're hitting included-usage limits.

"I switched models but it still feels the same"

This is usually a context/session issue, not a broken model switch. Start a fresh chat/session, then ask the agent to confirm the active model in one line (for example: "Confirm active model only"). If needed, switch once via dashboard settings, then run one clean test prompt. Old context can make two different models sound similar on routine tasks.

"Why can't the agent fully automate Midjourney or similar tools?"

Some external tools still require manual confirmation/selection steps in their own UI. Your agent can draft prompts and workflows, but image picking/upscaling may need a human click unless you use an API-native pipeline. This is expected behavior, not a model failure.


What's Next?

Write Your First SOUL.md
Give your agent a personality, instructions, and purpose.
Connect to Discord
Talk to your agent from any Discord server.
How Memory Works
Understand how your agent remembers (and forgets) things.