Chat alongside any tab. Hand off tasks. Watch it click, fill, read, and reason. Bring your own model key, run one on-device, or mix and match per agent.
Egg’s agent isn’t a chatbot wrapped around your browser. It runs in the Egg Gateway daemon, calls tools, executes plans, and streams output back into the chat.
Open the agent panel from the sidebar or the kebab menu. It runs alongside any tab and stays open across navigation.
Token-by-token streaming for chat. Tool calls render with collapsed/expanded views — you see what the agent did, not just what it said.
For multi-step tasks, the agent writes a plan and pauses for your approval before executing. You can edit or reject any step.
The agent calls a tool, reads the result, decides the next step. The loop runs until the task is done or stuck.
T1/T2/T3 conversation compression near context limits. You can run for hours without losing the early thread.
Conversations survive restarts. Reopen the panel and pick up where you left off.
Skills are multi-file folders that bundle instructions, examples, and tools for a specific domain. Toggle which skills each agent has. Add your own as a folder.
Open URLs, extract content, follow links, build a synthesis. Falls back to search when starting from a topic instead of a URL.
40 built-in tools across Gmail, Calendar, Drive, Docs, Sheets. Authorize once per profile.
Run Python, JavaScript, or shell snippets in a remote sandbox. The agent reads stdout/stderr like any tool.
FFmpeg-driven video and audio operations — trim, transcode, extract frames, concatenate. Local execution, no upload.
Read, write, and search files inside configured zones. The firewall enforces zone boundaries.
Write Markdown, code, or structured documents straight to a workspace. The agent picks the right artifact type for the task.
Built-in skill that constrains how the agent reasons — check before acting, summarize before continuing, verify before claiming success.
Generate flowcharts, sequence diagrams, and architecture diagrams as embedded mermaid blocks.
Drop a folder with a SKILL.md and tool definitions into the skills directory. Egg picks it up automatically. Three types: general, intent-routed, site-scoped.
What powers the agent is your call — per agent, per task, per minute. Bring keys. Run local. Swap providers. Egg doesn’t hold the wallet.
Anthropic, OpenAI, Google, OpenRouter. Direct API calls from the daemon — no proxy, no relay, no extra hop.
Run Llama, Mistral, Qwen, DeepSeek, or any Ollama-compatible model on your hardware. Zero cloud dependency.
Use Claude for the Primary, GPT-4 for code, a local Qwen for a research agent — all in the same conversation.
Override per turn. "Use cheap" or "use big" without leaving chat.
When a task needs vision, Egg automatically uses a vision-capable variant of the agent’s model. No manual config.
Per-agent token budgets, daily usage tracking, alerts when an agent burns through its allowance.
One key, every model. Switch on Tuesday. Change your mind on Wednesday.
Most agents fake the web through scraping APIs and language models pretending to know HTML. Egg runs the agent inside the actual browser, using the same DOM you see.
Click, type, scroll, hover, snapshot, navigate. Each runs over Chrome DevTools Protocol against the real tab.
Low-level CDP commands → mid-level page primitives (find, fill_form, get_visible_text) → high-level skill verbs the agent uses.
The agent takes screenshots when it’s uncertain. Vision-capable models inspect the result before continuing.
The agent uses your real cookies and sessions. No bot fingerprints, no captchas the agent can’t solve.
Watch what the agent does. Replay any sequence. Export as a script for the demo system.
Persistent facts, preferences, context. The agent remembers across conversations, across days, across sessions.
Facts and preferences saved automatically when relevant, surfaced when useful. Editable from Settings.
Ctrl+M (Cmd+M on Mac) on any page, selection, or chat turn commits it to memory.
System agents (Primary, Studio, Cheap) plus user-created agents. Each has a personality, model assignments, skills, and memory.
Constrain how much each agent can spend, per day. Memory and persona are independent of budget.
The Primary agent maintains a curiosity queue — topics it’s thinking about between conversations. Ask "what are you thinking about?" to see.
The agent runs untrusted code in remote sandboxes — not on your machine. Local code execution is disabled by default.
Python, Node, shell. Full filesystem inside the sandbox, network egress, package install. Sandbox spins up per task and tears down after.
The local code execution skill is disabled out of the box. Enable it explicitly per agent if you want it.
When local execution is enabled (Windows), child processes run in a Job Object so they can’t escape or persist.
stdout, stderr, file writes — all stream back into the chat as they happen.
MCP for community tools. Built-in MCP servers for the systems people use most. AG-UI compatibility for the agent UI standard.
Generic Model Context Protocol client. Tool naming convention mcp__{server}_{tool}. Add a server, get its tools.
Gmail, Calendar, Drive, Docs, Sheets, Forms, Slides — 40 tools, no setup.
Issues, PRs, code search, file fetch. Authorize once.
Egg’s chat surface speaks the AG-UI streaming standard, so external tools and dashboards can render the same conversations.
The browser exposes the new window.ai.* APIs (Prompt, Summarizer, Translator, Writer, Rewriter) backed by your configured model.
Agent execution doesn’t live in the Tauri app. It lives in a separate Rust daemon that the app launches and talks to over HTTP.
Standalone Rust process. The app spawns it on launch and re-attaches on restart. Single source of truth for agent state.
Long-running tasks (research, monitors, watchdogs) keep going even when the app window is closed.
Tool execution, skill loading, streaming, context compaction — all in one place. The Tauri app is a UI shell.
Local-only HTTP API with bearer auth. External tools (terminals, dashboards) can drive the agent the same way the app does.
Free to download. Bring your own API key.
Download for WindowsAlso available for macOS