Autonomous Agents

Build features, fix bugs, and refactor code with AI agents that work autonomously.

Overview

Autonomous agents go beyond chat—they take action. Give them a task, and they’ll plan, code, test, and iterate until it’s done. You stay in control with approval checkpoints and configurable permissions.

One-Shot Tasks

The fastest way to build something from scratch.

How It Works

Describe your goal - “Build a REST API for user management with Express.js and MongoDB”
Set completion criteria - Define what “done” looks like:
- All tests pass
- API documentation generated
- Docker compose file included
Configure execution:
- Sandbox: Local (fastest) or Isolated (secure)
- Provider: Anthropic, OpenAI, etc.
- Model: Claude, GPT-4o, etc.
- Backend: Auto, TurboLight, DeepCode
- Iterations: 10 (quick) to 100 (thorough)
Start the agent - Watch it work or come back later

Example Tasks

Build a feature:

Build a user authentication system with:
- JWT tokens
- Password hashing
- Login/logout endpoints
- Protected route middleware

Refactor code:

Refactor the payment module to use the strategy pattern.
Extract each payment provider into its own class.
Add comprehensive tests.

Fix a bug:

The checkout flow fails when users have items
with special characters in the name.
Find and fix the root cause.
Add regression tests.

Agent Backends

Different backends optimize for different scenarios:

Backend	Best For
TurboLight	Fast iterations, simple tasks
AutoAgent	Balanced performance
DeepCode	Complex reasoning, large codebases
FastAgent	Quick prototyping

AI Councils

When you need multiple perspectives, convene a council.

How Councils Work

You pose a question or task
Multiple AI agents discuss it
They deliberate and debate
They reach consensus (or report disagreement)
You get a recommendation with reasoning

Preset Councils

Technical Review (4 members)

Architect, Security, Performance, Quality
Multi-perspective code review
Catches issues a single reviewer might miss

Quick Review (1 member)

Fast feedback for small changes
Good for typos, minor refactors

Architecture Decision (3 members)

System design choices
“Should we use microservices or monolith?”

Product Council (3 members)

Feature prioritization
Trade-off analysis

Model Showdown (3 models)

Compare responses from different AI models
Find the best model for your task

Code Duel (2 members)

Two agents implement the same feature
Compare approaches

Supervised Team (3 members)

Senior model oversees junior models
Mentored development with quality control

Review Board (2 members)

Thorough code review
Detailed feedback

Council Configuration

Setting	Options	Description
Deliberation Mode	Collaborative, Overseer, Debate	How agents interact
Max Rounds	1-10	Discussion cycles before deciding
Consensus Threshold	50-100%	Agreement percentage required
Human Approval	On/Off	Require your sign-off

Creating Custom Councils

Click + in the Councils panel
Set council name and description
Add members from:
- Native LLM - Raw model access
- Agent Engine - Tool-using agents
- Amplifier Agents - Specialized experts
- CLI Agent - External AI CLIs
Configure deliberation settings
Save as template for reuse

Agent Settings

Fine-tune how agents behave across all tasks.

General Settings

Setting	Description
Default Backend	Which agent engine to use
Permission Profile	Strict, Balanced, Permissive
Sandbox Type	Local (fast) or Isolated (secure)
Max Iterations	Steps before stopping (0 = unlimited)
Timeout	Maximum minutes per task

Behavior Settings

Setting	Description
Auto-approve Safe Actions	Skip approval for low-risk operations
Allow Sub-agents	Let agents spawn helper agents

Permission Profiles

Strict

Approval required for all file changes
No command execution without confirmation
Best for: Production code, sensitive projects

Balanced (Recommended)

Auto-approve read operations
Approve writes and commands
Best for: Most development work

Permissive

Auto-approve most safe operations
Only confirm destructive actions
Best for: Prototyping, personal projects

Background Agents

Run agents in the background while you work.

Starting Background Tasks

Configure your task as usual
Enable “Run in Background”
Continue working on other things
Get notified when complete

Managing Background Agents

View running agents in the status bar
Check progress anytime
Cancel if needed
Review results when done

Best Practices

Writing Good Task Descriptions

Be specific:

Good: "Add pagination to the /users endpoint with
      page and limit query params, default 20 per page"

Vague: "Add pagination"

Include context:

Good: "This is a Next.js 14 app using Prisma.
      Add a dark mode toggle to the settings page."

Missing context: "Add dark mode"

Define success:

Good: "Refactor auth to use refresh tokens.
      Criteria: All existing tests pass,
      new tests for refresh flow, docs updated."

No criteria: "Refactor auth"

Choosing Iterations

Iterations	Use When
10-20	Simple, well-defined tasks
30-50	Medium complexity, some exploration
50-100	Complex features, thorough testing
100+	Large refactors, unknown scope

When to Use Councils

Architecture decisions - Get multiple perspectives
Code review - Catch more issues
Trade-off analysis - Explore options
Learning - See different approaches

Keyboard Shortcuts Sharing