Autonomous Agents
Build features, fix bugs, and refactor code with AI agents that work autonomously.
Overview
Autonomous agents go beyond chat—they take action. Give them a task, and they’ll plan, code, test, and iterate until it’s done. You stay in control with approval checkpoints and configurable permissions.
One-Shot Tasks
The fastest way to build something from scratch.
How It Works
- Describe your goal - “Build a REST API for user management with Express.js and MongoDB”
- Set completion criteria - Define what “done” looks like:
- All tests pass
- API documentation generated
- Docker compose file included
- Configure execution:
- Sandbox: Local (fastest) or Isolated (secure)
- Provider: Anthropic, OpenAI, etc.
- Model: Claude, GPT-4o, etc.
- Backend: Auto, TurboLight, DeepCode
- Iterations: 10 (quick) to 100 (thorough)
- Start the agent - Watch it work or come back later
Example Tasks
Build a feature:
Build a user authentication system with:
- JWT tokens
- Password hashing
- Login/logout endpoints
- Protected route middlewareRefactor code:
Refactor the payment module to use the strategy pattern.
Extract each payment provider into its own class.
Add comprehensive tests.Fix a bug:
The checkout flow fails when users have items
with special characters in the name.
Find and fix the root cause.
Add regression tests.Agent Backends
Different backends optimize for different scenarios:
| Backend | Best For |
|---|---|
| TurboLight | Fast iterations, simple tasks |
| AutoAgent | Balanced performance |
| DeepCode | Complex reasoning, large codebases |
| FastAgent | Quick prototyping |
AI Councils
When you need multiple perspectives, convene a council.
How Councils Work
- You pose a question or task
- Multiple AI agents discuss it
- They deliberate and debate
- They reach consensus (or report disagreement)
- You get a recommendation with reasoning
Preset Councils
Technical Review (4 members)
- Architect, Security, Performance, Quality
- Multi-perspective code review
- Catches issues a single reviewer might miss
Quick Review (1 member)
- Fast feedback for small changes
- Good for typos, minor refactors
Architecture Decision (3 members)
- System design choices
- “Should we use microservices or monolith?”
Product Council (3 members)
- Feature prioritization
- Trade-off analysis
Model Showdown (3 models)
- Compare responses from different AI models
- Find the best model for your task
Code Duel (2 members)
- Two agents implement the same feature
- Compare approaches
Supervised Team (3 members)
- Senior model oversees junior models
- Mentored development with quality control
Review Board (2 members)
- Thorough code review
- Detailed feedback
Council Configuration
| Setting | Options | Description |
|---|---|---|
| Deliberation Mode | Collaborative, Overseer, Debate | How agents interact |
| Max Rounds | 1-10 | Discussion cycles before deciding |
| Consensus Threshold | 50-100% | Agreement percentage required |
| Human Approval | On/Off | Require your sign-off |
Creating Custom Councils
- Click + in the Councils panel
- Set council name and description
- Add members from:
- Native LLM - Raw model access
- Agent Engine - Tool-using agents
- Amplifier Agents - Specialized experts
- CLI Agent - External AI CLIs
- Configure deliberation settings
- Save as template for reuse
Agent Settings
Fine-tune how agents behave across all tasks.
General Settings
| Setting | Description |
|---|---|
| Default Backend | Which agent engine to use |
| Permission Profile | Strict, Balanced, Permissive |
| Sandbox Type | Local (fast) or Isolated (secure) |
| Max Iterations | Steps before stopping (0 = unlimited) |
| Timeout | Maximum minutes per task |
Behavior Settings
| Setting | Description |
|---|---|
| Auto-approve Safe Actions | Skip approval for low-risk operations |
| Allow Sub-agents | Let agents spawn helper agents |
Permission Profiles
Strict
- Approval required for all file changes
- No command execution without confirmation
- Best for: Production code, sensitive projects
Balanced (Recommended)
- Auto-approve read operations
- Approve writes and commands
- Best for: Most development work
Permissive
- Auto-approve most safe operations
- Only confirm destructive actions
- Best for: Prototyping, personal projects
Background Agents
Run agents in the background while you work.
Starting Background Tasks
- Configure your task as usual
- Enable “Run in Background”
- Continue working on other things
- Get notified when complete
Managing Background Agents
- View running agents in the status bar
- Check progress anytime
- Cancel if needed
- Review results when done
Best Practices
Writing Good Task Descriptions
Be specific:
Good: "Add pagination to the /users endpoint with
page and limit query params, default 20 per page"
Vague: "Add pagination"Include context:
Good: "This is a Next.js 14 app using Prisma.
Add a dark mode toggle to the settings page."
Missing context: "Add dark mode"Define success:
Good: "Refactor auth to use refresh tokens.
Criteria: All existing tests pass,
new tests for refresh flow, docs updated."
No criteria: "Refactor auth"Choosing Iterations
| Iterations | Use When |
|---|---|
| 10-20 | Simple, well-defined tasks |
| 30-50 | Medium complexity, some exploration |
| 50-100 | Complex features, thorough testing |
| 100+ | Large refactors, unknown scope |
When to Use Councils
- Architecture decisions - Get multiple perspectives
- Code review - Catch more issues
- Trade-off analysis - Explore options
- Learning - See different approaches