Everything you need to build software with AI agents — from first build to advanced orchestration.
git clone https://github.com/herakles-dev/nova-forge.git
cd nova-forge
./setup.sh # Creates venv + installs deps
source .venv/bin/activate
Nova Forge needs API keys for at least one provider. Amazon Bedrock is recommended:
export AWS_ACCESS_KEY_ID="your-key"
export AWS_SECRET_ACCESS_KEY="your-secret"
export AWS_DEFAULT_REGION="us-east-1"
Or use OpenRouter (OPENROUTER_API_KEY) for Gemini models, or Anthropic (ANTHROPIC_API_KEY) for Claude models.
You can also run /login inside Nova Forge for an interactive setup wizard.
python3 forge_cli.py
On first run, Nova Forge asks your experience level (Beginner / Intermediate / Expert) and sets an appropriate autonomy level automatically.
Just describe what you want:
> Build me an expense tracker with categories and charts
Nova Forge detects your intent and runs the full pipeline automatically:
The planner generates spec.md (project specification) and tasks.json (dependency-ordered task list). Tasks are sorted into parallel waves using topological sort.
If the decomposer fails, a 3-stage fallback guarantees tasks are always produced: retry → JSON recovery → single fallback task.
Each task is assigned to a ForgeAgent — an autonomous tool-use loop calling Amazon Nova via AWS Bedrock.
| Tool | Purpose |
|---|---|
| read_file | Read any project file |
| write_file | Create or overwrite a file |
| edit_file | Search-and-replace within a file |
| bash | Run shell commands |
| glob_files | Find files by pattern |
| grep | Search file contents |
| append_file | Add to end of file |
| list_directory | List directory contents |
| search_replace_all | Bulk find-and-replace |
| think | Internal reasoning (no side effects) |
| claim_file | Claim ownership (multi-agent) |
| remember | Store a note in persistent memory |
| check_context | Check remaining context budget |
Safety: Circuit breaker (3 failures → tool disabled), convergence tracker (5 idle turns → writes disabled), verify phase (agent reads back files to check correctness), adaptive turn budgets.
An adversarial read-only reviewer inspects output: PASS, CONDITIONAL, or FAIL.
Auto-detects your stack (14 supported), starts a dev server, creates a Cloudflare Tunnel for a shareable URL.
Docker + nginx + SSL deployment: auto-generates Dockerfile, builds container, writes reverse-proxy config, health-checks.
| Model | Alias | Context | Cost/1K | Best For |
|---|---|---|---|---|
| Nova Lite | nova-lite | 32K | $0.00006 | Fast prototypes, simple apps |
| Nova Pro | nova-pro | 300K | $0.0008 | Complex features, multi-file |
| Nova Premier | nova-premier | 1M | $0.002 | Deep reasoning, architecture |
| Model | Alias | Context | Provider |
|---|---|---|---|
| Gemini Flash | gemini-flash | 1M | OpenRouter |
| Gemini Pro | gemini-pro | 1M | OpenRouter |
| Claude Sonnet | claude-sonnet | 200K | Anthropic |
| Claude Haiku | claude-haiku | 200K | Anthropic |
| Project Type | Model | Why |
|---|---|---|
| Quick prototype (1–3 files) | nova-lite REC | Fast, cheap, S-tier on benchmarks |
| Multi-file feature (4–8 files) | nova-pro | Larger context, better coordination |
| Complex architecture (10+ files) | nova-premier | 1M context, deep reasoning |
| Tight budget | nova-lite | 33x cheaper than Pro |
| Speed priority | gemini-flash | Fastest inference |
Nova Forge auto-adapts prompts to each model’s context window:
/model nova-pro # Switch directly
/model # Interactive selector
/models # Compare all models
Pre-configured multi-agent team layouts. Nova Forge auto-selects the right one, or you can override with /formation.
Solo implementer. Config changes, small edits.
Implementer → Tester. Simple single-layer features.
Backend + Frontend → Integrator → Tester. Most common.
Architect → 2 parallel implementers. Greenfield builds.
3 parallel investigators with different strategies.
Threat modeler + Scanner → Fixer. Security audits.
Optimizer → Tester. Performance bottlenecks.
3 parallel reviewers (security, perf, coverage). Read-only.
Investigator → Fixer → Validator. Broken deployments.
4 reviewers → Synthesizer. Complex architecture decisions.
Auditor → Fixer → Verifier. Cross-file validation.
| Small (1–3 tasks) | Medium (4–8) | Large (9+) | |
|---|---|---|---|
| Routine | single-file | lightweight-feature | lightweight-feature |
| Medium | lightweight-feature | lightweight-feature | feature-impl |
| Complex | lightweight-feature | feature-impl | all-hands-planning |
Six trust levels control how much agents can do without asking.
Trust grows with successful builds: A0→A1 (5 builds), A1→A2 (10), A2→A3 (25). A3→A4 is never automatic — must use /autonomy 4.
/autonomy # Show current level
/autonomy ? # Explain all levels
/autonomy 3 # Set to Trusted
| Quick Build (default) | /interview | /guide | |
|---|---|---|---|
| Trigger | “Build me a todo app” | /interview | /guide |
| Questions | 2–6 targeted | 15–25 deep dive | Step-by-step wizard |
| Categories | Features, data, auth, design | 8 categories (features, data, auth, visual, API, realtime, deploy, testing) | Conversational |
| Best for | Prototypes, known scope | Complex projects, precise control | First-time users |
| Time | ~1 minute | ~5 minutes | ~3 minutes |
/interview for important projects. The deep planning produces dramatically better specs — especially for projects with auth, real-time features, or complex data models.
/preview # Auto-detect stack, start server + tunnel
/preview stop # Stop the preview
/preview status # Check preview status
14 supported stacks: Flask, FastAPI, Django, Streamlit, Next.js, Vite, Node.js, Go, Rust, Rails, PHP, generic Python, Docker, static HTML.
Creates a Cloudflare Tunnel for a shareable URL — no account needed. Falls back to localhost if Cloudflare is unavailable.
/deploy # Interactive (asks for domain)
/deploy myapp.herakles.dev # Direct deployment
Generates Dockerfile → builds image → runs container → nginx config → SSL → health check. All automated.
forge plan "expense tracker" --model nova-lite
forge build
forge preview
forge deploy --domain myapp.herakles.dev
| Command | Description |
|---|---|
/plan <goal> | Plan a project from description |
/build | Execute all tasks with AI agents |
/preview | Live preview via Cloudflare Tunnel |
/deploy [domain] | Ship to production (Docker + nginx) |
/status | Progress bar and overview |
/tasks | All tasks with dependencies |
| Command | Description |
|---|---|
/guide | Smart setup wizard |
/interview | 5-step deep planning |
/autonomy [0-5] | View or set autonomy level |
/model [alias] | Switch model |
/models | Compare all models |
/config [key] [val] | View or edit settings |
/login | Credential setup wizard |
/formation [name] | View or set formation |
| Command | Description |
|---|---|
/new <name> | Start fresh project |
/resume [n] | Resume recent project |
/cd <path> | Switch project directory |
/audit | View build audit log |
/builds [n] | Build history |
/health | System health dashboard |
/competition | Hackathon readiness check |
| Setting | Default | Description |
|---|---|---|
default_model | nova-lite | Model for new builds |
model_preset | nova | nova / mixed / premium |
project_dir | ~/projects | Where new projects are created |
max_turns | 50 | Maximum turns per agent task |
temperature | 0.3 | LLM sampling temperature |
auto_build | true | Auto-confirm in guided flow |
show_tips | true | Display contextual hints |
/config # View all settings
/config default_model nova-pro # Change a setting
python3 benchmark_nova_models.py --model nova-lite -v # Single model
python3 benchmark_nova_models.py --all # All 3 Nova models
python3 benchmark_nova_models.py --scenario kanban-board # Hard scenario
| Scenario | Difficulty | Stack |
|---|---|---|
| expense-tracker | Easy | Flask + SQLite + Chart.js |
| todo-app | Easy | FastAPI + SQLite |
| kanban-board | Hard | Flask + SQLite + Auth |
| realtime-kanban | Nightmare | Flask + SSE + Uploads |
| Model | Expense Tracker | Time | Turns |
|---|---|---|---|
| Nova Lite (32K) | S 100% | 144s | 40 |
| Nova Pro (300K) | S 100% | 167s | 39 |
| Nova Premier (1M) | S 100% | 1110s | 33 |
Grades: S 95–100% · A 85–94% · B 75–84% · C 60–74% · D 40–59% · F <40%
Set AWS env vars or run /login. Or switch to a non-AWS model: /model gemini-flash
Shouldn’t happen (3-stage fallback). Try a more specific description or use /interview.
Check for port conflicts: lsof -i :5000. Try /preview stop then /preview.
Circuit breaker auto-disables after 3 failures. Re-run /build for a fresh attempt.
Auto-compaction handles this. For large builds, switch to /model nova-pro or /model nova-premier.
Run /build again to retry only failed tasks. Or describe the issue and Nova will help fix it.
/interview for important projects/audit to see every tool call and decisionpython3 benchmark_nova_models.py --all