Home How It Works Skills Skills Catalog Capabilities Roadmap Changelog FAQ Pricing
Capabilities

What Scaffold OS
can understand and build.

From a simple SaaS product to a sprawling enterprise system spanning cloud infrastructure, ML pipelines, and external platform automation — all managed under one coordination protocol.

Multi-System Projects Existing Codebases ML & AI Pipelines Enterprise Platforms Solo to Team Scale
v5.5-v5.6

The engine now understands context, delivery, and release readiness structurally.

That means uploaded workbooks, repo snapshots, deploy expectations, next-version recommendations, and readiness checks can become explicit engine state. The wrapper stops guessing what matters and starts rendering what the engine already knows.

Context and Continuity
Trusted session-ready context

Incoming material is registered, interpreted, summarized, and carried forward so later sessions start with trusted context instead of raw attachment chaos.

Uploads, URLs, repo snapshots, and prior bundles handled as durable inputs
Parser choices and source handling become easier to surface honestly
Later sessions restart faster with less context rebuild waste
Delivery Intelligence
Deploy-ready handoff and next-version planning

Delivery is no longer just a wrapper concern. The engine can now coordinate deploy intent, rollback-aware follow-through, and recommendation mode for what the product should build next.

Deploy targets, environment expectations, and smoke checks can be surfaced cleanly
Rollback thinking becomes more explicit before release follow-through
Live products can route into recommendation mode for the next version
Readiness Hardening
Smarter repo onboarding and safer real-project starts

Existing repositories surface richer continuity clues, while release gates and environment checks make it easier to confirm the platform is truly ready before serious work begins.

Hotspots, ownership clues, entry points, and dependency risk
Clearer extend-vs-rebuild continuity decisions
Secret checks and environment doctor signals that can block unsafe starts explicitly
Planning Intelligence — Core Protocol

Plan what's worth building.
Then make it bulletproof.

A perfect technical plan for the wrong product is the most expensive failure mode in software. v4.1 adds a planning intelligence layer that validates demand and challenges assumptions before architecture is locked — so every plan starts from a position of known demand, not hopeful guessing.

Demand Reality Check

Before any architecture is written, the system forces three questions: what do target users do today without this product, who specifically would pay for it this week, and what is the absolute minimum version that proves real demand. Answers are locked into the plan as the reference point for all scope decisions.

Prevents building technically perfect software that nobody needs.

Adversarial Spec Review

After the challenge phase, the system argues against its own plan — surfacing 2–3 specific objections to the approach before the architecture is locked. You hear the strongest case against your plan while you can still change direction. Not after 3 weeks of building.

The objections and your response are recorded in the project record.

Planning Mode Control

Before files are generated you choose how the engine approaches your plan: Expand (surface everything missing and flag unconsidered edge cases), Hold Scope (make what's specified bulletproof, no additions), or Reduce (cut everything that isn't required to prove the narrowest wedge).

Same project, different modes for different moments in the build cycle.

Failure Path Architecture

Every architecture document now includes an Error & Rescue Map and an Observability section — required, not optional. Every named error state, what users see, what the system does, and how it's logged. Plus logging format, key metrics, health check endpoint, and alerting thresholds. Planned before a line of code is written.

No silent failures. No "we'll add logging later."
Tier 1 — Enterprise-Scale Infrastructure

Enterprise platforms.
Not integrations. Build surfaces.

Scaffold OS doesn't write code that calls Salesforce. It logs into Salesforce and builds inside it — custom object schemas, permission sets, automation flows — as tracked, drift-monitored artifacts. The application boundary is no longer confined to a git repository.

Enterprise SaaS
Salesforce / SAP / ServiceNow

Agent generates custom object schemas, configures role-based permission sets, and connects complex automation workflows — directly inside the platform, via API.

Dynamic schema generation with relationship mapping
Automated permission set configuration per role
Custom object drift tracking vs. declared schema
Cloud Infrastructure
AWS / Cloudflare / Kubernetes

Provisions IAM roles, configures WAF routing rules, deploys serverless functions, and manages Kubernetes manifests as structured, versioned artifacts.

IAM role scaffolding with least-privilege enforcement
Serverless function deployment with version pinning
K8s manifest generation with config contract
Data Ecosystems
Snowflake / dbt / Databricks

Writes complex database migrations, creates analytics views and transformation models, and establishes data contracts across the entire pipeline as version-controlled code.

Schema migrations with rollback contracts
dbt model generation with lineage tracking
Data pipeline drift vs. declared transformation logic
Tier 2 — Workflow Automation & Content Platforms
Workflow Automation
n8n / Zapier / Make

Creates complex multi-step automation workflows from structured templates via REST API. The workflows themselves are version-controlled, drift-tracked build artifacts — not just configurations.

Multi-step workflow creation via API from JSON spec
Workflow version comparison and drift detection
Trigger logic declaratively specified and enforced
Headless CMS
WordPress / Strapi / Ghost

Structures the complete CMS via CLI and REST API — custom post types, field groups, metabox configurations — all as declarative artifacts that can be rebuilt identically.

Custom post type and taxonomy scaffolding
ACF/Metabox field group generation
CMS schema drift detection vs. declared structure
Workspace Logic
Notion / Todoist / Trello

Orchestrates enterprise knowledge graphs, canonical task synchronization pipelines, and structural templates across collaborative workspaces — with MCP-driven enforcement.

Knowledge base structure scaffolding
Cross-workspace sync logic definition
Template enforcement and drift detection
Tier 3 — AI & ML Infrastructure

First-class ML support.
Not squeezed into a code template.

ML projects aren't just more code. They have training pipelines, evaluation gates, performance thresholds, and model drift that's entirely separate from code drift. Scaffold OS handles this natively — auto-detected from architecture signals.

Full ML Build Pipeline
Auto-Detect

PyTorch, TF, XGBoost, LangChain signals detected from architecture

ML Setup

GPU config, training environment, evaluation pipeline scaffolded

Model Spec

Architecture, performance thresholds, training data, eval criteria defined

Training Build

Training pipeline executed as a first-class build step type

Eval Gate

Evaluation pipeline with pass/fail gate vs. declared thresholds

Model Drift States
BASELINE — Model performance within declared thresholds
DEGRADED — Performance outside acceptable range, flagged
RETRAIN_REQUIRED — Model must be retrained before deployment
Vector & AI Infrastructure

Pinecone, Qdrant, Weaviate, OpenAI, Anthropic, HuggingFace.

Collection schemas, hybrid search indices, chunking pipelines, fine-tuning orchestration — all as tracked, versioned infrastructure with drift monitoring.

Agentic Systems (LangChain / AutoGen / CrewAI)

Scaffold OS doesn't just run agents — it builds agent architectures. It defines schemas, tools, handoff protocols, and orchestration logic for multi-agent systems to execute. Including, yes, for AI systems that build other AI systems.

Tier 4 — Commerce & Financial Logic
Headless Commerce
Shopify / WooCommerce

Structures the complete store via Admin APIs — product catalog schemas, inventory routing rules, pricing matrices, and discount logic — as versioned, tracked code.

Payment Infrastructure
Stripe / Paddle / LemonSqueezy

Product and pricing configuration, webhook handler scaffolding, subscription logic, and metered billing architecture — all declaratively defined and drift-tracked.

Communication Layers
SendGrid / Twilio / Postmark

Template structure, delivery routing rules, notification trigger logic, and provider failover — structured as contracted build artifacts, not ad-hoc integrations.

Archaeology Mode

Works with code
you didn't build with us.

You have an existing codebase. Maybe it's a legacy system. Maybe it's from a previous dev team. Maybe it's your own code that grew past the point where you feel in control of it. Scaffold OS handles this — with a dedicated, standalone archaeology workflow that runs before any building begins.

Archaeology Flow (current standalone session)
Repository Import
Codebase Scan
Architecture Reconstruction
Spec Generation
Drift Baseline
Full Protocol Active

Archaeology is a dedicated, separate session that runs before any build work begins. The agent reads the codebase - stack fingerprinting, schema reconstruction from migrations, feature area mapping, API contract extraction from routes, hotspot detection, and continuity clues. Every inferred section is marked [VERIFY] for human confirmation. The result: a complete planning file set built through the full Code Archaeology protocol. From that point, the project is managed under the current Scaffold OS protocol with richer context, delivery intelligence, and readiness coverage.

Any Tech Stack

The protocol is stack-agnostic. Python, Node, Go, Rails, PHP — it reads what's there and manages it.

Partial Codebases

Don't need to hand over the full project. Even partial system handover works — the system infers what it doesn't know and flags uncertainty explicitly.

No Re-explaining

Once archaeology is complete, all future sessions start with full context. You never re-explain the system. State lives in documentation, not memory.

Infrastructure Layer

Skip the infrastructure tax.
Start building business logic immediately.

For non-technical founders and teams working on standard-stack projects, Scaffold OS includes an optional managed infrastructure layer. Instead of spending 30–50% of your AI budget debugging Docker networking and authentication wiring, the system detects your project profile and auto-generates a fully configured, ready-to-run local environment.

Auto-Detection

During the brainstorm phase, the system reads your architecture and determines if your project fits the standard stack profile. If it does, the infrastructure layer activates automatically.

What Gets Generated

A complete, runnable environment configuration — database, authentication provider, blob storage, and API gateway — pre-wired and ready to accept agent connections. No manual setup. No Docker debugging.

MCP-Connected

The environment exposes connection parameters via MCP, allowing agents to interact with the database, auth system, and storage directly — without any manual configuration or credential management.

Human + Machine Columns

The generated setup file uses a two-column format: Human column covers what you configure manually today. Machine column documents how a future SaaS version of the system will automate each step.

The real cost of infrastructure setup: A typical agent session solving Docker networking issues consumes 40–60% of a context window solving infrastructure problems before writing a single line of business logic. The infrastructure layer eliminates this entirely — so 100% of the build budget goes toward features.

Intelligence Engine

Not one agent doing everything.
A coordinated intelligence architecture.

The cognitive layer that powers Scaffold OS is designed around a core insight: different phases of software development require fundamentally different types of intelligence. We don't send one agent into a vague task. We route each phase to the most appropriate cognitive mode — automatically.

Parallel Agent Architecture

The contract enforcement system enables multiple agents to work on different features simultaneously without collision. When Feature A's agent changes an API endpoint, the contract system immediately flags all agents working on features that depend on that endpoint — before they build against a stale shape.

Result: parallel builds without merge conflicts or silent API drift between agents.

Extended Reasoning Mode

During the architecture challenge phase, the system invokes deep deliberative reasoning — interleaved between each challenge question. It doesn't just accept your initial answers. It surfaces assumption gaps, contradictions, and missing requirements that standard prompting misses entirely.

Result: architecture documents that reflect what you need, not what you described in your opening message.

Adaptive Skill Invocation

The routing layer detects which cognitive mode to invoke at each step: architect mode for system design, security analyst mode for audit passes, implementation mode for build steps, archaeology mode for codebase reconstruction. The protocol triggers the right mindset — not the agent's default behavior.

Result: each phase is handled by an agent operating at maximum effectiveness for that specific task.

Context Distillation Engine

Full session history is compressed into structured state files at the end of every session. A new agent starting a new session reconstructs complete project context from these files — without reading the original conversation. This is how a cheap, fast model can pick up exactly where an expensive model left off.

Result: perfect context handoff across sessions, across model classes, across team members.

Adaptive Technical Level Detection

The brainstorm intelligence automatically calibrates its behavior based on detected user sophistication — detecting across three modes during the conversation:

Non-Technical

Makes all technical decisions autonomously. Explains in plain language. Never asks about tech stack — infers from project requirements.

Semi-Technical

Presents options with trade-offs. Explains implications before decisions. Confirms major architectural choices collaboratively.

Technical

Peer-level discussion. Accepts technical constraints directly. Debates architecture choices. Respects explicit overrides.

Cross-System Contract Enforcement

One API change.
Every dependent blocked instantly.

The most silent and expensive failure in multi-agent builds: Agent A changes an API shape, Agent B builds against the old shape. Neither knows. The bug ships. Scaffold OS's contract enforcement system prevents this structurally — not through convention, but through automatic propagation.

Example: Breaking Schema Change Cascade
Backend agent renames POST /orders/createPOST /v2/orders
⚠ Flagged: Frontend Agent

Status set to DIVERGED. Build blocked. Frontend cannot call an endpoint that no longer exists. Receives contract change alert with 24-hour ACK requirement.

⚠ Flagged: Snowflake Pipeline

External data pipeline consuming order events is flagged as DIVERGED. Must be updated to read from new endpoint structure before new data contracts take effect.

✓ Not Blocked: Auth Module

Auth system has no declared dependency on the orders endpoint. No flag generated. Auth agent continues working uninterrupted.

8-step Contract Change Protocol activates automatically: change is staged → all 3+ dependent systems identified → alerts sent → ACK required within 24h → coordinator reviews → each dependent updated → audit confirms consistency → build resumes.
Code → Code Contracts

Backend API shape vs. what every frontend feature expects. Changes propagate automatically to all consumers of that endpoint.

Code → External Platform Contracts

Your backend vs. Snowflake pipelines, n8n workflows, Salesforce object fields — all drift-tracked identically to local code.

Model → Performance Contracts

ML models have declared performance thresholds. When production metrics drift below threshold, the system flags RETRAIN_REQUIRED before your users notice.

Continuous Audit System

Audit isn't a one-time gate.
It runs at every scale, every session.

Most systems audit once before a release. Scaffold OS runs a 3-tier audit architecture continuously — fast spot checks after every build cycle, deep focused audits when signals indicate risk, and full pre-build audit rounds for major phases. Each tier is calibrated for the right cost-to-signal ratio.

Tier 1
Spot Audit
5–10 min

Rapid 5-check scan after every build cycle. Runs automatically as part of the continuous mode loop. Catches regressions before they compound.

✓ Spec consistency
✓ Contract integrity
✓ Security flags
✓ Test coverage
✓ Dependency drift
Tier 2
Focused Audit
30–60 min

Deep 6-dimension investigation. Triggered automatically when Spot Audit flags risk, before a major feature merge, or when contract changes occur across systems.

✓ Data flow tracing
✓ Cross-system consistency
✓ Security end-to-end
✓ Agent dry-run simulation
✓ Spec/dependency cross-validation
✓ Fix verification
Tier 3 (Full)
Full Pre-Build Audit
2–4 hrs

Two-round 13-dimension gate that runs before major build phases begin. Nothing is written to code until both rounds return a CONSISTENT verdict.

✓ Round 1: 7 consistency dimensions
✓ Round 2: 6 deep-trace dimensions
✓ Build blocked until CONSISTENT
✓ Architecture problems cost nothing here
✓ Same problems at session 8 cost days

Tier selection is automatic: The continuous mode engine decides which tier to invoke based on what changed in the last cycle — no manual audit scheduling needed. Low-risk cycles get a Spot. High-risk cycles get a Focused. Phase boundaries get the Full audit.

Engine Forward Compatibility

Projects don't get stranded.
The engine upgrades with you.

Every project built with Scaffold OS carries its engine version. When a new engine version ships, a migration agent reads the project state, applies changes in order — additive only, never overwriting decisions — and logs every change for human review. No manual file hunting. No starting from scratch.

Engine File Headers

Every system prompt file carries a machine-readable version header. The migration agent can identify exactly which files to replace vs. which planning files to leave untouched — no ambiguity.

Migration Audit Trail

Every upgrade is fully logged — what changed, when, which harness applied it, and which items need human review. Full audit trail across the project lifetime.

Multi-Version Jump

Skipped a version? No problem. The migration agent applies each version transition in order — v3.5→v3.6→v4.0 — so no transition is skipped and the project reaches the latest version correctly.

Decisions Are Preserved

Architecture files, build plans, feature specs — all project-specific decisions stay exactly as written. Only engine system files get replaced. Your project's IP is never touched.

Who It's For

Solo founder to enterprise team.
The protocol scales.

Solo Founders

One person. Multiple systems. Scaffold OS handles backend, frontend, automation, payments, and infrastructure — in parallel, coordinated — while you focus on product decisions.

Small Engineering Teams

Teams of 2-8. Scaffold OS handles the coordination overhead that slows small teams: contract enforcement, drift detection, build state tracking between members.

Enterprise Projects

Complex systems spanning Salesforce, AWS, Snowflake, and internal microservices — all under one protocol. Enterprise build surfaces, enterprise-grade contract enforcement.

Domain Intelligence

The protocol knows what it's building.
And what context it is building from.

v4.6 adds project profile matching — 11 profiles that auto-detect your project type at brainstorm and load the right domain skills automatically. A SaaS build gets a SaaS specialist. An ML build gets an ML engineer. Quality gates enforce actual verdicts, not advisory reports.

SaaS Product
Multi-tenant auth, billing flows, onboarding sequences, subscription management
Enterprise Platform
RBAC, audit logging, SSO integration, compliance controls, enterprise admin
API-First Product
Spec-first OpenAPI design, versioning strategy, rate limiting, developer experience
Mobile App
Offline sync, push notifications, app store requirements, accessibility compliance
ML Platform
Model lifecycle, training pipelines, experiment tracking, evaluation gates, feature stores
E-Commerce
Product catalog, cart mechanics, checkout flow, payment integration, order management
Internal Tool
Admin panels, bulk operations, role-gated views, workflow automation, reporting
Data Platform
Warehouse schemas, transformation pipelines, data quality gates, lineage tracking
Automation Platform
Trigger/action design, retry logic, idempotency, webhook handling, queue management
Developer Tool
SDK design, CLI ergonomics, documentation-first development, versioning protocol
Content / Media
CMS architecture, publishing workflows, SEO structure, media pipeline, CDN strategy
Explore the Skills System → Browse Skills Catalog
Canonical Workflow + Delivery State - v5.6

The engine owns workflow state.
Not the surface on top.

Scaffold OS now maintains compact, machine-readable records for every meaningful workflow signal. Project status, health, session handoff context, decision urgency, deployment follow-through, and next-version planning signals are all engine-first outputs - available to any surface without guesswork.

Canonical Project State Snapshot

A machine-readable snapshot of current workflow state is the authoritative source of truth. Phase, build progress, health score, and active context — available in one compact, queryable record. No session parsing required.

Any wrapper or surface reads current project state from one place.

Session Handoff Artifact

A concise human-readable handoff summary is generated at each session boundary — where work stopped, what's pending, what context is needed for a clean resume. Session continuity is explicit, not inferred.

Clean resumability across any agent, model, or surface boundary.

Canonical Health Score

Project health is engine-computed, not dashboard-guessed. Drift states, open debt, and session staleness roll into one score any surface can trust without recalculating it independently.

One formula. One score. Consistent across every wrapper and dashboard.

Git Strategy, Deploy Coordination, and Follow-Through

Deploy readiness now reflects the project's actual branching model, and delivery handoff has become more structured. PR-required projects get draft-ready records, direct-deploy projects skip that overhead, and surfaces can consume clearer deployment and rollback signals without inventing release rules from scratch.

CI/CD surfaces consume branch-aware release and delivery signals without hardcoding project rules.

Generic wrapper execution contract: Run requests, status signals, result records, recommendation outputs, and delivery follow-through form a clean, stable contract any wrapper surface can use to trigger and observe Scaffold OS sessions — without embedding session-specific logic or parsing internal files. The engine generates these outcomes; the surface consumes them.

Delivery Intelligence — Live in v5.5 and v5.6

Ship, recover, and decide what comes next.
From one protocol surface.

These releases turn more of the operational story into engine-owned behavior. Delivery intent, rollback awareness, next-version planning, and release-readiness checks are no longer loose wrapper-side guesses.

Deploy Coordination
Structured release handoff

The engine can hand a surface a clear deployment story: where work is going, what it depends on, what checks should run, and what outcome must come back.

Target-aware deploy expectations for each environment
Smoke-check follow-through after shipping
Cleaner surface handback after a release attempt
Recommendation Mode
Plan the next version structurally

A live product can now route into a dedicated planning mode that studies shipped work, backlog pressure, health, and open decisions before recommending the next version.

Ranked next-version candidates instead of freeform wish lists
One clear recommendation for the strongest next step
Cleaner transition from planning into the next update cycle
Readiness Hardening
Prove the environment before real work

The platform now checks environment shape and release readiness more explicitly, which reduces false confidence before the first serious project or deploy run begins.

Environment doctor checks for setup drift
Stricter release validation before serious execution
Golden-path confidence for the most important starts
Protocol Platform Foundation — v5.0

The engine is reusable.
Multiple surfaces. One protocol.

v5.2 makes Scaffold OS a reusable protocol foundation, v5.3 makes delivery and project intelligence more operationally complete, v5.4 adds context intelligence, and v5.5-v5.6 extend that foundation into deploy coordination, next-version recommendation, and readiness hardening. Any product layer that follows the wrapper contract gets access to canonical state, lifecycle events, trusted context summaries, target tracking, integration readiness, delivery follow-through, and queue management.

Target Registry

Declared build targets — deployment environments, external platforms, cloud destinations — are registered in a canonical target manifest. Each target carries its own readiness state, independently addressable by any wrapper or integration surface.

Multi-environment targets tracked in one canonical record
Per-target readiness state queryable without parsing build history
Surfaces address specific targets independently
Integration Registry

Active integrations are tracked separately from the build plan. Per-integration readiness manifests surface whether each integration is connected, configured, and healthy — without requiring any surface to parse internal session records.

Integration health visible without reading session files
Connection, config, and readiness state per integration
Tracked independently from the active build plan
Lifecycle Event History

The engine emits a canonical event stream — wrapper-safe, append-only, timestamped — covering the full project lifecycle. Any surface consuming Scaffold OS can query or replay this history without parsing session files or inferring state from scattered records.

Full lifecycle events in a queryable, append-only stream
Replay history without parsing session files
Wrapper-safe format for any surface to consume

Multi-track work queue management: The engine maintains a canonical summary of all active work tracks across a project. Parallel workstreams can coexist, be prioritized, and be tracked without colliding with each other — giving any surface a single place to read overall project work state without managing session coordination manually.

Ready to build something complex?

See how it works or explore the full roadmap.

See How It Works → See What's Coming