Visuals: Design-Driven Development

Visuals enable Agent OS to direct your agents build interfaces that match your design vision. By including mockups, wireframes, or screenshots during the research phase, you give agents a clear visual target for implementation.

Including visuals is entirely optional, but when provided, they help reduce ambiguity and ensure the final implementation aligns with your expectations.

This page covers:

When and where to add visuals

Visuals are gathered during the research phase, which happens when you run the new-spec command.

During this phase, Agent OS will prompt you about whether you have visual assets to include. If you do, place them in:

Copy
agent-os/specs/[this-spec]/planning/visuals/

Commonly used formats for visuals: PNG, JPG, JPEG, PDF

Agent OS automatically detects and analyzes visual files in this folder, even if you don't explicitly mention them during the research phase.

File naming conventions

Give your visual files descriptive names that clearly indicate what they represent:

❌ Not descriptive

Screen Shot 2025-01-15.png
mockup-v3-final.jpg
image1.png

✅ Descriptive

user-profile-page.png
dashboard-mobile-320px.jpg
checkout-flow-wireframe.png

Consider including additional details in filenames:

  • Component or page name: user-profile-page.png
  • Viewport size: dashboard-mobile-320px.jpg
  • State or variation: form-validation-errors.png
  • Fidelity level: checkout-flow-wireframe.png

Fidelity levels

Different visual assets serve different purposes. Agent OS recognizes two fidelity levels:

High-fidelity
Detailed mockups for exact implementation

High-fidelity mockups are detailed designs that agents should match closely, including exact colors, typography, spacing, and visual effects.

By default, Agent OS treats all visuals as high-fidelity unless you indicate otherwise.

Low-fidelity
Wireframes for structural guidance

Low-fidelity wireframes are rough layouts that provide structural guidance without dictating exact styling. Agents will use these for layout and component placement while applying your app's existing design system.

To indicate a visual is low-fidelity, include one of these keywords in the filename:

  • lofi or lo-fi
  • wireframe
  • sketch
  • rough

Example: homepage-wireframe.png or checkout-lofi-sketch.jpg

How visuals are used throughout the workflow

Visual assets are referenced at multiple stages of the Agent OS workflow:

1
Research phase

During new-spec, Agent OS prompts you to add visuals and automatically detects any files placed in the planning/visuals/ folder.

It will then analyze the visuals and ask you clarifying questions based on what it "sees" in them, to help you refine the planning of this feature spec.

2
Specification phase

During create-spec, Agent OS references the visuals when generating the spec, mapping UI components and requirements to the visual assets.

3
Implementation phase

During implement-spec, UI implementer agents (such as ui-designer) reference the visuals to build components that match the design.

4
Verification phase

During verification, verifier agents (such as frontend-verifier) compare the implementation against the original visuals to ensure alignment.

Playwright integration

Agent OS includes Playwright MCP integration for browser-based visual verification. This enables agents to launch browsers, take screenshots, and compare implementations against your visual assets.

Using Playwright is optional. Without it, your agents can still analyze your visual assets and use them to guide their work. However, it's highly recommended since it adds a lot more reliability and accuracy to agents' visual UI implementation. And it's free!

To install Playwright, follow the instructions in the Playwright MCP repository.

Agents with Playwright access

By default, these agents have access to Playwright tools:

  • ui-designer: Can launch browsers during implementation to preview work
  • frontend-verifier: Can take screenshots and verify UI matches mockups

Adding Playwright to custom agents

To give Playwright access to your custom agents, add the Playwright tool to their configuration:

Copy
- id: custom-ui-agent
  tools: Write, Read, Bash, WebFetch, Playwright

That Playwright tool name is actually a keyword. When Agent OS compiles your Claude Code agents (if you're using multi-agent mode with Claude Code), it will automatically detect it and replace it with the 12+ specific Playwright browser tool names.

Next steps

Install Agent OS

See the Agent OS workflow

Learn about roles and agents

Builder Methods

Training & community for pro software developers building with AI.

Created by Brian Casel (that's me). I'm a career software developer, founder, and creator of Builder Methods and Agent OS, the system for spec-driven development with AI.

© 2025 CasJam Media, LLC / Builder Methods
Contact