Using Visuals for UI Design with Agent OS

Visuals enable Agent OS to direct your agents build interfaces that match your design vision. By including mockups, wireframes, or screenshots during the research phase, you give agents a clear visual target for implementation.

Including visuals is entirely optional, but when provided, they help reduce ambiguity and ensure the final implementation aligns with your expectations.

This page covers:

When and where to add visuals
File naming conventions
Fidelity levels
How visuals are used throughout the workflow
Playwright integration

When and where to add visuals

Visuals are gathered during the research phase, which happens when you run the new-spec command.

During this phase, Agent OS will prompt you about whether you have visual assets to include. If you do, place them in:

Copy

agent-os/specs/[this-spec]/planning/visuals/

Commonly used formats for visuals: PNG, JPG, JPEG, PDF

Agent OS automatically detects and analyzes visual files in this folder, even if you don't explicitly mention them during the research phase.

File naming conventions

Give your visual files descriptive names that clearly indicate what they represent:

❌ Not descriptive

Screen Shot 2025-01-15.png

mockup-v3-final.jpg

image1.png

✅ Descriptive

user-profile-page.png

dashboard-mobile-320px.jpg

checkout-flow-wireframe.png

Consider including additional details in filenames:

Component or page name: user-profile-page.png
Viewport size: dashboard-mobile-320px.jpg
State or variation: form-validation-errors.png
Fidelity level: checkout-flow-wireframe.png

Fidelity levels

Different visual assets serve different purposes. Agent OS recognizes two fidelity levels:

High-fidelity

Detailed mockups for exact implementation

High-fidelity mockups are detailed designs that agents should match closely, including exact colors, typography, spacing, and visual effects.

By default, Agent OS treats all visuals as high-fidelity unless you indicate otherwise.

Low-fidelity

Wireframes for structural guidance

Low-fidelity wireframes are rough layouts that provide structural guidance without dictating exact styling. Agents will use these for layout and component placement while applying your app's existing design system.

To indicate a visual is low-fidelity, include one of these keywords in the filename:

lofi or lo-fi
wireframe
sketch
rough

Example: homepage-wireframe.png or checkout-lofi-sketch.jpg

How visuals are used throughout the workflow

Visual assets are referenced at multiple stages of the Agent OS workflow:

1

Research phase

During new-spec, Agent OS prompts you to add visuals and automatically detects any files placed in the planning/visuals/ folder.

It will then analyze the visuals and ask you clarifying questions based on what it "sees" in them, to help you refine the planning of this feature spec.

2

Specification phase

During create-spec, Agent OS references the visuals when generating the spec, mapping UI components and requirements to the visual assets.

3

Implementation phase

During implement-spec, UI implementer agents (such as ui-designer) reference the visuals to build components that match the design.

4

Verification phase

During verification, verifier agents (such as frontend-verifier) compare the implementation against the original visuals to ensure alignment.

Playwright integration

Agent OS includes Playwright MCP integration for browser-based visual verification. This enables agents to launch browsers, take screenshots, and compare implementations against your visual assets.

Using Playwright is optional. Without it, your agents can still analyze your visual assets and use them to guide their work. However, it's highly recommended since it adds a lot more reliability and accuracy to agents' visual UI implementation. And it's free!

To install Playwright, follow the instructions in the Playwright MCP repository.

Agents with Playwright access

By default, these agents have access to Playwright tools:

ui-designer: Can launch browsers during implementation to preview work
frontend-verifier: Can take screenshots and verify UI matches mockups

Adding Playwright to custom agents

To give Playwright access to your custom agents, add the Playwright tool to their configuration:

Copy

- id: custom-ui-agent
  tools: Write, Read, Bash, WebFetch, Playwright

That Playwright tool name is actually a keyword. When Agent OS compiles your Claude Code agents (if you're using multi-agent mode with Claude Code), it will automatically detect it and replace it with the 12+ specific Playwright browser tool names.

Next steps

Install Agent OS

See the Agent OS workflow

Learn about roles and agents

Visuals: Design-Driven Development

When and where to add visuals

File naming conventions

❌ Not descriptive

✅ Descriptive

Fidelity levels

High-fidelity

Detailed mockups for exact implementation

Low-fidelity

Wireframes for structural guidance

How visuals are used throughout the workflow

1

Research phase

2

Specification phase

3

Implementation phase

4

Verification phase

Playwright integration

Agents with Playwright access

Adding Playwright to custom agents

Next steps

Free Resources

Training & Community

Visuals: Design-Driven Development

When and where to add visuals

File naming conventions

❌ Not descriptive

✅ Descriptive

Fidelity levels

High-fidelity Detailed mockups for exact implementation

Low-fidelity Wireframes for structural guidance

How visuals are used throughout the workflow

1 Research phase

2 Specification phase

3 Implementation phase

4 Verification phase

Playwright integration

Agents with Playwright access

Adding Playwright to custom agents

Next steps

Free Resources

Training & Community

High-fidelity

Detailed mockups for exact implementation

Low-fidelity

Wireframes for structural guidance

1

Research phase

2

Specification phase

3

Implementation phase

4

Verification phase