design-consultation
Design consultation: understands your product, researches the landscape, proposes a complete design system (aesthetic, typography, color, layout, spacing, motion), and generates font+color preview pages. Creates DESIGN.md as your project's design source of truth. For existing sit
What it does
Preamble
eval "$(~/.vibestack/bin/vibe-slug 2>/dev/null)" 2>/dev/null || SLUG="unknown"
_LEARN_FILE="${VIBESTACK_HOME:-$HOME/.vibestack}/projects/${SLUG:-unknown}/learnings.jsonl"
if [ -f "$_LEARN_FILE" ]; then
_LEARN_COUNT=$(wc -l < "$_LEARN_FILE" 2>/dev/null | tr -d ' ')
echo "LEARNINGS: $_LEARN_COUNT entries loaded"
if [ "$_LEARN_COUNT" -gt 5 ] 2>/dev/null; then
~/.vibestack/bin/vibe-learnings-search --limit 5 2>/dev/null || true
fi
else
echo "LEARNINGS: none yet"
fi
Phase 0: Pre-checks
Check for existing DESIGN.md:
ls DESIGN.md design-system.md 2>/dev/null || echo "NO_DESIGN_FILE"
- If a DESIGN.md exists: Read it. Ask the user: "You already have a design system. Want to update it, start fresh, or cancel?"
- If no DESIGN.md: continue.
Gather product context from the codebase:
cat README.md 2>/dev/null | head -50
cat package.json 2>/dev/null | head -20
ls src/ app/ pages/ components/ 2>/dev/null | head -30
Look for office-hours output:
setopt +o nomatch 2>/dev/null || true # zsh compat
eval "$(~/.vibestack/bin/vibe-slug 2>/dev/null)"
ls ~/.vibestack/projects/$SLUG/*office-hours* 2>/dev/null | head -5
ls .context/*office-hours* .context/attachments/*office-hours* 2>/dev/null | head -5
If office-hours output exists, read it — the product context is pre-filled.
If the codebase is empty and purpose is unclear, say: "I don't have a clear picture of what you're building yet. Want to explore first with /office-hours? Once we know the product direction, we can set up the design system."
Find the browse binary (optional — enables visual competitive research):
SETUP
# vibestack does not include a browse daemon.
echo "BROWSE_NOT_AVAILABLE"
If BROWSE_NOT_AVAILABLE: skip all $B commands and use text-only fallbacks (curl, open, direct HTTP checks).
DESIGN SETUP
# vibestack does not include a design daemon.
echo "DESIGN_NOT_AVAILABLE"
If DESIGN_NOT_AVAILABLE: skip visual mockup generation and fall back to text-based design review.
{{include lib/snippets/prior-learnings.md}}
Phase 1: Product Context
Ask the user a single question that covers everything you need to know. Pre-fill what you can infer from the codebase.
AskUserQuestion Q1 — include ALL of these:
- Confirm what the product is, who it's for, what space/industry
- What project type: web app, dashboard, marketing site, editorial, internal tool, etc.
- "Want me to research what top products in your space are doing for design, or should I work from my design knowledge?"
- Explicitly say: "At any point you can just drop into chat and we'll talk through anything — this isn't a rigid form, it's a conversation."
If the README or office-hours output gives you enough context, pre-fill and confirm: "From what I can see, this is [X] for [Y] in the [Z] space. Sound right? And would you like me to research what's out there in this space, or should I work from what I know?"
Memorable-thing forcing question. Before moving on, ask the user: "What's the one thing you want someone to remember after they see this product for the first time?"
One sentence answer. Could be a feeling ("this is serious software for serious work"), a visual ("the blue that's almost black"), a claim ("faster than anything else"), or a posture ("for builders, not managers"). Write it down. Every subsequent design decision should serve this memorable thing. Design that tries to be memorable for everything is memorable for nothing.
Taste profile (if this user has prior sessions)
Read the persistent taste profile if it exists:
_TASTE_PROFILE=~/.vibestack/projects/$SLUG/taste-profile.json
if [ -f "$_TASTE_PROFILE" ]; then
# Schema v1: { dimensions: { fonts, colors, layouts, aesthetics }, sessions: [] }
# Each dimension has approved[] and rejected[] entries with
# { value, confidence, approved_count, rejected_count, last_seen }
# Confidence decays 5% per week of inactivity — computed at read time.
cat "$_TASTE_PROFILE" 2>/dev/null | head -200
echo "TASTE_PROFILE_FOUND"
else
echo "NO_TASTE_PROFILE"
fi
If TASTE_PROFILE_FOUND: Summarize the strongest signals (top 3 approved entries per dimension by confidence * approved_count). Include them in the design brief:
"Based on ${SESSION_COUNT} prior sessions, this user's taste leans toward: fonts [top-3], colors [top-3], layouts [top-3], aesthetics [top-3]. Bias generation toward these unless the user explicitly requests a different direction. Also avoid their strong rejections: [top-3 rejected per dimension]."
If NO_TASTE_PROFILE: Fall through to per-session approved.json files (legacy).
Conflict handling: If the current user request contradicts a strong persistent signal (e.g., "make it playful" when taste profile strongly prefers minimal), flag it: "Note: your taste profile strongly prefers minimal. You're asking for playful this time — I'll proceed, but want me to update the taste profile, or treat this as a one-off?"
Decay: Confidence scores decay 5% per week. A font approved 6 months ago with 10 approvals has less weight than one approved last week. The decay calculation happens at read time, not write time, so the file only grows on change.
Schema migration: If the file has no version field or version: 0, it's
the legacy approved.json aggregate (taste-update not available in vibestack)
will migrate it to schema v1 on the next write.
If a taste profile exists for this project, factor it into your Phase 3 proposal. The profile reflects what the user has actually approved in prior sessions — treat it as a demonstrated preference, not a constraint. You may still deliberately depart from it if the product direction demands something different; when you do, say so explicitly and connect the departure to the memorable-thing answer above.
Phase 2: Research (only if user said yes)
If the user wants competitive research:
Step 1: Identify what's out there via WebSearch
Use WebSearch to find 5-10 products in their space. Search for:
- "[product category] website design"
- "[product category] best websites 2025"
- "best [industry] web apps"
Step 2: Visual research via browse (if available)
If the browse binary is available ($B is set), visit the top 3-5 sites in the space and capture visual evidence:
$B goto "https://example-site.com"
$B screenshot "/tmp/design-research-site-name.png"
$B snapshot
For each site, analyze: fonts actually used, color palette, layout approach, spacing density, aesthetic direction. The screenshot gives you the feel; the snapshot gives you structural data.
If a site blocks the headless browser or requires login, skip it and note why.
If browse is not available, rely on WebSearch results and your built-in design knowledge — this is fine.
Step 3: Synthesize findings
Three-layer synthesis:
- Layer 1 (tried and true): What design patterns does every product in this category share? These are table stakes — users expect them.
- Layer 2 (new and popular): What are the search results and current design discourse saying? What's trending? What new patterns are emerging?
- Layer 3 (first principles): Given what we know about THIS product's users and positioning — is there a reason the conventional design approach is wrong? Where should we deliberately break from the category norms?
Eureka check: If Layer 3 reasoning reveals a genuine design insight — a reason the category's visual language fails THIS product — name it: "EUREKA: Every [category] product does X because they assume [assumption]. But this product's users [evidence] — so we should do Y instead." Log the eureka moment (see preamble).
Summarize conversationally:
"I looked at what's out there. Here's the landscape: they converge on [patterns]. Most of them feel [observation — e.g., interchangeable, polished but generic, etc.]. The opportunity to stand out is [gap]. Here's where I'd play it safe and where I'd take a risk..."
Graceful degradation:
- Browse available → screenshots + snapshots + WebSearch (richest research)
- Browse unavailable → WebSearch only (still good)
- WebSearch also unavailable → agent's built-in design knowledge (always works)
If the user said no research, skip entirely and proceed to Phase 3 using your built-in design knowledge.
Design Outside Voices (parallel)
Use AskUserQuestion:
"Want outside design voices? Codex evaluates against OpenAI's design hard rules + litmus checks; Claude subagent does an independent design direction proposal."
A) Yes — run outside design voices B) No — proceed without
If user chooses B, skip this step and continue.
Check Codex availability:
which codex 2>/dev/null && echo "CODEX_AVAILABLE" || echo "CODEX_NOT_AVAILABLE"
If Codex is available, launch both voices simultaneously:
- Codex design voice (via Bash):
TMPERR_DESIGN=$(mktemp /tmp/codex-design-XXXXXXXX)
_REPO_ROOT=$(git rev-parse --show-toplevel) || { echo "ERROR: not in a git repo" >&2; exit 1; }
command -v codex >/dev/null 2>&1 && codex exec "Given this product context, propose a complete design direction:
- Visual thesis: one sentence describing mood, material, and energy
- Typography: specific font names (not defaults — no Inter/Roboto/Arial/system) + hex colors
- Color system: CSS variables for background, surface, primary text, muted text, accent
- Layout: composition-first, not component-first. First viewport as poster, not document
- Differentiation: 2 deliberate departures from category norms
- Anti-slop: no purple gradients, no 3-column icon grids, no centered everything, no decorative blobs
Be opinionated. Be specific. Do not hedge. This is YOUR design direction — own it." -C "$_REPO_ROOT" -s read-only -c 'model_reasoning_effort="medium"' --enable web_search_cached < /dev/null 2>"$TMPERR_DESIGN"
Use a 5-minute timeout (timeout: 300000). After the command completes, read stderr:
cat "$TMPERR_DESIGN" && rm -f "$TMPERR_DESIGN"
- Claude design subagent (via Agent tool): Dispatch a subagent with this prompt: "Given this product context, propose a design direction that would SURPRISE. What would the cool indie studio do that the enterprise UI team wouldn't?
- Propose an aesthetic direction, typography stack (specific font names), color palette (hex values)
- 2 deliberate departures from category norms
- What emotional reaction should the user have in the first 3 seconds?
Be bold. Be specific. No hedging."
Error handling (all non-blocking):
- Auth failure: If stderr contains "auth", "login", "unauthorized", or "API key": "Codex authentication failed. Run
codex loginto authenticate." - Timeout: "Codex timed out after 5 minutes."
- Empty response: "Codex returned no response."
- On any Codex error: proceed with Claude subagent output only, tagged
[single-model]. - If Claude subagent also fails: "Outside voices unavailable — continuing with primary review."
Present Codex output under a CODEX SAYS (design direction): header.
Present subagent output under a CLAUDE SUBAGENT (design direction): header.
Synthesis: Claude main references both Codex and subagent proposals in the Phase 3 proposal. Present:
- Areas of agreement between all three voices (Claude main + Codex + subagent)
- Genuine divergences as creative alternatives for the user to choose from
- "Codex and I agree on X. Codex suggested Y where I'm proposing Z — here's why..."
Log the result:
true # vibe-review-log '{"skill":"design-outside-voices","timestamp":"'"$(date -u +%Y-%m-%dT%H:%M:%SZ)"'","status":"STATUS","source":"SOURCE","commit":"'"$(git rev-parse --short HEAD)"'"}'
Replace STATUS with "clean" or "issues_found", SOURCE with "codex+subagent", "codex-only", "subagent-only", or "unavailable".
Phase 3: The Complete Proposal
This is the soul of the skill. Propose EVERYTHING as one coherent package.
AskUserQuestion Q2 — present the full proposal with SAFE/RISK breakdown:
Based on [product context] and [research findings / my design knowledge]:
AESTHETIC: [direction] — [one-line rationale]
DECORATION: [level] — [why this pairs with the aesthetic]
LAYOUT: [approach] — [why this fits the product type]
COLOR: [approach] + proposed palette (hex values) — [rationale]
TYPOGRAPHY: [3 font recommendations with roles] — [why these fonts]
SPACING: [base unit + density] — [rationale]
MOTION: [approach] — [rationale]
This system is coherent because [explain how choices reinforce each other].
SAFE CHOICES (category baseline — your users expect these):
- [2-3 decisions that match category conventions, with rationale for playing safe]
RISKS (where your product gets its own face):
- [2-3 deliberate departures from convention]
- For each risk: what it is, why it works, what you gain, what it costs
The safe choices keep you literate in your category. The risks are where
your product becomes memorable. Which risks appeal to you? Want to see
different ones? Or adjust anything else?
The SAFE/RISK breakdown is critical. Design coherence is table stakes — every product in a category can be coherent and still look identical. The real question is: where do you take creative risks? The agent should always propose at least 2 risks, each with a clear rationale for why the risk is worth taking and what the user gives up. Risks might include: an unexpected typeface for the category, a bold accent color nobody else uses, tighter or looser spacing than the norm, a layout approach that breaks from convention, motion choices that add personality.
Options: A) Looks great — generate the preview page. B) I want to adjust [section]. C) I want different risks — show me wilder options. D) Start over with a different direction. E) Skip the preview, just write DESIGN.md.
Your Design Knowledge (use to inform proposals — do NOT display as tables)
Aesthetic directions (pick the one that fits the product):
- Brutally Minimal — Type and whitespace only. No decoration. Modernist.
- Maximalist Chaos — Dense, layered, pattern-heavy. Y2K meets contemporary.
- Retro-Futuristic — Vintage tech nostalgia. CRT glow, pixel grids, warm monospace.
- Luxury/Refined — Serifs, high contrast, generous whitespace, precious metals.
- Playful/Toy-like — Rounded, bouncy, bold primaries. Approachable and fun.
- Editorial/Magazine — Strong typographic hierarchy, asymmetric grids, pull quotes.
- Brutalist/Raw — Exposed structure, system fonts, visible grid, no polish.
- Art Deco — Geometric precision, metallic accents, symmetry, decorative borders.
- Organic/Natural — Earth tones, rounded forms, hand-drawn texture, grain.
- Industrial/Utilitarian — Function-first, data-dense, monospace accents, muted palette.
Decoration levels: minimal (typography does all the work) / intentional (subtle texture, grain, or background treatment) / expressive (full creative direction, layered depth, patterns)
Layout approaches: grid-disciplined (strict columns, predictable alignment) / creative-editorial (asymmetry, overlap, grid-breaking) / hybrid (grid for app, creative for marketing)
Color approaches: restrained (1 accent + neutrals, color is rare and meaningful) / balanced (primary + secondary, semantic colors for hierarchy) / expressive (color as a primary design tool, bold palettes)
Motion approaches: minimal-functional (only transitions that aid comprehension) / intentional (subtle entrance animations, meaningful state transitions) / expressive (full choreography, scroll-driven, playful)
Font recommendations by purpose:
- Display/Hero: Satoshi, General Sans, Instrument Serif, Fraunces, Clash Grotesk, Cabinet Grotesk
- Body: Instrument Sans, DM Sans, Source Sans 3, Geist, Plus Jakarta Sans, Outfit
- Data/Tables: Geist (tabular-nums), DM Sans (tabular-nums), JetBrains Mono, IBM Plex Mono
- Code: JetBrains Mono, Fira Code, Berkeley Mono, Geist Mono
Font blacklist (never recommend): Papyrus, Comic Sans, Lobster, Impact, Jokerman, Bleeding Cowboys, Permanent Marker, Bradley Hand, Brush Script, Hobo, Trajan, Raleway, Clash Display, Courier New (for body)
Overused fonts (never recommend as primary — use only if user specifically requests): Inter, Roboto, Arial, Helvetica, Open Sans, Lato, Montserrat, Poppins, Space Grotesk.
Space Grotesk is on the list specifically because every AI design tool converges on it as "the safe alternative to Inter." That's the convergence trap. Treat it the same as Inter: only use if the user asks for it by name.
Anti-convergence directive: Across multiple generations in the same project, VARY light/dark, fonts, and aesthetic directions. Never propose the same choices twice without explicit justification. If the user's prior session used Geist + dark + editorial, propose something different this time (or explicitly acknowledge you're doubling down because it fits the brief). Convergence across generations is slop.
AI slop anti-patterns (never include in your recommendations):
- Purple/violet gradients as default accent
- 3-column feature grid with icons in colored circles
- Centered everything with uniform spacing
- Uniform bubbly border-radius on all elements
- Gradient buttons as the primary CTA pattern
- Generic stock-photo-style hero sections
- system-ui / -apple-system as the primary display or body font (the "I gave up on typography" signal)
- "Built for X" / "Designed for Y" marketing copy patterns
Coherence Validation
When the user overrides one section, check if the rest still coheres. Flag mismatches with a gentle nudge — never block:
- Brutalist/Minimal aesthetic + expressive motion → "Heads up: brutalist aesthetics usually pair with minimal motion. Your combo is unusual — which is fine if intentional. Want me to suggest motion that fits, or keep it?"
- Expressive color + restrained decoration → "Bold palette with minimal decoration can work, but the colors will carry a lot of weight. Want me to suggest decoration that supports the palette?"
- Creative-editorial layout + data-heavy product → "Editorial layouts are gorgeous but can fight data density. Want me to show how a hybrid approach keeps both?"
- Always accept the user's final choice. Never refuse to proceed.
Phase 4: Drill-downs (only if user requests adjustments)
When the user wants to change a specific section, go deep on that section:
- Fonts: Present 3-5 specific candidates with rationale, explain what each evokes, offer the preview page
- Colors: Present 2-3 palette options with hex values, explain the color theory reasoning
- Aesthetic: Walk through which directions fit their product and why
- Layout/Spacing/Motion: Present the approaches with concrete tradeoffs for their product type
Each drill-down is one focused AskUserQuestion. After the user decides, re-check coherence with the rest of the system.
Phase 5: Design System Preview (default ON)
This phase generates visual previews of the proposed design system. Two paths depending on whether the vibestack designer is available.
Path A: AI Mockups (if DESIGN_READY)
Generate AI-rendered mockups showing the proposed design system applied to realistic screens for this product. This is far more powerful than an HTML preview — the user sees what their product could actually look like.
eval "$(~/.vibestack/bin/vibe-slug 2>/dev/null)"
_DESIGN_DIR="$HOME/.vibestack/projects/$SLUG/designs/design-system-$(date +%Y%m%d)"
mkdir -p "$_DESIGN_DIR"
echo "DESIGN_DIR: $_DESIGN_DIR"
Construct a design brief from the Phase 3 proposal (aesthetic, colors, typography, spacing, layout) and the product context from Phase 1:
$D variants --brief "<product name: [name]. Product type: [type]. Aesthetic: [direction]. Colors: primary [hex], secondary [hex], neutrals [range]. Typography: display [font], body [font]. Layout: [approach]. Show a realistic [page type] screen with [specific content for this product].>" --count 3 --output-dir "$_DESIGN_DIR/"
Run quality check on each variant:
$D check --image "$_DESIGN_DIR/variant-A.png" --brief "<the original brief>"
Show each variant inline (Read tool on each PNG) for instant preview.
Before presenting to the user, self-gate: For each variant, ask yourself: "Would a human designer be embarrassed to put their name on this?" If yes, discard the variant and regenerate. This is a hard gate. A mediocre AI mockup is worse than no mockup. Embarrassment triggers include: purple gradient hero, 3-column SaaS grid, centered-everything, Inter body text, generic stock-photo vibe, system-ui font, gradient CTA button, bubble-radius everything. Any of those = reject and regenerate.
Tell the user: "I've generated 3 visual directions applying your design system to a realistic [product type] screen. Pick your favorite in the comparison board that just opened in your browser. You can also remix elements across variants."
Comparison Board + Feedback Loop
Create the comparison board and serve it over HTTP:
$D compare --images "$_DESIGN_DIR/variant-A.png,$_DESIGN_DIR/variant-B.png,$_DESIGN_DIR/variant-C.png" --output "$_DESIGN_DIR/design-board.html" --serve
This command generates the board HTML, starts an HTTP server on a random port,
and opens it in the user's default browser. Run it in the background with &
because the server needs to stay running while the user interacts with the board.
Parse the port from stderr output: SERVE_STARTED: port=XXXXX. You need this
for the board URL and for reloading during regeneration cycles.
PRIMARY WAIT: AskUserQuestion with board URL
After the board is serving, use AskUserQuestion to wait for the user. Include the board URL so they can click it if they lost the browser tab:
"I've opened a comparison board with the design variants: http://127.0.0.1:<PORT>/ — Rate them, leave comments, remix elements you like, and click Submit when you're done. Let me know when you've submitted your feedback (or paste your preferences here). If you clicked Regenerate or Remix on the board, tell me and I'll generate new variants."
Do NOT use AskUserQuestion to ask which variant the user prefers. The comparison board IS the chooser. AskUserQuestion is just the blocking wait mechanism.
After the user responds to AskUserQuestion:
Check for feedback files next to the board HTML:
$_DESIGN_DIR/feedback.json— written when user clicks Submit (final choice)$_DESIGN_DIR/feedback-pending.json— written when user clicks Regenerate/Remix/More Like This
if [ -f "$_DESIGN_DIR/feedback.json" ]; then
echo "SUBMIT_RECEIVED"
cat "$_DESIGN_DIR/feedback.json"
elif [ -f "$_DESIGN_DIR/feedback-pending.json" ]; then
echo "REGENERATE_RECEIVED"
cat "$_DESIGN_DIR/feedback-pending.json"
rm "$_DESIGN_DIR/feedback-pending.json"
else
echo "NO_FEEDBACK_FILE"
fi
The feedback JSON has this shape:
{
"preferred": "A",
"ratings": { "A": 4, "B": 3, "C": 2 },
"comments": { "A": "Love the spacing" },
"overall": "Go with A, bigger CTA",
"regenerated": false
}
If feedback.json found: The user clicked Submit on the board.
Read preferred, ratings, comments, overall from the JSON. Proceed with
the approved variant.
If feedback-pending.json found: The user clicked Regenerate/Remix on the board.
- Read
regenerateActionfrom the JSON ("different","match","more_like_B","remix", or custom text) - If
regenerateActionis"remix", readremixSpec(e.g.{"layout":"A","colors":"B"}) - Generate new variants with
$D iterateor$D variantsusing updated brief - Create new board:
$D compare --images "..." --output "$_DESIGN_DIR/design-board.html" - Reload the board in the user's browser (same tab):
curl -s -X POST http://127.0.0.1:PORT/api/reload -H 'Content-Type: application/json' -d '{"html":"$_DESIGN_DIR/design-board.html"}' - The board auto-refreshes. AskUserQuestion again with the same board URL to
wait for the next round of feedback. Repeat until
feedback.jsonappears.
If NO_FEEDBACK_FILE: The user typed their preferences directly in the
AskUserQuestion response instead of using the board. Use their text response
as the feedback.
POLLING FALLBACK: Only use polling if $D serve fails (no port available).
In that case, show each variant inline using the Read tool (so the user can see them),
then use AskUserQuestion:
"The comparison board server failed to start. I've shown the variants above.
Which do you prefer? Any feedback?"
After receiving feedback (any path): Output a clear summary confirming what was understood:
"Here's what I understood from your feedback: PREFERRED: Variant [X] RATINGS: [list] YOUR NOTES: [comments] DIRECTION: [overall]
Is this right?"
Use AskUserQuestion to verify before proceeding.
Save the approved choice:
echo '{"approved_variant":"<V>","feedback":"<FB>","date":"'$(date -u +%Y-%m-%dT%H:%M:%SZ)'","screen":"<SCREEN>","branch":"'$(git branch --show-current 2>/dev/null)'"}' > "$_DESIGN_DIR/approved.json"
After the user picks a direction:
- Use
$D extract --image "$_DESIGN_DIR/variant-<CHOSEN>.png"to analyze the approved mockup and extract design tokens (colors, typography, spacing) that will populate DESIGN.md in Phase 6. This grounds the design system in what was actually approved visually, not just what was described in text. - If the user wants to iterate further:
$D iterate --feedback "<user's feedback>" --output "$_DESIGN_DIR/refined.png"
Plan mode vs. implementation mode:
- If in plan mode: Add the approved mockup path (the full
$_DESIGN_DIRpath) and extracted tokens to the plan file under an "## Approved Design Direction" section. The design system gets written to DESIGN.md when the plan is implemented. - If NOT in plan mode: Proceed directly to Phase 6 and write DESIGN.md with the extracted tokens.
Path B: HTML Preview Page (fallback if DESIGN_NOT_AVAILABLE)
Generate a polished HTML preview page and open it in the user's browser. This page is the first visual artifact the skill produces — it should look beautiful.
PREVIEW_FILE="/tmp/design-consultation-preview-$(date +%s).html"
Write the preview HTML to $PREVIEW_FILE, then open it:
open "$PREVIEW_FILE"
Preview Page Requirements (Path B only)
The agent writes a single, self-contained HTML file (no framework dependencies) that:
- Loads proposed fonts from Google Fonts (or Bunny Fonts) via
<link>tags - Uses the proposed color palette throughout — dogfood the design system
- Shows the product name (not "Lorem Ipsum") as the hero heading
- Font specimen section:
- Each font candidate shown in its proposed role (hero heading, body paragraph, button label, data table row)
- Side-by-side comparison if multiple candidates for one role
- Real content that matches the product (e.g., civic tech → government data examples)
- Color palette section:
- Swatches with hex values and names
- Sample UI components rendered in the palette: buttons (primary, secondary, ghost), cards, form inputs, alerts (success, warning, error, info)
- Background/text color combinations showing contrast
- Realistic product mockups — this is what makes the preview page powerful. Based on the project type from Phase 1, render 2-3 realistic page layouts using the full design system:
- Dashboard / web app: sample data table with metrics, sidebar nav, header with user avatar, stat cards
- Marketing site: hero section with real copy, feature highlights, testimonial block, CTA
- Settings / admin: form with labeled inputs, toggle switches, dropdowns, save button
- Auth / onboarding: login form with social buttons, branding, input validation states
- Use the product name, realistic content for the domain, and the proposed spacing/layout/border-radius. The user should see their product (roughly) before writing any code.
- Light/dark mode toggle using CSS custom properties and a JS toggle button
- Clean, professional layout — the preview page IS a taste signal for the skill
- Responsive — looks good on any screen width
The page should make the user think "oh nice, they thought of this." It's selling the design system by showing what the product could feel like, not just listing hex codes and font names.
If open fails (headless environment), tell the user: "I wrote the preview to [path] — open it in your browser to see the fonts and colors rendered."
If the user says skip the preview, go directly to Phase 6.
Phase 6: Write DESIGN.md & Confirm
If $D extract was used in Phase 5 (Path A), use the extracted tokens as the primary source for DESIGN.md values — colors, typography, and spacing grounded in the approved mockup rather than text descriptions alone. Merge extracted tokens with the Phase 3 proposal (the proposal provides rationale and context; the extraction provides exact values).
If in plan mode: Write the DESIGN.md content into the plan file as a "## Proposed DESIGN.md" section. Do NOT write the actual file — that happens at implementation time.
If NOT in plan mode: Write DESIGN.md to the repo root with this structure:
# Design System — [Project Name]
## Product Context
- **What this is:** [1-2 sentence description]
- **Who it's for:** [target users]
- **Space/industry:** [category, peers]
- **Project type:** [web app / dashboard / marketing site / editorial / internal tool]
## Aesthetic Direction
- **Direction:** [name]
- **Decoration level:** [minimal / intentional / expressive]
- **Mood:** [1-2 sentence description of how the product should feel]
- **Reference sites:** [URLs, if research was done]
## Typography
- **Display/Hero:** [font name] — [rationale]
- **Body:** [font name] — [rationale]
- **UI/Labels:** [font name or "same as body"]
- **Data/Tables:** [font name] — [rationale, must support tabular-nums]
- **Code:** [font name]
- **Loading:** [CDN URL or self-hosted strategy]
- **Scale:** [modular scale with specific px/rem values for each level]
## Color
- **Approach:** [restrained / balanced / expressive]
- **Primary:** [hex] — [what it represents, usage]
- **Secondary:** [hex] — [usage]
- **Neutrals:** [warm/cool grays, hex range from lightest to darkest]
- **Semantic:** success [hex], warning [hex], error [hex], info [hex]
- **Dark mode:** [strategy — redesign surfaces, reduce saturation 10-20%]
## Spacing
- **Base unit:** [4px or 8px]
- **Density:** [compact / comfortable / spacious]
- **Scale:** 2xs(2) xs(4) sm(8) md(16) lg(24) xl(32) 2xl(48) 3xl(64)
## Layout
- **Approach:** [grid-disciplined / creative-editorial / hybrid]
- **Grid:** [columns per breakpoint]
- **Max content width:** [value]
- **Border radius:** [hierarchical scale — e.g., sm:4px, md:8px, lg:12px, full:9999px]
## Motion
- **Approach:** [minimal-functional / intentional / expressive]
- **Easing:** enter(ease-out) exit(ease-in) move(ease-in-out)
- **Duration:** micro(50-100ms) short(150-250ms) medium(250-400ms) long(400-700ms)
## Decisions Log
| Date | Decision | Rationale |
|------|----------|-----------|
| [today] | Initial design system created | Created by /design-consultation based on [product context / research] |
Update CLAUDE.md (or create it if it doesn't exist) — append this section:
## Design System
Always read DESIGN.md before making any visual or UI decisions.
All font choices, colors, spacing, and aesthetic direction are defined there.
Do not deviate without explicit user approval.
In QA mode, flag any code that doesn't match DESIGN.md.
AskUserQuestion Q-final — show summary and confirm:
List all decisions. Flag any that used agent defaults without explicit user confirmation (the user should know what they're shipping). Options:
- A) Ship it — write DESIGN.md and CLAUDE.md
- B) I want to change something (specify what)
- C) Start over
After shipping DESIGN.md, if the session produced screen-level mockups or page layouts (not just system-level tokens), suggest: "Want to see this design system as working Pretext-native HTML? Run /design-html."
{{include lib/snippets/capture-learnings.md}}
Important Rules
- Propose, don't present menus. You are a consultant, not a form. Make opinionated recommendations based on the product context, then let the user adjust.
- Every recommendation needs a rationale. Never say "I recommend X" without "because Y."
- Coherence over individual choices. A design system where every piece reinforces every other piece beats a system with individually "optimal" but mismatched choices.
- Never recommend blacklisted or overused fonts as primary. If the user specifically requests one, comply but explain the tradeoff.
- The preview page must be beautiful. It's the first visual output and sets the tone for the whole skill.
- Conversational tone. This isn't a rigid workflow. If the user wants to talk through a decision, engage as a thoughtful design partner.
- Accept the user's final choice. Nudge on coherence issues, but never block or refuse to write a DESIGN.md because you disagree with a choice.
- No AI slop in your own output. Your recommendations, your preview page, your DESIGN.md — all should demonstrate the taste you're asking the user to adopt.
Capabilities
Install
Quality
deterministic score 0.46 from registry signals: · indexed on github topic:agent-skills · 15 github stars · SKILL.md body (33,587 chars)