
audit

pbakaus/impeccable

Run technical quality checks across accessibility, performance, theming, responsive design, and anti-patterns with severity ratings.

What is audit?

Systematically audits code-level quality across five dimensions: accessibility (WCAG compliance), performance (layout thrashing, animations, bundle size), theming (design tokens, dark mode), responsive design (breakpoints, touch targets), and anti-patterns (AI slop detection). Generates a scored report (0-20) with P0-P3 severity ratings and actionable recommendations. Use when you need a comprehensive technical quality review before release.

  • Scans accessibility issues: contrast, ARIA labels, keyboard navigation, semantic HTML, alt text, form accessibility
  • Detects performance problems: layout thrashing, expensive animations, missing lazy loading, bundle bloat, render inefficiencies
  • Evaluates theming implementation: hard-coded colors, dark mode variants, design token consistency, theme switching
  • Checks responsive design: fixed widths, touch target sizes, horizontal overflow, text scaling, breakpoint coverage
  • Identifies anti-patterns: AI slop tells (gradients, glassmorphism, generic fonts), design anti-patterns (gray-on-color, nested cards, redundant copy)
  • Generates scored report with health score (0-20), issue counts by severity, and prioritized fix recommendations

How to install audit

npx skills add https://github.com/pbakaus/impeccable --skill audit
Prerequisites
  • Install impeccable skill: npx skills add https://github.com/pbakaus/impeccable
  • Run /impeccable teach first to establish design context and anti-pattern guidelines
  • Have codebase accessible for analysis
Claude Code
Cursor
Windsurf
Cline

How to use audit

  1. Invoke /audit with optional area parameter (e.g., '/audit homepage' or '/audit checkout component')
  2. Review the Anti-Patterns Verdict first to assess if design looks AI-generated
  3. Check Audit Health Score and rating band (0-5 Critical through 18-20 Excellent)
  4. Read Executive Summary for total issue count by severity and top 3-5 critical issues
  5. Review Detailed Findings organized by severity (P0 Blocking → P1 Major → P2 Minor → P3 Polish)
  6. Note Patterns & Systemic Issues section for recurring problems across codebase
  7. Review Positive Findings to identify practices worth replicating
  8. Follow Recommended Actions in priority order, running suggested commands one at a time or in batch

Use cases

Good for
  • Audit a component before shipping to catch accessibility and performance regressions
  • Review a full page for WCAG AA compliance and responsive design gaps
  • Detect AI-generated design patterns and hard-coded styling that should use design tokens
  • Establish baseline quality score and track improvements across multiple audit runs
  • Identify systemic issues (e.g., touch targets too small across entire mobile experience)
Who it's for
  • Frontend developers performing quality assurance before release
  • Accessibility specialists checking WCAG compliance
  • Design system maintainers ensuring token usage and theming consistency
  • Product teams evaluating technical debt and prioritizing fixes
  • QA engineers establishing quality baselines

audit FAQ

What's the difference between P0, P1, P2, and P3 severity?

P0 Blocking prevents task completion and must be fixed immediately. P1 Major causes significant difficulty or WCAG AA violations and should be fixed before release. P2 Minor is an annoyance with a workaround and can be fixed in the next pass. P3 Polish has no real user impact and is fixed only if time permits.

Do I need to run /impeccable teach before using /audit?

Yes. The MANDATORY PREPARATION section requires invoking /impeccable first to establish design principles, anti-patterns, and context. If no design context exists, run /impeccable teach before proceeding with the audit.

What does the Audit Health Score mean?

The score ranges 0-20 across five dimensions (accessibility, performance, responsive design, theming, anti-patterns), each scored 0-4. Rating bands are: 18-20 Excellent (minor polish), 14-17 Good (address weak dimensions), 10-13 Acceptable (significant work needed), 6-9 Poor (major overhaul), 0-5 Critical (fundamental issues).

Will /audit fix issues or just report them?

Audit only documents issues; it does not fix them. After reviewing the report, use the recommended commands (/animate, /optimize, /colorize, /adapt, etc.) to address specific findings. Re-run /audit afterward to verify improvements.

What are AI slop tells and why do they matter?

AI slop tells are design patterns commonly generated by AI: AI color palettes, gradient text, glassmorphism, hero metrics, card grids, and generic fonts. The Anti-Patterns dimension scores how many tells appear, from 0 (five or more tells) to 4 (no tells). They matter because they signal generic, undifferentiated design lacking intentionality.

Full instructions (SKILL.md)

Source of truth, from pbakaus/impeccable.


name: audit
description: Run technical quality checks across accessibility, performance, theming, responsive design, and anti-patterns. Generates a scored report with P0-P3 severity ratings and actionable plan. Use when the user wants an accessibility check, performance audit, or technical quality review.
version: 2.1.1
user-invocable: true
argument-hint: "[area (feature, page, component...)]"

MANDATORY PREPARATION

Invoke /impeccable — it contains design principles, anti-patterns, and the Context Gathering Protocol. Follow the protocol before proceeding — if no design context exists yet, you MUST run /impeccable teach first.


Run systematic technical quality checks and generate a comprehensive report. Don't fix issues — document them for other commands to address.

This is a code-level audit, not a design critique. Check what's measurable and verifiable in the implementation.

Diagnostic Scan

Run comprehensive checks across 5 dimensions. Score each dimension 0-4 using the criteria below.

1. Accessibility (A11y)

Check for:

  • Contrast issues: Text contrast ratios below 4.5:1 for normal text at WCAG AA (3:1 for large text), or below 7:1 for AAA
  • Missing ARIA: Interactive elements without proper roles, labels, or states
  • Keyboard navigation: Missing focus indicators, illogical tab order, keyboard traps
  • Semantic HTML: Improper heading hierarchy, missing landmarks, divs instead of buttons
  • Alt text: Missing or poor image descriptions
  • Form issues: Inputs without labels, poor error messaging, missing required indicators

Score 0-4: 0=Inaccessible (fails WCAG A), 1=Major gaps (few ARIA labels, no keyboard nav), 2=Partial (some a11y effort, significant gaps), 3=Good (WCAG AA mostly met, minor gaps), 4=Excellent (WCAG AA fully met, approaches AAA)
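
The contrast check above can be made concrete with the WCAG 2.x relative-luminance formula. This is a minimal sketch, not part of the skill itself: the function names are hypothetical, and it assumes opaque colors given as `#rrggbb` strings and the 4.5:1 AA threshold for normal-size text.

```python
def _linearize(channel: int) -> float:
    """Convert an 8-bit sRGB channel to linear light (WCAG 2.x definition)."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(hex_color: str) -> float:
    """Relative luminance of an opaque '#rrggbb' color."""
    h = hex_color.lstrip("#")
    r, g, b = (int(h[i:i + 2], 16) for i in (0, 2, 4))
    return 0.2126 * _linearize(r) + 0.7152 * _linearize(g) + 0.0722 * _linearize(b)


def contrast_ratio(foreground: str, background: str) -> float:
    """WCAG contrast ratio; the lighter color always goes in the numerator."""
    l1, l2 = relative_luminance(foreground), relative_luminance(background)
    lighter, darker = max(l1, l2), min(l1, l2)
    return (lighter + 0.05) / (darker + 0.05)


def passes_aa(foreground: str, background: str) -> bool:
    """True when normal-size text meets the 4.5:1 AA threshold."""
    return contrast_ratio(foreground, background) >= 4.5
```

For instance, `passes_aa("#777777", "#ffffff")` is false while `passes_aa("#767676", "#ffffff")` is true, which is why near-miss grays are worth computing rather than eyeballing.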

2. Performance

Check for:

  • Layout thrashing: Reading/writing layout properties in loops
  • Expensive animations: Animating layout properties (width, height, top, left) instead of transform/opacity
  • Missing optimization: Images without lazy loading, unoptimized assets, missing will-change
  • Bundle size: Unnecessary imports, unused dependencies
  • Render performance: Unnecessary re-renders, missing memoization

Score 0-4: 0=Severe issues (layout thrash, unoptimized everything), 1=Major problems (no lazy loading, expensive animations), 2=Partial (some optimization, gaps remain), 3=Good (mostly optimized, minor improvements possible), 4=Excellent (fast, lean, well-optimized)
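
One way to approximate the "expensive animations" check is a text scan of stylesheets for transitions that name layout properties. The sketch below is a rough heuristic under stated assumptions: `find_layout_transitions` is a hypothetical helper, it only looks at `transition` and `transition-property` declarations, and it does not catch `transition: all` or `@keyframes` rules.

```python
import re

# Layout properties called out in the checklist above.
LAYOUT_PROPERTIES = {"width", "height", "top", "left"}

# Matches `transition: ...` and `transition-property: ...` declarations.
TRANSITION_RE = re.compile(r"transition(?:-property)?\s*:\s*([^;}]+)", re.IGNORECASE)


def find_layout_transitions(css: str) -> list[str]:
    """Return layout properties named in transition declarations, in order of appearance."""
    hits = []
    for match in TRANSITION_RE.finditer(css):
        # Each comma-separated entry starts with the property being transitioned.
        for part in match.group(1).split(","):
            tokens = part.strip().split()
            if tokens and tokens[0].lower() in LAYOUT_PROPERTIES:
                hits.append(tokens[0].lower())
    return hits
```

A hit is a candidate finding, not a verdict: the auditor still needs to confirm the transition runs in practice before reporting it.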

3. Theming

Check for:

  • Hard-coded colors: Colors not using design tokens
  • Broken dark mode: Missing dark mode variants, poor contrast in dark theme
  • Inconsistent tokens: Using wrong tokens, mixing token types
  • Theme switching issues: Values that don't update on theme change

Score 0-4: 0=No theming (hard-coded everything), 1=Minimal tokens (mostly hard-coded), 2=Partial (tokens exist but inconsistently used), 3=Good (tokens used, minor hard-coded values), 4=Excellent (full token system, dark mode works perfectly)
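
Hard-coded colors can be surfaced with a simple scan for hex literals outside token definitions. This is an illustrative sketch only: `find_hard_coded_colors` is a hypothetical name, it assumes tokens are CSS custom properties declared one per line, and it ignores `rgb()`/`hsl()` values. An ID selector that happens to be valid hex (such as `#face`) would be a false positive.

```python
import re

# 8-, 6-, 4-, or 3-digit hex colors, e.g. #0055ffcc, #0055ff, #fffc, #fff.
HEX_COLOR_RE = re.compile(r"#(?:[0-9a-fA-F]{8}|[0-9a-fA-F]{6}|[0-9a-fA-F]{3,4})\b")


def find_hard_coded_colors(css: str) -> list[tuple[int, str]]:
    """Return (line_number, color) for hex colors outside custom-property definitions."""
    findings = []
    for number, line in enumerate(css.splitlines(), start=1):
        # A token definition such as `--color-primary: #0055ff;` is the one
        # place a literal color is expected, so skip those lines.
        if line.strip().startswith("--"):
            continue
        for match in HEX_COLOR_RE.finditer(line):
            findings.append((number, match.group(0)))
    return findings
```

The line numbers feed directly into the "Location" field of a finding.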

4. Responsive Design

Check for:

  • Fixed widths: Hard-coded widths that break on mobile
  • Touch targets: Interactive elements < 44x44px
  • Horizontal scroll: Content overflow on narrow viewports
  • Text scaling: Layouts that break when text size increases
  • Missing breakpoints: No mobile/tablet variants

Score 0-4: 0=Desktop-only (breaks on mobile), 1=Major issues (some breakpoints, many failures), 2=Partial (works on mobile, rough edges), 3=Good (responsive, minor touch target or overflow issues), 4=Excellent (fluid, all viewports, proper touch targets)
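
The touch-target rule reduces to a size comparison once element dimensions are known. The sketch below assumes the rendered sizes have already been measured somehow (for example from a browser's bounding boxes); the `Element` type and `undersized_touch_targets` function are hypothetical names for illustration.

```python
from dataclasses import dataclass

MIN_TOUCH_TARGET_PX = 44  # threshold from the checklist above


@dataclass(frozen=True)
class Element:
    selector: str
    width_px: float
    height_px: float


def undersized_touch_targets(elements: list[Element]) -> list[Element]:
    """Return interactive elements smaller than 44px in either dimension."""
    return [
        e for e in elements
        if e.width_px < MIN_TOUCH_TARGET_PX or e.height_px < MIN_TOUCH_TARGET_PX
    ]
```

An element fails if either dimension falls short, so a wide but 20px-tall checkbox row is still reported.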

5. Anti-Patterns (CRITICAL)

Check against ALL the DON'T guidelines in the impeccable skill. Look for AI slop tells (AI color palette, gradient text, glassmorphism, hero metrics, card grids, generic fonts) and general design anti-patterns (gray on color, nested cards, bounce easing, redundant copy).

Score 0-4: 0=AI slop gallery (5+ tells), 1=Heavy AI aesthetic (3-4 tells), 2=Some tells (1-2 noticeable), 3=Mostly clean (subtle issues only), 4=No AI tells (distinctive, intentional design)
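
The anti-pattern rubric maps a count of tells to a score. A minimal sketch of that mapping follows; `anti_pattern_score` is a hypothetical helper, and the `subtle_issues` flag is an assumption used to separate "mostly clean" (3) from "no AI tells" (4), since the rubric does not define that boundary by count.

```python
def anti_pattern_score(tell_count: int, subtle_issues: bool = False) -> int:
    """Map the number of noticeable AI slop tells to the 0-4 rubric score."""
    if tell_count < 0:
        raise ValueError("tell_count cannot be negative")
    if tell_count >= 5:
        return 0  # AI slop gallery
    if tell_count >= 3:
        return 1  # heavy AI aesthetic
    if tell_count >= 1:
        return 2  # some tells
    # No noticeable tells: subtle issues alone cap the score at 3.
    return 3 if subtle_issues else 4
```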

Generate Report

Audit Health Score

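| # | Dimension | Score | Key Finding |
|---|-----------|-------|-------------|
| 1 | Accessibility | ? | [most critical a11y issue or "--"] |
| 2 | Performance | ? | |
| 3 | Responsive Design | ? | |
| 4 | Theming | ? | |
| 5 | Anti-Patterns | ? | |
| Total | | ??/20 | [Rating band] |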

Rating bands: 18-20 Excellent (minor polish), 14-17 Good (address weak dimensions), 10-13 Acceptable (significant work needed), 6-9 Poor (major overhaul), 0-5 Critical (fundamental issues)
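
The total and rating band follow mechanically from the five dimension scores. The sketch below shows the arithmetic; `health_score` and the constant names are hypothetical, and the band floors are taken from the rating bands listed above.

```python
DIMENSIONS = (
    "Accessibility",
    "Performance",
    "Responsive Design",
    "Theming",
    "Anti-Patterns",
)

# (lowest total in band, label), highest band first.
RATING_BANDS = (
    (18, "Excellent"),
    (14, "Good"),
    (10, "Acceptable"),
    (6, "Poor"),
    (0, "Critical"),
)


def health_score(scores: dict[str, int]) -> tuple[int, str]:
    """Sum five 0-4 dimension scores and return (total out of 20, rating band)."""
    for name in DIMENSIONS:
        if name not in scores:
            raise ValueError(f"missing score for {name}")
        if not 0 <= scores[name] <= 4:
            raise ValueError(f"{name} score must be between 0 and 4")
    total = sum(scores[name] for name in DIMENSIONS)
    band = next(label for floor, label in RATING_BANDS if total >= floor)
    return total, band
```

For example, scores of 3, 2, 3, 4, and 2 sum to 14, which lands at the bottom of the Good band.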

Anti-Patterns Verdict

Start here. Pass/fail: Does this look AI-generated? List specific tells. Be brutally honest.

Executive Summary

  • Audit Health Score: ??/20 ([rating band])
  • Total issues found (count by severity: P0/P1/P2/P3)
  • Top 3-5 critical issues
  • Recommended next steps

Detailed Findings by Severity

Tag every issue with P0-P3 severity:

  • P0 Blocking: Prevents task completion — fix immediately
  • P1 Major: Significant difficulty or WCAG AA violation — fix before release
  • P2 Minor: Annoyance, workaround exists — fix in next pass
  • P3 Polish: Nice-to-fix, no real user impact — fix if time permits

For each issue, document:

  • [P?] Issue name
  • Location: Component, file, line
  • Category: Accessibility / Performance / Theming / Responsive / Anti-Pattern
  • Impact: How it affects users
  • WCAG/Standard: Which standard it violates (if applicable)
  • Recommendation: How to fix it
  • Suggested command: Which command to use (prefer: /animate, /quieter, /shape, /optimize, /adapt, /clarify, /layout, /distill, /delight, /audit, /harden, /polish, /bolder, /typeset, /critique, /colorize, /overdrive)
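
The per-issue fields above map naturally onto a small record type, which also makes the severity ordering and the Executive Summary counts easy to derive. This is a sketch under assumptions: the `Finding` class and helper names are hypothetical and not defined by the skill.

```python
from collections import Counter
from dataclasses import dataclass

SEVERITY_ORDER = ("P0", "P1", "P2", "P3")


@dataclass
class Finding:
    severity: str            # one of SEVERITY_ORDER
    name: str
    location: str            # component, file, line
    category: str            # Accessibility / Performance / Theming / Responsive / Anti-Pattern
    impact: str
    recommendation: str
    suggested_command: str
    standard: str = ""       # WCAG/Standard, if applicable


def sort_by_severity(findings: list[Finding]) -> list[Finding]:
    """Order findings P0 first; ties keep their original order."""
    return sorted(findings, key=lambda f: SEVERITY_ORDER.index(f.severity))


def count_by_severity(findings: list[Finding]) -> dict[str, int]:
    """Issue counts per severity, including zeros, for the summary line."""
    counts = Counter(f.severity for f in findings)
    return {severity: counts.get(severity, 0) for severity in SEVERITY_ORDER}
```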

Patterns & Systemic Issues

Identify recurring problems that indicate systemic gaps rather than one-off mistakes:

  • "Hard-coded colors appear in 15+ components, should use design tokens"
  • "Touch targets consistently too small (<44px) throughout mobile experience"
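
Separating systemic problems from one-off mistakes amounts to counting how many distinct components share the same issue. The sketch below assumes findings have been reduced to `(issue_name, component)` pairs; the `systemic_issues` name and the default threshold of three components are illustrative assumptions, not values from the skill.

```python
from collections import defaultdict


def systemic_issues(
    findings: list[tuple[str, str]], min_components: int = 3
) -> dict[str, int]:
    """Return {issue: distinct component count} for issues seen in at least min_components components."""
    components_by_issue: dict[str, set[str]] = defaultdict(set)
    for issue, component in findings:
        components_by_issue[issue].add(component)
    return {
        issue: len(components)
        for issue, components in components_by_issue.items()
        if len(components) >= min_components
    }
```

Counting distinct components rather than raw occurrences keeps one badly written file from looking like a codebase-wide pattern.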

Positive Findings

Note what's working well — good practices to maintain and replicate.

Recommended Actions

List recommended commands in priority order (P0 first, then P1, then P2):

  1. [P?] /command-name — Brief description (specific context from audit findings)
  2. [P?] /command-name — Brief description (specific context)

Rules: Only recommend commands from: /animate, /quieter, /shape, /optimize, /adapt, /clarify, /layout, /distill, /delight, /audit, /harden, /polish, /bolder, /typeset, /critique, /colorize, /overdrive. Map findings to the most appropriate command. End with /polish as the final step if any fixes were recommended.
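
The rules above (allowed commands only, priority order, /polish last) can be expressed as a small ordering step. This is a sketch, not the skill's implementation: `recommended_actions` is a hypothetical helper that takes `(severity, command)` pairs, and placing P3 items after P2 is an assumption, since the rule text only names P0 through P2.

```python
ALLOWED_COMMANDS = {
    "/animate", "/quieter", "/shape", "/optimize", "/adapt", "/clarify",
    "/layout", "/distill", "/delight", "/audit", "/harden", "/polish",
    "/bolder", "/typeset", "/critique", "/colorize", "/overdrive",
}
SEVERITY_ORDER = ("P0", "P1", "P2", "P3")


def recommended_actions(items: list[tuple[str, str]]) -> list[str]:
    """Order commands by severity and end with /polish when anything was recommended."""
    for _, command in items:
        if command not in ALLOWED_COMMANDS:
            raise ValueError(f"{command} is not an allowed command")
    # sorted() is stable, so same-severity items keep their input order.
    ordered = sorted(items, key=lambda item: SEVERITY_ORDER.index(item[0]))
    commands = [command for _, command in ordered]
    if commands and commands[-1] != "/polish":
        commands.append("/polish")
    return commands
```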

After presenting the summary, tell the user:

You can ask me to run these one at a time, all at once, or in any order you prefer.

Re-run /audit after fixes to see your score improve.

IMPORTANT: Be thorough but actionable. Too many P3 issues create noise. Focus on what actually matters.

NEVER:

  • Report issues without explaining impact (why does this matter?)
  • Provide generic recommendations (be specific and actionable)
  • Skip positive findings (celebrate what works)
  • Forget to prioritize (not everything can be P0)
  • Report false positives without verification

Remember: You're a technical quality auditor. Document systematically, prioritize ruthlessly, cite specific code locations, and provide clear paths to improvement.