PluginBench
Skill
Review
Audit score 70

website-to-hyperframes

heygen-com/hyperframes

Capture a website and create professional HyperFrames videos from it.

What is website-to-hyperframes?

Convert any website into a polished video by capturing its design, understanding its brand, and building animated compositions. Use this when a user provides a URL and wants a video—whether a product demo, social ad, feature announcement, or brand reel.

  • Capture website content and extract brand identity (colors, typography, voice, mood)
  • Generate concept-first storyboards with narrative arcs and beat-by-beat techniques
  • Create narration scripts with TTS integration (HeyGen, ElevenLabs, or Kokoro) and auto-timed captions
  • Build animated HTML compositions for each beat using the hyperframes framework
  • Validate output with linting, snapshots, and WCAG verification before delivery
  • Support multiple formats (landscape 1920×1080, portrait 1080×1920, square 1080×1080)

How to install website-to-hyperframes

npx skills add https://github.com/heygen-com/hyperframes --skill website-to-hyperframes
Prerequisites
  • Node.js and npm installed
  • HyperFrames skill installed via: npx skills add https://github.com/heygen-com/hyperframes --skill website-to-hyperframes
  • A valid website URL to capture
  • (Optional) TTS API key for narration (HeyGen, ElevenLabs, or Kokoro)
Claude Code
Cursor
Windsurf
Cline

How to use website-to-hyperframes

  1. 1.Provide a website URL and describe the video you want (type, duration, key message)
  2. 2.Agent captures the site and summarizes its brand identity, product, and visual style
  3. 3.Review and approve the storyboard and script (concept-first, beat-by-beat breakdown)
  4. 4.Choose TTS provider and narration style; agent generates audio and syncs captions
  5. 5.Agent builds animated HTML compositions for each beat following brand guidelines
  6. 6.Review snapshots and validation reports; agent fixes any issues before delivery
  7. 7.Access the localhost Studio project URL to preview, iterate, or render to MP4

Use cases

Good for
  • Turn a product website into a 30-second product demo video with full narration
  • Create a 15-second Instagram ad from a landing page with music and captions
  • Build a feature announcement video from a product page with animated beat-by-beat breakdowns
  • Generate a brand reel from a company website with optional voiceover and background music
  • Produce a launch teaser video from a startup site with minimal narration and high-energy pacing
Who it's for
  • Product marketers creating social ads and demo videos
  • Startup founders launching products or features
  • Marketing teams building brand reels and announcements
  • Content creators converting web content into video assets
  • Agencies producing client videos from existing websites

website-to-hyperframes FAQ

What if I just paste a URL without specifying video type or duration?

The agent will ask clarifying questions about video type (social ad, demo, announcement, etc.), target duration, and the one key message the video must communicate. These gates ensure the storyboard serves your actual goal.

Can I use this without narration?

Yes. If you don't request narration, the agent asks about background music and uses manual beat timings from the storyboard. The video can be entirely visual with music.

What happens in 'autonomous mode'?

Autonomous mode skips user-preference gates (TTS provider, voice, colors, music, captions) so the agent decides on your behalf. It does NOT skip quality verification gates—asset audits, per-beat HTML review, WCAG checks, and honest disclosure of what wasn't verified remain mandatory.

What formats and aspect ratios are supported?

Landscape (1920×1080, default), portrait (1080×1920 for Stories/TikTok), and square (1080×1080 for feed). Choose based on where the video will be posted.

How long does the workflow take?

Typical end-to-end is 6 steps: capture & brand analysis, design cheat sheet, strategy & messaging, storyboard & script approval, VO/timing/captions, and validation. Complexity depends on video length and iteration rounds.

Full instructions (SKILL.md)

Source of truth, from heygen-com/hyperframes.


name: website-to-hyperframes description: | Capture a website and create a HyperFrames video from it. Use when: (1) a user provides a URL and wants a video, (2) someone says "capture this site", "turn this into a video", "make a promo from my site", (3) the user wants a social ad, product tour, or any video based on an existing website, (4) the user shares a link and asks for any kind of video content. Even if the user just pastes a URL — this is the skill to use.

Website to HyperFrames

Capture a website, then produce a professional video from it.

Users say things like:

  • "Capture https://... and make me a 25-second product launch video"
  • "Turn this website into a 15-second social ad for Instagram"
  • "Create a 30-second product tour from https://..."

The workflow has 7 steps. Each produces an artifact that gates the next. By default it's collaborative — gates marked 💬 stop and ask the user. If the user signals autonomous mode ("decide for me", "surprise me"), 💬 user-preference gates are skipped; see step-2-brief.md for how that propagates.

Autonomous mode is NOT "skip all gates." Auto mode covers user-preference questions (TTS provider, voice, color emphasis, beat count, music yes/no, captions yes/no — where the agent decides on the user's behalf). It does NOT cover quality-verification gates. The following remain non-skippable in auto mode:

  • Asset Audit (Step 3) — viewing contact sheets and justifying USE/SKIP for each asset
  • Per-beat HTML read (Step 5) — structured evidence block per beat
  • DoD checklist (Step 6) — including animation-map, per-warning WCAG verification, audio/motion playback
  • Honest disclosure section (Step 6) — "What I did NOT verify" must appear in your final summary

If you find yourself reasoning "auto mode says bias toward action, so I'll skip X" — and X is a verification gate, not a preference question — that reasoning is wrong. Bias toward action applies to deciding what to build, not to deciding whether to verify.


Step 0: Capture & Understand the Brand

Read: references/step-0-capture.md

Capture the site, then read the extracted data to understand the brand and product — what it does, who it's for, what voice it speaks in, what mood it lives in. The captured assets are a brand toolkit for later, not the building blocks the video is made from.

Gate: Site summary printed — strategy-first (what the product does, who it's for, brand voice) before the asset / color / font inventory.


Step 1: Brand Identity

Read: references/step-1-design.md

Write DESIGN.md — a brand cheat sheet covering the visual identity: colors, typography, component styles, layout principles. Use design-styles.json for exact computed values.

Speed option: For fast-pacing videos (billboard-per-beat), DESIGN.md can be a 50-line summary of colors + fonts + do's/don'ts — not a 300-line document. The sub-agent prompt in Step 5 pastes brand values directly, so DESIGN.md depth only matters for complex compositions.

Gate: DESIGN.md exists (any length) with at minimum: color palette, font choices, and do's/don'ts.


Step 2: Strategy & Messaging

Read: references/step-2-brief.md, references/capabilities.md (scan the Table of Contents — deep-dive sections only as needed)

Align with the user on what the video must communicate before talking visuals or assets. Parse the user's prompt — they probably already gave you the video type and style. Ask only what's missing: the ONE thing this video must say, the narrative arc, and the audience.

Gate: Video type, duration, format, and — critically — the message and narrative arc are locked. Without those, Step 3 can't write a concept-first storyboard.


Step 3: Storyboard + Script 💬

Read: references/step-3-storyboard.md

Write the storyboard concept-first: message → narrative arc → beats that serve the arc → techniques per beat → brand accents pass at the end. Then write the narration script to match. Present both to the user with a beat-by-beat summary. Iterate until they approve.

Gate: STORYBOARD.md + SCRIPT.md exist AND the user has approved the plan.


Step 4: VO, Timing + Captions 💬

Read: references/step-4-vo.md

If Step 2 said no narration — ask about background music, then skip to Step 5. Otherwise: ask the user which TTS provider (HeyGen TTS, ElevenLabs, or Kokoro), generate audio, transcribe, map timestamps to beats. Then ask about captions.

Gate: Either (a) no narration was requested and storyboard has manual beat timings, or (b) narration.wav + transcript.json exist and beat timings updated with real durations.


Step 5: Build Compositions

Read: The hyperframes skill (load it — every rule matters) Read: references/step-5-build.md

Build index.html and compositions following the architecture and pacing chosen in the storyboard (Step 3). Sub-agents run hyperframes lint and hyperframes snapshot on each beat before reporting back.

Gate: Every compositions/beat-N.html has been read top-to-bottom by the main agent against DESIGN.md and STORYBOARD.md. The per-beat checklist lives in step-5-build.md.


Step 6: Validate & Deliver

Read: references/step-6-validate.md

Lint, validate, take snapshots scaled to video length (formula: max(beats × 3, ceil(duration_seconds / 2))), and review each one. Fix issues before delivering. Deliver the localhost Studio project URL — only render to MP4 on explicit user request.

Deliver something you're proud of. Before handing off, ask yourself: would I post this on social media with my name on it? If not, fix what's wrong.

Gate: npx hyperframes lint and npx hyperframes validate pass with zero errors, and the final response includes the active Studio project URL.


Quick Reference

Video Types

Typical constraints by video type — use as a starting point, not a formula. Beat count should follow from the content and the narration, not from a target range.

TypeTypical durationDuration driverNarration
Social ad (IG/TikTok)10–15sPlatform limitOptional
Product demo30–60sScript lengthFull narration
Feature announcement15–30sFeature complexityFull narration
Brand reel20–45sMusic trackOptional, music focus
Launch teaser10–20sHook energyMinimal

Beat count is not in this table intentionally — it should come from the storyboard, not from "social ad = 3-4 beats." A social ad for a complex product might need 5 well-timed beats. A brand reel with one strong visual thesis might need 3.

Format

  • Landscape: 1920x1080 (default)
  • Portrait: 1080x1920 (Instagram Stories, TikTok)
  • Square: 1080x1080 (Instagram feed)

Reference Files

FileWhen to read
step-0-capture.mdStep 0 — capture, understand the brand and product, write strategy-first site summary
step-1-design.mdStep 1 — write DESIGN.md brand cheat sheet (5 sections, 250-350 lines; 50-line fast-path for billboard-style social ads)
step-2-brief.mdStep 2 — align on message, narrative arc, audience with user
capabilities.mdSteps 2 & 5 — full inventory of what HyperFrames can do (24 sections). Scan the TOC during the brief, deep-dive specific sections during build
step-3-storyboard.mdStep 3 — storyboard + script (combined) with user review gate
step-4-vo.mdStep 4 — TTS provider choice, generation, timing
step-5-build.mdStep 5 — build index.html + compositions
step-6-validate.mdStep 6 — lint, validate, snapshots (scaled to video length), preview
techniques.mdSteps 3 & 5 — 13 primitive animation techniques with code patterns (adapt, don't copy-paste)
html-in-canvas-patterns.mdStep 5 — complete code patterns for HTML-in-Canvas effects (lives in the hyperframes skill)