AI Skill
Review
Audit score 70

runcomfy-cli

agentspace-so/runcomfy-agent-skills

One CLI to run any RunComfy model — image, video, edit, lip-sync, face-swap, LoRA training, and more.

What is runcomfy-cli?

runcomfy-cli is a skill that teaches coding agents how to install and use the `runcomfy` CLI binary to call any model on the RunComfy platform. It covers installation, authentication, model discovery, job submission, polling, output downloading, and scripting with JSON output mode. This skill is the foundation that sibling skills (image generation, video generation, face-swap, etc.) all build on.

  • Installs the `runcomfy` CLI via npm or npx
  • Authenticates via browser device-code flow or RUNCOMFY_TOKEN env var
  • Submits requests to any RunComfy model endpoint using `runcomfy run <vendor>/<model>/<endpoint>`
  • Polls for job status and downloads output files automatically
  • Supports no-wait submission and manual status checks via `runcomfy status <request_id>`
  • Outputs machine-readable JSON for scripting and piping

How to install runcomfy-cli

npx skills add https://github.com/agentspace-so/runcomfy-agent-skills --skill runcomfy-cli
Prerequisites
  • Node.js installed (for npm/npx install method)
  • A RunComfy account at runcomfy.com
  • A RunComfy API token (for CI/headless environments)
  • The `runcomfy` CLI installed globally or via npx
Claude Code
Cursor
Windsurf
Cline

How to use runcomfy-cli

  1. 1.Install the skill: `npx skills add agentspace-so/runcomfy-agent-skills --skill runcomfy-cli -g`
  2. 2.Install the CLI: `npm i -g @runcomfy/cli` or use `npx -y @runcomfy/cli`
  3. 3.Confirm installation: `runcomfy --version`
  4. 4.Authenticate: run `runcomfy login` (browser flow) or set `RUNCOMFY_TOKEN` env var for CI
  5. 5.Verify identity: `runcomfy whoami`
  6. 6.Browse available models at runcomfy.com/models and find the model ID and input schema from its API tab
  7. 7.Submit a job: `runcomfy run <vendor>/<model>/<endpoint> --input '{...}'`
  8. 8.For async use, add `--no-wait` to get a request_id, then check with `runcomfy status <request_id>`

Use cases

Good for
  • Generate images from text prompts using models like GPT Image 2 or Flux Kontext
  • Automate video generation or image-to-video jobs from a script or CI pipeline
  • Run face-swap, lip-sync, inpainting, or outpainting jobs from the terminal
  • Poll async jobs in CI containers using RUNCOMFY_TOKEN without a browser
  • Pipe JSON results into downstream tools using --output json and --no-download flags
Who it's for
  • Developers who want to call RunComfy AI models from scripts or terminals
  • CI/CD engineers automating media generation pipelines
  • Coding agents (Claude Code, Cursor) using sibling runcomfy-* skills
  • Anyone who prefers CLI over the RunComfy web UI for batch or programmatic use

runcomfy-cli FAQ

How do I authenticate in a CI environment without a browser?

Set the RUNCOMFY_TOKEN environment variable to a token from your RunComfy profile page. This overrides the local token file entirely.

How do I skip downloading output files and just get the result JSON?

Use the `--no-download` flag with `runcomfy run`. Add `--output json` to get machine-readable JSON suitable for piping.

How do I find the correct model ID and input schema?

Browse runcomfy.com/models, open a model's detail page, and check its API tab for the exact endpoint path and input JSON schema.

Can I submit a job without waiting for it to finish?

Yes. Use `--no-wait` to submit and immediately receive a `request_id`, then poll manually with `runcomfy status <request_id>`.

Do sibling skills like ai-image-generation or face-swap require this skill?

Yes. All sibling runcomfy-* skills dispatch through this CLI, so runcomfy-cli must be installed and authenticated first.

Full instructions (SKILL.md)

Source of truth, from agentspace-so/runcomfy-agent-skills.


name: runcomfy-cli displayName: "RunComfy CLI" allowed-tools: Bash(runcomfy *) description: > Run any model on RunComfy from the command line. The runcomfy CLI is one binary, one auth, hundreds of model endpoints — image generation, image edit, video generation, image-to-video, lip-sync, face swap, video edit, inpainting, outpainting, extend, ControlNet, relight, upscale, LoRA training and more. Submit a request, poll for status, download the output. This skill teaches the agent how to install, authenticate, discover model schemas, invoke models, stream / poll / no-wait, script in JSON output mode, and handle errors. Triggers on "runcomfy cli", "install runcomfy", "runcomfy login", "runcomfy run", "runcomfy whoami", "runcomfy api", or any explicit ask to call a RunComfy model from a script or terminal. Sibling skills (ai-image-generation, ai-video-generation, image-edit, video-edit, face-swap, lipsync, image-to-video, image-inpainting, image-outpainting, video-extend, controlnet-pose, relight) all dispatch through this CLI. homepage: https://www.runcomfy.com license: MIT

RunComfy CLI

One binary, one auth, every RunComfy model. Install once, sign in once, then call any text-to-image, video, edit, lip-sync, face-swap, or LoRA-training endpoint with runcomfy run <model_id> --input '{...}'. This skill is the foundation every other runcomfy-* skill builds on.

runcomfy.com · CLI docs · All models

Install this skill

npx skills add agentspace-so/runcomfy-agent-skills --skill runcomfy-cli -g

Install the CLI

Pick one:

# Global install via npm (recommended for repeat use)
npm i -g @runcomfy/cli

# Zero-install one-shot (no Node global state)
npx -y @runcomfy/cli --version

A standalone curl-pipe installer also exists for environments without Node — see docs.runcomfy.com/cli/install. Inspect any install script before piping it into a shell. This skill only invokes the CLI via Bash(runcomfy *) after you have installed it through one of the verified package managers above.

Confirm:

runcomfy --version

Full options on the Install page.

Sign in

Interactive (opens browser):

runcomfy login
# Code shown in terminal — paste into the browser page, click Authorize
# Token saved to ~/.config/runcomfy/token.json with mode 0600

CI / containers (no browser):

export RUNCOMFY_TOKEN=<token-from-runcomfy.com/profile>

Verify:

runcomfy whoami
# 📛 you@example.com
#    token type: cli
#    user id: ...

Full flow + token rotation: Authentication.

Run a model

The general shape:

runcomfy run <vendor>/<model>/<endpoint> \
  --input '<JSON body>' \
  --output-dir <path>

Example — generate an image with GPT Image 2:

runcomfy run openai/gpt-image-2/text-to-image \
  --input '{"prompt": "a small purple cat at sunset, photorealistic"}'

You will see:

⏳ Submitting request to openai/gpt-image-2/text-to-image
   request_id: 8a3f...
⏳ Polling status (every 2s)...
   in_queue
   in_progress
   completed
✅ completed
{
  "images": [
    "https://playgrounds-storage-public.runcomfy.net/.../result.png"
  ]
}
📥 Downloading 1 file(s) to .
   ./result.png

By default the result is downloaded to the current directory. Override with --output-dir ./out, skip downloading with --no-download.

Quickstart: docs.runcomfy.com/cli/quickstart.

Discover model schemas

Every model has an API tab on its detail page with the exact input schema. Browse the catalog:

open https://www.runcomfy.com/models

Or search by collection / capability:

URLWhat
/modelsAll featured models
/models/allThe full catalog
/models/collections/recently-addedFresh additions
/models/collections/nano-banana · /seedream · /flux-kontext · /kling · /seedance · /veo-3 · /wan-models · /hailuo · /qwen-imageCurated brand collections
/models/feature/lip-syncLip-sync capability
/models/feature/character-swapCharacter / face swap
/models/feature/upscale-videoVideo upscalers

Commands

runcomfy run <model_id>

Synchronous run — submit, poll, download.

FlagWhat
--input '<JSON>'Inline JSON body. Strings can contain newlines; quote-escape as needed
--input-file <path>Read body from a file (JSON or YAML by extension)
--output-dir <path>Where to download result files (default: cwd)
--no-downloadSkip the download step; only print the result JSON
--no-waitSubmit and return request_id immediately; don't poll
--timeout <seconds>Cap the polling wait. Default: model-dependent
--output jsonPrint machine-readable JSON for piping (default human-readable)
--quietSuppress progress, keep only the final result line

runcomfy login / runcomfy whoami / runcomfy logout

login runs the device-code flow; whoami prints the active identity; logout removes the local token file. Set RUNCOMFY_TOKEN env var to override the file entirely.

runcomfy status <request_id>

Check status of a --no-wait job:

RID=$(runcomfy --output json run google/nano-banana-2/text-to-image \
  --input '{"prompt": "..."}' --no-wait | jq -r .request_id)

runcomfy status "$RID"

Full command reference: docs.runcomfy.com/cli/commands.

Scripting patterns

Pipe-friendly JSON

runcomfy --output json run openai/gpt-image-2/text-to-image \
  --input '{"prompt": "X"}' \
  --no-download \
| jq -r '.images[0]'

Batch from a file of prompts

while IFS= read -r prompt; do
  runcomfy run blackforestlabs/flux-2-klein/9b/text-to-image \
    --input "$(jq -nc --arg p "$prompt" '{prompt:$p, steps:8}')" \
    --output-dir "./out/$(date +%s%N)"
done < prompts.txt

Submit now, poll later

# Submit one or many jobs without blocking
RID=$(runcomfy --output json run bytedance/seedance-v2/pro \
  --input '{"prompt": "..."}' --no-wait | jq -r .request_id)

# Later — possibly from a different shell:
runcomfy status "$RID"

Retry on transient failure

The CLI returns exit code 75 on retryable errors (timeout, 429). Wrap with a shell retry loop:

for i in 1 2 3; do
  runcomfy run <model_id> --input '{...}' && break
  rc=$?
  [ $rc -eq 75 ] && sleep $((2**i)) && continue
  exit $rc
done

Exit codes

codemeaningretry?
0success
64bad CLI argsno
65bad input JSON / schema mismatchno
69upstream 5xxyes (after backoff)
75retryable: timeout / 429yes
77not signed in or token rejectedno — re-auth
130interrupted (Ctrl-C); remote request is cancelled before exit

Full reference: docs.runcomfy.com/cli/troubleshooting.

How it works

The CLI does three things for each run call:

  1. Submit — POSTs the JSON body to model-api.runcomfy.net with your bearer token.
  2. Poll — GETs the request every ~2s until status is completed, failed, or canceled.
  3. Download — for each output URL under *.runcomfy.net / *.runcomfy.com, fetch into --output-dir.

Ctrl-C sends DELETE to the request endpoint to cancel the remote job before exit, so you don't get billed for work you abandoned.

Security & Privacy

  • Install via verified package manager only. This skill recommends npm i -g @runcomfy/cli or npx -y @runcomfy/cli. A standalone curl-pipe installer exists in the official docs but agents must not pipe an arbitrary remote script into a shell on the user's behalf — if the user wants the curl path, they should review the script themselves first.
  • Token storage: runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600 (owner-only read/write). Set RUNCOMFY_TOKEN env var to bypass the file entirely in CI / containers. Never log the token, never echo it into prompts, never check it into a repo.
  • Input boundary (shell injection): prompts are passed as a JSON string via --input. The CLI does not shell-expand prompt content; it transmits the JSON body directly to the Model API over HTTPS. There is no shell-injection surface from prompt content, even when the prompt contains backticks, quotes, or $(...) patterns.
  • Indirect prompt injection (third-party content): image / audio / video URLs and enable_web_search outputs are untrusted. They are fetched by the RunComfy model server and can influence generation through embedded instructions inside the asset (e.g. text painted into an image, hidden instructions in EXIF, web-search results steering style). Mitigations the agent should apply:
    • Only ingest URLs the user explicitly provided for this task. Don't auto-resolve URLs the user pasted in unrelated context.
    • When generation behavior diverges from the prompt, suspect the reference asset, not the prompt.
    • For enable_web_search, default to false; set true only when the user names a real-world entity that requires grounding.
  • Outbound endpoints (allowlist): only model-api.runcomfy.net (request submission) and *.runcomfy.net / *.runcomfy.com (download whitelist for generated outputs). No telemetry. No callbacks to third parties.
  • Generated-file size cap: the CLI aborts any single download > 2 GiB to prevent disk-fill from a runaway model output.
  • Scope of this skill's bash usage: declared allowed-tools: Bash(runcomfy *). The skill never instructs the agent to run anything other than runcomfy <subcommand>npm, curl, export RUNCOMFY_TOKEN=... lines in this document are install / one-time setup steps for the operator, not commands the skill itself executes on each call.

See also

Sibling intent-routed skills that all dispatch through this CLI: