PluginBench · Skill · Official · Audit: Fail (score 45)

apify-ultimate-scraper

apify/agent-skills

Universal AI-powered web scraper backed by ~100 Apify Actors across 15+ platforms, including Instagram, TikTok, LinkedIn, YouTube, Google Maps, and more.

What is apify-ultimate-scraper?

Apify Ultimate Scraper is an AI-driven data extraction tool that leverages ~100 Apify Actors to scrape data from 15+ platforms. Use it for lead generation, competitive intelligence, influencer discovery, brand monitoring, review analysis, SEO research, and any structured data extraction task.

  • Scrape data from 15+ platforms (Instagram, Facebook, TikTok, YouTube, LinkedIn, X/Twitter, Google Maps, Google Search, Google Trends, Reddit, Airbnb, Yelp, and more)
  • Access ~100 pre-built Actors via Apify CLI for specialized scraping tasks
  • Extract structured data in JSON or CSV format with machine-readable output
  • Support both blocking (synchronous) and async (long-running) scraping workflows
  • Authenticate via OAuth, environment variables, or .env files
  • Fetch and manage results through Apify dataset storage

How to install apify-ultimate-scraper

npx skills add https://github.com/apify/agent-skills --skill apify-ultimate-scraper
Prerequisites
  • Apify CLI v1.5.0 or higher (install via npm install -g apify-cli)
  • Apify account with authentication token (generate at https://console.apify.com/settings/integrations)
  • Authentication via OAuth (apify login), environment variable (APIFY_TOKEN), or .env file

How to use apify-ultimate-scraper

  1. Identify your target platform and use case, then consult actor-index.md to find the appropriate Actor.
  2. For multi-step workflows, read the relevant workflow guide (lead-generation.md, competitive-intel.md, etc.).
  3. Fetch the Actor's input schema using: apify actors info ACTOR_ID --user-agent apify-agent-skills/apify-ultimate-scraper --input --json 2>/dev/null
  4. Create an input.json file with your scraping parameters and run: apify actors call ACTOR_ID --input-file input.json --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null
  5. Fetch results from the dataset using: apify datasets get-items DATASET_ID --user-agent apify-agent-skills/apify-ultimate-scraper --format json
  6. For long-running scrapes, use apify actors start instead of call, then poll with apify runs info RUN_ID until status is SUCCEEDED.
  7. Save results to a file using the Write tool, named YYYY-MM-DD_descriptive-name.csv or .json.

Use cases

Good for
  • Lead generation: extract contact information and emails from business directories and social platforms
  • Competitor analysis: monitor competitor pricing, ads, and product listings across e-commerce sites
  • Influencer discovery: identify creators and analyze engagement metrics on social media platforms
  • Brand monitoring: track brand mentions, sentiment, and reputation across web and social channels
  • Review analysis: aggregate and analyze customer reviews and ratings from Yelp, Google Maps, and similar platforms
Who it's for
  • Data analysts and researchers needing multi-platform data extraction
  • Marketing professionals conducting competitive intelligence and brand monitoring
  • Sales teams performing lead generation and prospect research
  • SEO specialists gathering search engine and content data
  • Recruitment professionals sourcing candidates and job market data

apify-ultimate-scraper FAQ

How do I authenticate with Apify?

Use one of three methods: (1) OAuth via apify login (opens a browser), (2) an environment variable set with export APIFY_TOKEN=your_token_here, or (3) a .env file containing APIFY_TOKEN, loaded with source .env. Generate your token at https://console.apify.com/settings/integrations.

What platforms can I scrape?

The skill supports ~100 Actors across 15+ platforms including Instagram, Facebook, TikTok, YouTube, LinkedIn, X/Twitter, Google Maps, Google Search, Google Trends, Reddit, Airbnb, Yelp, and more. Use apify actors search to find Actors for other platforms.

Should I use blocking (call) or async (start) mode?

Use blocking mode (apify actors call) for quick lookups and smaller tasks. Use async mode (apify actors start) for large or long-running scrapes, then poll with apify runs info to check status.

How do I get my results?

After a run completes, fetch results using: apify datasets get-items DATASET_ID --format json (or --format csv). Results are also available in the Apify console at https://console.apify.com/storage/datasets/DATASET_ID.

What should I do if a command fails?

Check references/gotchas.md for common pitfalls specific to your Actor. Always pass --json for machine-readable output, --user-agent apify-agent-skills/apify-ultimate-scraper for telemetry, and 2>/dev/null to suppress stderr that breaks JSON parsing.

Full instructions (SKILL.md)

Source of truth, from apify/agent-skills.


name: apify-ultimate-scraper
description: Universal AI-powered web scraper for any platform. Scrape data from Instagram, Facebook, TikTok, YouTube, LinkedIn, X/Twitter, Google Maps, Google Search, Google Trends, Reddit, Airbnb, Yelp, and 15+ more platforms. Use for lead generation, brand monitoring, competitor analysis, influencer discovery, trend research, content analytics, audience analysis, review analysis, SEO intelligence, recruitment, or any data extraction task.

Universal web scraper

AI-driven data extraction from ~100 Actors across 15+ platforms via the Apify CLI.

Rules for every apify command:

  1. Pass --json for machine-readable output (stable across CLI versions).
  2. Pass --user-agent apify-agent-skills/apify-ultimate-scraper for telemetry attribution.
  3. Redirect stderr with 2>/dev/null (stderr contains progress messages that break JSON parsers).
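
The three rules can be applied in one place with a small wrapper. This is a minimal sketch, not part of the skill: the function name apify_json and the variable APIFY_SKILL_UA are hypothetical. It fits the commands shown in this document with --json; the datasets get-items and --readme examples elsewhere in this document are written without --json, so they are left outside the wrapper.

```shell
# Hypothetical helper (not part of the skill): wraps the apify CLI so that
# each call passes --json, the skill's user agent, and discards stderr.
APIFY_SKILL_UA="apify-agent-skills/apify-ultimate-scraper"

apify_json() {
  # "$@" is the subcommand plus its arguments,
  # e.g.: actors info ACTOR_ID --input
  apify "$@" --user-agent "$APIFY_SKILL_UA" --json 2>/dev/null
}
```

With this in scope, apify_json actors info "ACTOR_ID" --input expands to the schema command used in Step 2.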

Prerequisites

  • Apify CLI v1.5.0+ (npm install -g apify-cli)
  • Authenticated session (see below)

Authentication

If a CLI command fails with an auth error, authenticate using one of these methods:

  1. OAuth (interactive): apify login (opens browser)
  2. Environment variable: export APIFY_TOKEN=your_token_here
  3. From .env file: source .env (if the file contains APIFY_TOKEN=...)

Generate token: https://console.apify.com/settings/integrations
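
For non-interactive use, methods 2 and 3 can be combined into one check. The sketch below is an assumption about how one might script it, and ensure_apify_token is a hypothetical name. One detail worth knowing: a plain source .env only sets shell variables, so a line written as APIFY_TOKEN=... (without export) is not passed on to child processes such as the CLI. The sketch turns on allexport while sourcing to cover that case.

```shell
# Hypothetical helper (name and structure are assumptions, not part of the skill).
ensure_apify_token() {
  local env_file="${1:-.env}"

  # Method 2: token already present in the environment.
  if [ -n "${APIFY_TOKEN:-}" ]; then
    export APIFY_TOKEN
    return 0
  fi

  # Method 3: load it from a .env file. A bare name is prefixed with ./
  # because some shells resolve `. file` via PATH, not the current directory.
  case "$env_file" in
    */*) ;;
    *) env_file="./$env_file" ;;
  esac
  if [ -f "$env_file" ]; then
    set -a            # export every variable assigned while sourcing
    . "$env_file"
    set +a
  fi

  if [ -z "${APIFY_TOKEN:-}" ]; then
    echo "APIFY_TOKEN not found; run 'apify login' or export a token" >&2
    return 1
  fi
}
```

Calling ensure_apify_token before the first apify command returns non-zero when no token is available, which is a reasonable point to fall back to apify login.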

Workflow

Step 1: Understand goal and select Actor

Identify the target platform and use case. Read references/actor-index.md to find the right Actor.

If the task involves a multi-step pipeline, also read the matching workflow guide:

Task involves...                        Read
leads, contacts, emails, B2B            references/workflows/lead-generation.md
competitor, ads, pricing                references/workflows/competitive-intel.md
influencer, creator                     references/workflows/influencer-vetting.md
brand, mentions, sentiment              references/workflows/brand-monitoring.md
reviews, ratings, reputation            references/workflows/review-analysis.md
SEO, SERP, crawl, content, RAG          references/workflows/content-and-seo.md
analytics, engagement, performance      references/workflows/social-media-analytics.md
trends, keywords, hashtags              references/workflows/trend-research.md
jobs, recruiting, candidates            references/workflows/job-market-and-recruitment.md
real estate, listings, hotels           references/workflows/real-estate-and-hospitality.md
price monitoring, e-commerce, products  references/workflows/ecommerce-price-monitoring.md
contact enrichment, email extraction    references/workflows/contact-enrichment.md
knowledge base, RAG, LLM data feed      references/workflows/knowledge-base-and-rag.md
company research, due diligence         references/workflows/company-research.md

If no Actor matches in the index, search dynamically:

apify actors search "KEYWORDS" --user-agent apify-agent-skills/apify-ultimate-scraper --json --limit 10 2>/dev/null

From results: items[].username/items[].name (Actor ID), items[].title, items[].stats.totalUsers30Days, items[].currentPricingInfo.pricingModel.
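
The fields listed above can be pulled into one line per Actor for easier comparison. This is a sketch that assumes jq is installed and that the search output has the shape described above (an items array carrying those fields). The function name summarize_actor_search, the fallback values, and the sort order are my own choices.

```shell
# Hypothetical helper: reads `apify actors search ... --json` output on stdin
# and prints tab-separated rows: actor ID, title, users (30 days), pricing model.
# Rows are sorted by 30-day users, highest first.
summarize_actor_search() {
  jq -r '
    .items
    | sort_by(.stats.totalUsers30Days // 0)
    | reverse
    | .[]
    | [ "\(.username)/\(.name)",
        .title,
        (.stats.totalUsers30Days // 0),
        (.currentPricingInfo.pricingModel // "unknown") ]
    | @tsv'
}
```

Usage would be to pipe the search command above into summarize_actor_search.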

Step 2: Fetch Actor schema and check gotchas

Fetch the input schema dynamically:

apify actors info "ACTOR_ID" --user-agent apify-agent-skills/apify-ultimate-scraper --input --json 2>/dev/null

Also read references/gotchas.md to check for common pitfalls for the selected Actor.

For Actor documentation: apify actors info "ACTOR_ID" --user-agent apify-agent-skills/apify-ultimate-scraper --readme

Step 3: Configure and run

For simple lookups (e.g., "Nike's follower count"), skip the preference questions and go straight to running in quick answer mode.

For larger tasks, confirm output format (quick answer / CSV / JSON) and result count.

Standard run (blocking):

apify actors call "ACTOR_ID" --input-file input.json --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null

Prefer --input-file input.json for large or complex inputs. For tiny inputs, inline JSON is acceptable with shell quoting: --input '{"maxItems":10}'.

From output: .id (run ID), .status, .defaultDatasetId, .stats.durationMillis
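
A blocking run and the check of its output fields can be combined as follows. This is a sketch assuming jq is installed. The function name run_and_get_dataset_id is hypothetical, and treating any status other than SUCCEEDED as a failure is my assumption, not something the skill prescribes.

```shell
# Hypothetical helper: runs an Actor in blocking mode and prints the ID of its
# default dataset, but only if the run reports status SUCCEEDED.
run_and_get_dataset_id() {
  local actor_id="$1" input_file="$2" run_json status

  run_json="$(apify actors call "$actor_id" --input-file "$input_file" \
    --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null)" || return 1

  status="$(printf '%s' "$run_json" | jq -r '.status')"
  if [ "$status" != "SUCCEEDED" ]; then
    echo "run ended with status: $status" >&2
    return 1
  fi

  printf '%s' "$run_json" | jq -r '.defaultDatasetId'
}
```

The printed ID is the DATASET_ID expected by the fetch commands that follow.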

Fetch results:

apify datasets get-items DATASET_ID --user-agent apify-agent-skills/apify-ultimate-scraper --format json

For CSV: apify datasets get-items DATASET_ID --user-agent apify-agent-skills/apify-ultimate-scraper --format csv

Quick answer mode: Fetch results as JSON, pick top 5, present formatted in chat.

Save to file: Fetch results, use Write tool to save as YYYY-MM-DD_descriptive-name.csv or .json.
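
When running the commands by hand (outside an agent with a Write tool), the same naming convention can be produced in the shell. This is only a sketch. save_dataset is a hypothetical name, and it redirects the CLI output to a file, which is a stand-in for the Write tool step described above.

```shell
# Hypothetical helper: fetches a dataset and saves it as
# YYYY-MM-DD_descriptive-name.<format>, then prints the file name.
save_dataset() {
  local dataset_id="$1" name="$2" format="${3:-json}" out

  out="$(date +%F)_${name}.${format}"   # %F expands to YYYY-MM-DD
  apify datasets get-items "$dataset_id" \
    --user-agent apify-agent-skills/apify-ultimate-scraper \
    --format "$format" > "$out" || { rm -f "$out"; return 1; }

  printf '%s\n' "$out"
}
```

For example, save_dataset DATASET_ID nike-followers csv would write a file such as 2025-01-31_nike-followers.csv into the current directory (the date shown is illustrative).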

Large/long-running scrapes:

apify actors start "ACTOR_ID" --input-file input.json --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null

Poll: apify runs info RUN_ID --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null (check .status for SUCCEEDED).
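
The polling step can be sketched as a bounded loop. It assumes jq is installed. wait_for_run, the interval, and the poll limit are hypothetical choices. The text above only names SUCCEEDED, and the failure statuses in the case statement (FAILED, ABORTED, TIMED-OUT) reflect my understanding of Apify's run lifecycle, so check them against the Apify documentation.

```shell
# Hypothetical helper: polls a run until it succeeds, fails, or the poll
# limit is reached. Returns 0 on SUCCEEDED, 1 on a terminal failure status,
# 2 if the limit is hit while the run is still in progress.
wait_for_run() {
  local run_id="$1" interval="${2:-10}" max_polls="${3:-60}" status i=0

  while [ "$i" -lt "$max_polls" ]; do
    status="$(apify runs info "$run_id" \
      --user-agent apify-agent-skills/apify-ultimate-scraper --json 2>/dev/null \
      | jq -r '.status')"

    case "$status" in
      SUCCEEDED) return 0 ;;
      FAILED|ABORTED|TIMED-OUT)   # assumed terminal failure statuses
        echo "run $run_id ended with status: $status" >&2
        return 1 ;;
    esac

    sleep "$interval"
    i=$((i + 1))
  done

  echo "gave up waiting for run $run_id" >&2
  return 2
}
```

The run ID passed in is the .id field from the apify actors start output. Once wait_for_run returns 0, the results can be fetched with the datasets get-items commands shown earlier.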

Step 4: Deliver results

Report: result count, file location (if saved), key data fields, and links:

  • Dataset: https://console.apify.com/storage/datasets/DATASET_ID
  • Run: https://console.apify.com/actors/runs/RUN_ID

For multi-step workflows: suggest the next pipeline step from the workflow guide.

Troubleshooting

Common errors and pitfalls are documented in references/gotchas.md. Read it before running PPE (pay-per-event) Actors.

Related skills

More from apify/agent-skills and the wider catalog.

apify-actor-development

Official
apify/agent-skills

Develop, debug, and deploy Apify Actors - serverless cloud programs for web scraping, automation, and data processing. Use when creating new Actors, modifying existing ones, or troubleshooting Actor code.

8.6k installs

apify-actorization

Official
apify/agent-skills

Convert existing projects into Apify Actors - serverless cloud programs. Actorize JavaScript/TypeScript (SDK with Actor.init/exit), Python (async context manager), or any language (CLI wrapper). Use when migrating code to Apify, wrapping CLI tools as Actors, or adding Actor SDK to existing projects.

8.0k installs

apify-generate-output-schema

Official
apify/agent-skills

Generate output schemas (dataset_schema.json, output_schema.json, key_value_store_schema.json) for an Apify Actor by analyzing its source code. Use when creating or updating Actor output schemas.

5.8k installs · Audited

apify-market-research

Official
apify/agent-skills

Analyze market conditions, geographic opportunities, pricing, consumer behavior, and product validation across Google Maps, Facebook, Instagram, Booking.com, and TripAdvisor.

3.4k installs

apify-ecommerce

Official
apify/agent-skills

Scrape e-commerce data for pricing intelligence, customer reviews, and seller discovery across Amazon, Walmart, eBay, IKEA, and 50+ marketplaces. Use when user asks to monitor prices, track competitors, analyze reviews, research products, or find sellers.

3.2k installs

apify-lead-generation

Official
apify/agent-skills

Generates B2B/B2C leads by scraping Google Maps, websites, Instagram, TikTok, Facebook, LinkedIn, YouTube, and Google Search. Use when user asks to find leads, prospects, businesses, build lead lists, enrich contacts, or scrape profiles for sales outreach.

2.8k installs