Dokumentation (english)

Web Scraper

Scrape a URL and pass the extracted content to the next node, powered by Apify.

The Web Scraper node connects to your Apify account and runs a scraper actor for each URL it receives from an upstream node. The extracted content is passed downstream as structured JSON — ready to feed into an AI node, a CSV export, or any other step.

Setup

You need an Apify account and an API token. Connect it once in the node settings — the token is stored securely and reused for every run.

Configuration

FieldDescription
ScraperWhich Apify actor to use. See the Scrapers section below. Only applies when Smart URL Routing is off.
Smart URL RoutingWhen enabled, the node inspects the URL and automatically selects the best actor for that platform. Falls back to the Generic Web Scraper for all other URLs.

Scrapers

ScraperBest for
Generic Web Scraper (default)Any public website — company pages, blogs, landing pages, news articles
Twitter / XTweets and thread content from a twitter.com or x.com URL
YouTubeVideo metadata, description, channel info, view and like counts
Google MapsBusiness listings — name, address, rating, reviews, phone, website
Amazon ProductsProduct pages — title, price, brand, rating, review count, ASIN

When Smart URL Routing is on, the node detects the platform from the URL and picks the right scraper automatically. You don't need to set anything else.

Inputs & Outputs

NameDescription
Inputjson_dataJSON from an upstream node containing a url field to scrape
Outputaction_resultStructured JSON with the scraped content (fields vary by scraper — see below)

Output fields by scraper

Generic Web Scraper

FieldDescription
urlThe scraped URL
titlePage title
descriptionMeta description
authorPage author if present
keywordsMeta keywords
languageDetected language code
textFull plain-text content
markdownContent as Markdown

Twitter / X

FieldDescription
urlTweet URL
authorUsername
textTweet text
likesLike count
retweetsRetweet count
repliesReply count
posted_atPost timestamp

YouTube

FieldDescription
urlVideo URL
titleVideo title
channelChannel name
channel_urlChannel URL
subscribersSubscriber count
descriptionVideo description
viewsView count
likesLike count
durationVideo duration
published_atPublish date
hashtagsComma-separated hashtags

Google Maps

FieldDescription
urlPlace URL
titleBusiness name
addressFull address
ratingAverage rating score
reviews_countTotal review count
websiteBusiness website
phonePhone number
categoryBusiness category

Amazon Products

FieldDescription
urlProduct URL
titleProduct title
priceListed price
brandBrand name
ratingAverage star rating
reviews_countTotal review count
descriptionProduct description
asinAmazon product identifier

Tips

  • Connect a Google Sheets Trigger upstream to scrape one URL per new row automatically
  • Pass the output to a Use AI Model node to extract structured fields from the raw scraped content
  • Use Smart URL Routing when your sheet contains a mix of URLs from different platforms — the node picks the right scraper for each one
  • The Generic Web Scraper uses a lightweight HTTP crawler by default. It works well for most public pages and keeps costs low on the free Apify tier.
  • If a URL returns no results (e.g. a login-gated page), the node fails and stops the flow — the Flow Completion Event node can catch this and send you an alert

Command Palette

Search for a command to run...

Schnellzugriffe
STRG + KSuche
STRG + DNachtmodus / Tagmodus
STRG + LSprache ändern
STRG + BSidepanel umschalten

Software-Details
Kompiliert vor etwa 2 Stunden
Release: v4.0.0-production
Buildnummer: master@21f7890
Historie: 61 Items