Skip to Content

Brightdata

Description: Search, Crawl and Scrape any site, at scale, without getting blocked

Author: Meirk-Brightdata

Auth: API Key

PyPI VersionLicensePython VersionsWheel StatusDownloads

The Brightdata Server provides for scraping and extracting web content and structured data at scale. Main capabilities include:

  • Scrape web pages and return cleaned content in Markdown (ScrapeAsMarkdown).
  • Perform advanced web searches across Google, Bing, or Yandex with customizable parameters (SearchEngine).
  • Extract structured feeds from many site types (Amazon, LinkedIn, Instagram, Facebook, YouTube, Zillow, Booking, ZoomInfo, X, etc.), including products, reviews, profiles, posts, comments, listings, and videos (WebDataFeed). Note: do not fabricate links—use the search first if needed.

Available Tools

Tool NameDescription
Brightdata.ScrapeAsMarkdown Scrape a webpage and return content in Markdown format using Bright Data.
Brightdata.SearchEngine Search using Google, Bing, or Yandex with advanced parameters using Bright Data.
Brightdata.WebDataFeedExtract structured data from various websites like LinkedIn, Amazon, Instagram, etc.

If you need to perform an action that’s not listed here, you can get in touch with us to request a new , or create your own tools.

Brightdata.ScrapeAsMarkdown


Scrape a webpage and return content in Markdown format using Bright Data.

Parameters

  • url (string, required) URL to scrape

Secrets

This requires the following secrets: BRIGHTDATA_API_KEY, BRIGHTDATA_ZONE (learn how to configure secrets)

Brightdata.SearchEngine


Search using Google, Bing, or Yandex with advanced parameters using Bright Data.

Parameters

  • query (string, required) Search query
  • engine (Enum SearchEngine, optional) Search engine to use
  • language (string, optional) Two-letter language code
  • country_code (string, optional) Two-letter country code
  • search_type (Enum SearchType, optional) Type of search
  • start (integer, optional) Results pagination offset
  • num_results (integer, optional) Number of results to return. The default is 10
  • location (string, optional) Location for search results
  • device (Enum DeviceType, optional) Device type
  • return_json (boolean, optional) Return JSON instead of Markdown

Secrets

This requires the following secrets: BRIGHTDATA_API_KEY, BRIGHTDATA_ZONE (learn how to configure secrets)

Brightdata.WebDataFeed


Extract structured data from various websites like LinkedIn, Amazon, Instagram, etc.

Parameters

  • source_type (Enum SourceType, required) Type of data source
  • url (string, required) URL of the web resource to extract data from
  • num_of_reviews (integer, optional) Number of reviews to retrieve. Only applicable for facebook_company_reviews. Default is None
  • timeout (integer, optional) Maximum time in seconds to wait for data retrieval
  • polling_interval (integer, optional) Time in seconds between polling attempts

Secrets

This requires the following secrets: BRIGHTDATA_API_KEY (learn how to configure secrets)

Secrets

This requires the following secrets:

  • BRIGHTDATA_API_KEY
  • BRIGHTDATA_ZONE

Auth

The Arcade Bright Data Server uses Bright Data  to access proxy networks and web scraping infrastructure.

Global Environment Variables:

  • BRIGHTDATA_API_KEY: Your Bright Data . You can generate this from your Bright Data dashboard  under Settings → API Access.

  • BRIGHTDATA_ZONE: Your Bright Data zone name (e.g., residential_proxy1). This is the zone identifier you created in your Bright Data dashboard under Proxies & Scraping Infrastructure → Zones.

How to get your credentials:

  1. : Navigate to your Bright Data Control Panel  → Settings → API Access → Generate API Token
  2. Zone: Go to Zones section in your dashboard, find your zone name in the format shown in the zone username: brd-customer-{customer_id}-zone-{zone_name}

For more details, see the Bright Data API Documentation .

Reference

Below is a reference of enumerations used by some of the tools in the Brightdata Server:

SearchEngine

  • GOOGLE: google
  • BING: bing
  • YANDEX: yandex

SearchType

  • IMAGES: images
  • SHOPPING: shopping
  • NEWS: news
  • JOBS: jobs

DeviceType

  • MOBILE: mobile
  • IOS: ios
  • IPHONE: iphone
  • IPAD: ipad
  • ANDROID: android
  • ANDROID_TABLET: android_tablet

SourceType

  • AMAZON_PRODUCT: amazon_product
  • AMAZON_PRODUCT_REVIEWS: amazon_product_reviews
  • LINKEDIN_PERSON_PROFILE: linkedin_person_profile
  • LINKEDIN_COMPANY_PROFILE: linkedin_company_profile
  • ZOOMINFO_COMPANY_PROFILE: zoominfo_company_profile
  • INSTAGRAM_PROFILES: instagram_profiles
  • INSTAGRAM_POSTS: instagram_posts
  • INSTAGRAM_REELS: instagram_reels
  • INSTAGRAM_COMMENTS: instagram_comments
  • FACEBOOK_POSTS: facebook_posts
  • FACEBOOK_MARKETPLACE_LISTINGS: facebook_marketplace_listings
  • FACEBOOK_COMPANY_REVIEWS: facebook_company_reviews
  • X_POSTS: x_posts
  • ZILLOW_PROPERTIES_LISTING: zillow_properties_listing
  • BOOKING_HOTEL_LISTINGS: booking_hotel_listings
  • YOUTUBE_VIDEOS: youtube_videos
Last updated on