Brightdata
Description: Search, Crawl and Scrape any site, at scale, without getting blocked
Author: Meirk-Brightdata
Auth: API Key
The Brightdata Server provides tools for scraping and extracting web content and structured data at scale. Main capabilities include:
- Scrape web pages and return cleaned content in Markdown (ScrapeAsMarkdown).
- Perform advanced web searches across Google, Bing, or Yandex with customizable parameters (SearchEngine).
- Extract structured feeds from many site types (Amazon, LinkedIn, Instagram, Facebook, YouTube, Zillow, Booking, ZoomInfo, X, etc.), including products, reviews, profiles, posts, comments, listings, and videos (WebDataFeed). Note: do not fabricate links; run SearchEngine first to find a real URL if needed.
Available Tools
Tool Name | Description |
---|---|
Brightdata.ScrapeAsMarkdown | Scrape a webpage and return content in Markdown format using Bright Data. |
Brightdata.SearchEngine | Search using Google, Bing, or Yandex with advanced parameters using Bright Data. |
Brightdata.WebDataFeed | Extract structured data from various websites like LinkedIn, Amazon, Instagram, etc. |
If you need to perform an action that’s not listed here, you can get in touch with us to request a new tool, or create your own tools.
Brightdata.ScrapeAsMarkdown
Scrape a webpage and return content in Markdown format using Bright Data.
Parameters
- url (string, required) URL to scrape
Secrets
This requires the following secrets: BRIGHTDATA_API_KEY, BRIGHTDATA_ZONE (learn how to configure secrets)
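Since the tool takes a single required url, a small pre-flight check can catch malformed links before a call is spent. The following is a minimal sketch, not part of the Brightdata Server: build_scrape_input is a hypothetical helper, the example URL is a placeholder, and the returned dictionary is assumed to be passed as the tool's input by whatever client you use.

```python
from urllib.parse import urlparse


def build_scrape_input(url: str) -> dict:
    """Build the input for Brightdata.ScrapeAsMarkdown (hypothetical helper)."""
    parsed = urlparse(url)
    # Reject anything that is not an absolute http(s) URL before calling the tool.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {url!r}")
    return {"url": url}


# Placeholder URL, used only for illustration.
scrape_input = build_scrape_input("https://example.com/pricing")
print(scrape_input)
```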
Brightdata.SearchEngine
Search using Google, Bing, or Yandex with advanced parameters using Bright Data.
Parameters
- query (string, required) Search query
- engine (Enum SearchEngine, optional) Search engine to use
- language (string, optional) Two-letter language code
- country_code (string, optional) Two-letter country code
- search_type (Enum SearchType, optional) Type of search
- start (integer, optional) Results pagination offset
- num_results (integer, optional) Number of results to return. The default is 10
- location (string, optional) Location for search results
- device (Enum DeviceType, optional) Device type
- return_json (boolean, optional) Return JSON instead of Markdown
Secrets
This requires the following secrets: BRIGHTDATA_API_KEY, BRIGHTDATA_ZONE (learn how to configure secrets)
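With this many optional parameters, it can help to assemble and validate the input in one place. The sketch below is illustrative only: build_search_input and the constant names are hypothetical, the allowed values are copied from the SearchEngine, SearchType, and DeviceType enumerations in the Reference section, and it assumes the tool accepts the lowercase enum values and applies its own defaults to omitted parameters.

```python
# Allowed values copied from the enumerations in the Reference section.
SEARCH_ENGINES = {"google", "bing", "yandex"}
SEARCH_TYPES = {"images", "shopping", "news", "jobs"}
DEVICE_TYPES = {"mobile", "ios", "iphone", "ipad", "android", "android_tablet"}
ENUM_PARAMS = {"engine": SEARCH_ENGINES, "search_type": SEARCH_TYPES, "device": DEVICE_TYPES}
OTHER_PARAMS = {"language", "country_code", "start", "num_results", "location", "return_json"}


def build_search_input(query: str, **options) -> dict:
    """Assemble Brightdata.SearchEngine parameters (hypothetical helper)."""
    if not query:
        raise ValueError("query is required")
    params = {"query": query}
    for name, value in options.items():
        if name not in ENUM_PARAMS and name not in OTHER_PARAMS:
            raise ValueError(f"unknown parameter: {name}")
        if value is None:
            continue  # omit unset optionals (assumed to fall back to tool defaults)
        if name in ENUM_PARAMS and value not in ENUM_PARAMS[name]:
            raise ValueError(f"{name} must be one of {sorted(ENUM_PARAMS[name])}")
        params[name] = value
    return params


search_input = build_search_input(
    "open source web crawlers",
    engine="bing",
    search_type="news",
    language="en",
    country_code="us",
    num_results=5,
    device=None,
    return_json=True,
)
print(search_input)
```

Rejecting unknown parameter names up front surfaces typos such as country instead of country_code before the request leaves your code.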
Brightdata.WebDataFeed
Extract structured data from various websites like LinkedIn, Amazon, Instagram, etc.
Parameters
- source_type (Enum SourceType, required) Type of data source
- url (string, required) URL of the web resource to extract data from
- num_of_reviews (integer, optional) Number of reviews to retrieve. Only applicable for facebook_company_reviews. Default is None
- timeout (integer, optional) Maximum time in seconds to wait for data retrieval
- polling_interval (integer, optional) Time in seconds between polling attempts
Secrets
This requires the following secrets: BRIGHTDATA_API_KEY (learn how to configure secrets)
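The rule that num_of_reviews only applies to facebook_company_reviews is easy to miss, so it can be enforced when the input is built. This is a minimal sketch under stated assumptions: build_feed_input is a hypothetical helper, the source types are copied from the SourceType enumeration in the Reference section, and the URL in the usage line is a placeholder rather than a real page.

```python
# Values copied from the SourceType enumeration in the Reference section.
SOURCE_TYPES = {
    "amazon_product", "amazon_product_reviews",
    "linkedin_person_profile", "linkedin_company_profile",
    "zoominfo_company_profile",
    "instagram_profiles", "instagram_posts", "instagram_reels", "instagram_comments",
    "facebook_posts", "facebook_marketplace_listings", "facebook_company_reviews",
    "x_posts", "zillow_properties_listing", "booking_hotel_listings", "youtube_videos",
}


def build_feed_input(source_type, url, num_of_reviews=None, timeout=None, polling_interval=None):
    """Assemble Brightdata.WebDataFeed parameters (hypothetical helper)."""
    if source_type not in SOURCE_TYPES:
        raise ValueError(f"unsupported source_type: {source_type!r}")
    # num_of_reviews is documented as applicable to facebook_company_reviews only.
    if num_of_reviews is not None and source_type != "facebook_company_reviews":
        raise ValueError("num_of_reviews only applies to facebook_company_reviews")
    params = {
        "source_type": source_type,
        "url": url,
        "num_of_reviews": num_of_reviews,
        "timeout": timeout,
        "polling_interval": polling_interval,
    }
    # Drop unset optionals so only explicit choices are sent.
    return {key: value for key, value in params.items() if value is not None}


# Placeholder URL, used only for illustration.
feed_input = build_feed_input(
    "facebook_company_reviews",
    "https://example.com/company/reviews",
    num_of_reviews=20,
    timeout=120,
)
print(feed_input)
```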
Secrets
The Brightdata Server requires the following secrets:
- BRIGHTDATA_API_KEY
- BRIGHTDATA_ZONE
Auth
The Arcade Bright Data Server uses Bright Data to access proxy networks and web scraping infrastructure.
Global Environment Variables:
- BRIGHTDATA_API_KEY: Your Bright Data API key. You can generate this from your Bright Data dashboard under Settings → API Access.
- BRIGHTDATA_ZONE: Your Bright Data zone name (e.g., residential_proxy1). This is the zone identifier you created in your Bright Data dashboard under Proxies & Scraping Infrastructure → Zones.
How to get your credentials:
- API Key: Navigate to your Bright Data Control Panel → Settings → API Access → Generate API Token
- Zone: Go to the Zones section in your dashboard and find your zone name in the zone username, which has the format brd-customer-{customer_id}-zone-{zone_name}
For more details, see the Bright Data API Documentation.
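The zone username format above can be parsed to recover the value for BRIGHTDATA_ZONE, and a quick check can confirm that both variables are set. The sketch below is only an illustration: zone_from_username and missing_secrets are hypothetical helpers, the customer ID in the example is made up, and the pattern assumes the customer ID itself contains no hyphen.

```python
import os
import re

# Assumption: the customer ID contains no hyphen, so the first "-zone-" marks the split.
ZONE_USERNAME = re.compile(r"^brd-customer-(?P<customer_id>[^-]+)-zone-(?P<zone_name>.+)$")
REQUIRED_SECRETS = ("BRIGHTDATA_API_KEY", "BRIGHTDATA_ZONE")


def zone_from_username(username: str) -> str:
    """Extract the zone name from a zone username (hypothetical helper)."""
    match = ZONE_USERNAME.match(username)
    if match is None:
        raise ValueError(f"not a zone username: {username!r}")
    return match.group("zone_name")


def missing_secrets(env) -> list:
    """Return the names of required secrets that are unset or empty in env."""
    return [name for name in REQUIRED_SECRETS if not env.get(name)]


# Made-up customer ID, used only for illustration.
zone = zone_from_username("brd-customer-hl_123abc-zone-residential_proxy1")
print(zone)
print(missing_secrets(os.environ))
```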
Reference
Below is a reference of enumerations used by some of the tools in the Brightdata Server:
SearchEngine
- GOOGLE: google
- BING: bing
- YANDEX: yandex
SearchType
- IMAGES: images
- SHOPPING: shopping
- NEWS: news
- JOBS: jobs
DeviceType
- MOBILE: mobile
- IOS: ios
- IPHONE: iphone
- IPAD: ipad
- ANDROID: android
- ANDROID_TABLET: android_tablet
SourceType
- AMAZON_PRODUCT: amazon_product
- AMAZON_PRODUCT_REVIEWS: amazon_product_reviews
- LINKEDIN_PERSON_PROFILE: linkedin_person_profile
- LINKEDIN_COMPANY_PROFILE: linkedin_company_profile
- ZOOMINFO_COMPANY_PROFILE: zoominfo_company_profile
- INSTAGRAM_PROFILES: instagram_profiles
- INSTAGRAM_POSTS: instagram_posts
- INSTAGRAM_REELS: instagram_reels
- INSTAGRAM_COMMENTS: instagram_comments
- FACEBOOK_POSTS: facebook_posts
- FACEBOOK_MARKETPLACE_LISTINGS: facebook_marketplace_listings
- FACEBOOK_COMPANY_REVIEWS: facebook_company_reviews
- X_POSTS: x_posts
- ZILLOW_PROPERTIES_LISTING: zillow_properties_listing
- BOOKING_HOTEL_LISTINGS: booking_hotel_listings
- YOUTUBE_VIDEOS: youtube_videos