Scarpfly MCP connector

OAuth 2.1/DCRSearchAIDeveloper Tools

Connect to Scrapfly MCP. Scrape web pages, take screenshots, and control a cloud browser with anti-bot bypass, JS rendering, and proxy support.

Scarpfly MCP connector

Install the SDK
Section titled “Install the SDK”
- Node.js
- Python
Terminal window
1 npm install @scalekit-sdk/node
Terminal window
1 pip install scalekit
Full SDK reference: Node.js | Python
Set your credentials
Section titled “Set your credentials”

Add your Scalekit credentials to your .env file. Find values in app.scalekit.com > Developers > API Credentials.
.env
```
SCALEKIT_ENVIRONMENT_URL=<your-environment-url>
SCALEKIT_CLIENT_ID=<your-client-id>
SCALEKIT_CLIENT_SECRET=<your-client-secret>
```

1
import { ScalekitClient } from '@scalekit-sdk/node'
2
import 'dotenv/config'
3

4
const scalekit = new ScalekitClient(
5
  process.env.SCALEKIT_ENV_URL,
6
  process.env.SCALEKIT_CLIENT_ID,
7
  process.env.SCALEKIT_CLIENT_SECRET,
8
)
9
const actions = scalekit.actions
10

11
const connector = 'scarpflymcp'
12
const identifier = 'user_123'
13

14
// Generate an authorization link for the user
15
const { link } = await actions.getAuthorizationLink({ connectionName: connector, identifier })
16
console.log('Authorize Scarpfly MCP:', link)
17
process.stdout.write('Press Enter after authorizing...')
18
await new Promise(r => process.stdin.once('data', r))
19

20
// Make your first call
21
const result = await actions.executeTool({
22
  connector,
23
  identifier,
24
  toolName: 'scarpflymcp_get_page_url',
25
  toolInput: {},
26
})
27
console.log(result)

1
import os
2
from scalekit.client import ScalekitClient
3
from dotenv import load_dotenv
4
load_dotenv()
5

6
scalekit_client = ScalekitClient(
7
    env_url=os.getenv("SCALEKIT_ENV_URL"),
8
    client_id=os.getenv("SCALEKIT_CLIENT_ID"),
9
    client_secret=os.getenv("SCALEKIT_CLIENT_SECRET"),
10
)
11
actions = scalekit_client.actions
12

13
connection_name = "scarpflymcp"
14
identifier = "user_123"
15

16
# Generate an authorization link for the user
17
link_response = actions.get_authorization_link(
18
    connection_name=connection_name,
19
    identifier=identifier,
20
)
21
print("Authorize Scarpfly MCP:", link_response.link)
22
input("Press Enter after authorizing...")
23

24
# Make your first call
25
result = actions.execute_tool(
26
    tool_input={},
27
    tool_name="scarpflymcp_get_page_url",
28
    connection_name=connection_name,
29
    identifier=identifier,
30
)
31
print(result)

What you can do

Connect this agent connector to let your agent:

Scrape web — Fetch a URL with full control over headers, JS rendering, proxy country, anti-scraping protection, and output format
Get web, page url — Quickly fetch a URL with sensible defaults and return the page content
Text type — Type text at the current cursor position in the active cloud browser session
Snapshot take — Take a DOM snapshot of the current page in the cloud browser session
Screenshot take, cloud browser — Take a screenshot of the current page in the active cloud browser session
Option select — Select an option in a dropdown element in the active cloud browser session

Tool list

Use the exact tool names from the Tool list below when you call execute_tool. If you’re not sure which name to use, list the tools available for the current user first.

scarpflymcp_browser_unblock#Unblock a URL using a headless browser with anti-scraping protection. Returns the page content after bypassing bot detection.3 params

Unblock a URL using a headless browser with anti-scraping protection. Returns the page content after bypassing bot detection.

NameTypeRequiredDescription

urlstringrequiredTarget URL to fetch or interact with.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

timeoutintegeroptionalServer-side timeout in milliseconds. Use alongside rendering_wait for JS-heavy pages.

scarpflymcp_call_webmcp_tool#Call a specific tool from a connected remote WebMCP server by name with provided input.2 params

Call a specific tool from a connected remote WebMCP server by name with provided input.

NameTypeRequiredDescription

tool_namestringrequiredName of the WebMCP tool to call (from list_webmcp_tools)

inputstringoptionalJSON-stringified parameters to pass to the tool. Omit for tools with no parameters.

scarpflymcp_check_if_blocked#Check whether a URL returns blocked or captcha content by scraping it and analyzing the response.10 params

Check whether a URL returns blocked or captcha content by scraping it and analyzing the response.

NameTypeRequiredDescription

contentstringrequiredPage content (HTML/text) from a scrape result. Use raw or clean_html format for best detection accuracy.

urlstringrequiredTarget URL to fetch or interact with.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

extraction_modelstringoptionalPre-built AI extraction model to apply. Accepted values: article, event, food_recipe, hotel, product, job_posting, organization, and more.

formatstringoptionalOutput format for the scraped content. Accepted values: markdown, text, clean_html, json, raw.

format_optionsarrayoptionalAdditional options (only available for markdown and text formats)

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

response_headersobjectoptionalResponse headers from the scrape result. Enables header-based antibot detection.

status_codeintegeroptionalHTTP status code from the scrape result (e.g. 403, 429, 503). Defaults to 200. Improves detection accuracy.

scarpflymcp_click#Click an element in the active cloud browser session. Requires a uid obtained from take_snapshot.1 param

Click an element in the active cloud browser session. Requires a uid obtained from take_snapshot.

NameTypeRequiredDescription

uidstringrequiredElement UID from take_snapshot. Used to target a specific element for interaction.

scarpflymcp_cloud_browser_close#Close an active cloud browser session by session ID to free up resources.2 params

Close an active cloud browser session by session ID to free up resources.

NameTypeRequiredDescription

session_idstringrequiredActive cloud browser session ID. Obtain from cloud_browser_open.

user_close_requeststringrequiredVerbatim quote of the user's close instruction (e.g. "close the session", "stop the browser"). Must contain at least one of: close, end, stop, terminate, dispose, shut down, kill, quit, exit, fermer, arrêter, terminer. Rejected if empty or meta-phrase.

scarpflymcp_cloud_browser_downloads#Retrieve files downloaded during an active cloud browser session.2 params

Retrieve files downloaded during an active cloud browser session.

NameTypeRequiredDescription

filenamestringoptionalNo description.

session_idstringoptionalActive cloud browser session ID. Obtain from cloud_browser_open.

scarpflymcp_cloud_browser_eval#Fetch a URL in a cloud browser session and optionally execute JavaScript, with full scraping options available.9 params

Fetch a URL in a cloud browser session and optionally execute JavaScript, with full scraping options available.

NameTypeRequiredDescription

expressionstringrequiredJavaScript expression to evaluate in the browser page.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

extraction_modelstringoptionalPre-built AI extraction model to apply. Accepted values: article, event, food_recipe, hotel, product, job_posting, organization, and more.

formatstringoptionalOutput format for the scraped content. Accepted values: markdown, text, clean_html, json, raw.

format_optionsarrayoptionalAdditional options (only available for markdown and text formats)

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

session_idstringoptionalActive cloud browser session ID. Obtain from cloud_browser_open.

urlstringoptionalTarget URL to fetch or interact with.

scarpflymcp_cloud_browser_navigate#Navigate an active cloud browser session to a new URL.3 params

Navigate an active cloud browser session to a new URL.

NameTypeRequiredDescription

session_idstringrequiredActive cloud browser session ID. Obtain from cloud_browser_open.

urlstringrequiredTarget URL to fetch or interact with.

url_sourcestringrequiredWhere this URL came from. user_prompt = user named it; page_snapshot = found in last take_snapshot; webmcp_tool = returned by a WebMCP tool. Any other provenance means the URL was invented and the call will be rejected.

scarpflymcp_cloud_browser_open#Open a cloud browser session on a URL for multi-step interaction such as clicking, filling forms, and navigating pages.12 params

Open a cloud browser session on a URL for multi-step interaction such as clicking, filling forms, and navigating pages.

NameTypeRequiredDescription

urlstringrequiredTarget URL to fetch or interact with.

blacklistbooleanoptionalNo description.

block_fontsbooleanoptionalNo description.

block_imagesbooleanoptionalNo description.

block_mediabooleanoptionalNo description.

block_stylesbooleanoptionalNo description.

cachebooleanoptionalNo description.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

debugbooleanoptionalNo description.

optimize_bandwidthbooleanoptionalNo description.

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

timeoutintegeroptionalServer-side timeout in milliseconds. Use alongside rendering_wait for JS-heavy pages.

scarpflymcp_cloud_browser_performance#Get Core Web Vitals and performance metrics for the current page in a cloud browser session.3 params

Get Core Web Vitals and performance metrics for the current page in a cloud browser session.

NameTypeRequiredDescription

presetstringoptionalNo description.

session_idstringoptionalActive cloud browser session ID. Obtain from cloud_browser_open.

timeout_msintegeroptionalNo description.

scarpflymcp_cloud_browser_screenshot#Take a screenshot of the current page in an active cloud browser session.10 params

Take a screenshot of the current page in an active cloud browser session.

NameTypeRequiredDescription

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

extraction_modelstringoptionalPre-built AI extraction model to apply. Accepted values: article, event, food_recipe, hotel, product, job_posting, organization, and more.

formatstringoptionalOutput format for the scraped content. Accepted values: markdown, text, clean_html, json, raw.

format_optionsarrayoptionalAdditional options (only available for markdown and text formats)

full_pagebooleanoptionalCapture the full scrollable page, not just the viewport. Default: false.

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

selectorstringoptionalCSS selector of an element to screenshot. If provided, only that element is captured.

session_idstringoptionalActive cloud browser session ID. Obtain from cloud_browser_open.

urlstringoptionalTarget URL to fetch or interact with.

scarpflymcp_cloud_browser_sessions#List all active cloud browser sessions for the current account.0 params

List all active cloud browser sessions for the current account.

scarpflymcp_drag#Drag an element to another element in the active cloud browser session. Requires uids obtained from take_snapshot.2 params

Drag an element to another element in the active cloud browser session. Requires uids obtained from take_snapshot.

NameTypeRequiredDescription

from_uidstringrequiredElement UID to drag from. Obtain via take_snapshot.

to_uidstringrequiredElement UID to drag to. Obtain via take_snapshot.

scarpflymcp_evaluate_script#Evaluate a JavaScript expression in the active cloud browser session and return the result.1 param

Evaluate a JavaScript expression in the active cloud browser session and return the result.

NameTypeRequiredDescription

expressionstringrequiredJavaScript expression to evaluate

scarpflymcp_fill#Fill a form field in the active cloud browser session. Requires a uid obtained from take_snapshot.2 params

Fill a form field in the active cloud browser session. Requires a uid obtained from take_snapshot.

NameTypeRequiredDescription

uidstringrequiredElement UID from take_snapshot. Used to target a specific element for interaction.

valuestringrequiredText to fill in

scarpflymcp_get_page_url#Get the current URL of the active cloud browser session.1 param

Get the current URL of the active cloud browser session.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_hover#Hover over an element in the active cloud browser session. Requires a uid obtained from take_snapshot.1 param

Hover over an element in the active cloud browser session. Requires a uid obtained from take_snapshot.

NameTypeRequiredDescription

uidstringrequiredElement UID from take_snapshot. Used to target a specific element for interaction.

scarpflymcp_info_account#Retrieve Scrapfly account details including plan, remaining credits, and usage limits.1 param

Retrieve Scrapfly account details including plan, remaining credits, and usage limits.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_info_api_key#Retrieve information about the current Scrapfly API key including permissions and rate limits.1 param

Retrieve information about the current Scrapfly API key including permissions and rate limits.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_inspect_page#Inspect the current page in a cloud browser session and optionally answer a question about its content.3 params

Inspect the current page in a cloud browser session and optionally answer a question about its content.

NameTypeRequiredDescription

full_pagebooleanoptionalNo description.

questionstringoptionalNo description.

session_idstringoptionalActive cloud browser session ID. Obtain from cloud_browser_open.

scarpflymcp_list_webmcp_tools#List all tools available on the connected remote WebMCP server.1 param

List all tools available on the connected remote WebMCP server.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_press_key#Press a keyboard key in the active cloud browser session (e.g. Enter, Tab, Escape).1 param

Press a keyboard key in the active cloud browser session (e.g. Enter, Tab, Escape).

NameTypeRequiredDescription

keystringrequiredKey to press: Enter, Tab, Escape, ArrowDown, etc.

scarpflymcp_scraping_instruction_enhanced#Get enhanced instructions on how to configure Scrapfly options for a specific scraping task or target site.1 param

Get enhanced instructions on how to configure Scrapfly options for a specific scraping task or target site.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_screenshot#Take a screenshot of a URL using Scrapfly's headless browser. Supports full-page capture, custom resolution, and visual deficiency simulation.15 params

Take a screenshot of a URL using Scrapfly's headless browser. Supports full-page capture, custom resolution, and visual deficiency simulation.

NameTypeRequiredDescription

urlstringrequiredTarget URL to fetch or interact with.

auto_scrollbooleanoptionalIf true, automatically scroll the page to load lazy content.

cachebooleanoptionalIf true, enable response caching.

cache_clearbooleanoptionalIf true, bypass & clear cache for this request.

cache_ttlintegeroptionalThe cache time-to-live in seconds.

capturestringoptionalThe capture to use for the screenshot. Either fullpage or a CSS selector

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

formatstringoptionalImage format for the screenshot. Accepted values: jpg, png, webp, gif.

jsstringoptionalJavaScript code to execute on the page after load.

optionsarrayoptionalScreenshot options to use for the screenshot.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

resolutionstringoptionalThe resolution to use for the screenshot. e.g. 1920x1080

vision_deficiency_typestringoptionalThe vision deficiency to use for the screenshot.

wait_for_selectorstringoptionalCSS selector to wait for before returning the response. Use when the target content loads asynchronously.

webhookstringoptionalThe webhook to call after the request completes.

scarpflymcp_scroll#Scroll the page or a specific element in the active cloud browser session by pixel delta.3 params

Scroll the page or a specific element in the active cloud browser session by pixel delta.

NameTypeRequiredDescription

deltaXnumberoptionalHorizontal scroll pixels (optional)

deltaYnumberoptionalVertical scroll pixels (optional, e.g. 500 to scroll down)

uidstringoptionalElement UID from take_snapshot. Used to target a specific element for interaction.

scarpflymcp_select_option#Select an option in a dropdown element in the active cloud browser session. Requires a uid obtained from take_snapshot.2 params

Select an option in a dropdown element in the active cloud browser session. Requires a uid obtained from take_snapshot.

NameTypeRequiredDescription

uidstringrequiredElement UID from take_snapshot. Used to target a specific element for interaction.

valuestringrequiredOption value or text to select

scarpflymcp_take_screenshot#Take a screenshot of the current page in the active cloud browser session.1 param

Take a screenshot of the current page in the active cloud browser session.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_take_snapshot#Take a DOM snapshot of the current page in the cloud browser session. Returns element uids needed for click, fill, hover, drag, and scroll operations.1 param

Take a DOM snapshot of the current page in the cloud browser session. Returns element uids needed for click, fill, hover, drag, and scroll operations.

NameTypeRequiredDescription

dummystringoptionalUnused placeholder field required by the MCP protocol. Pass any string value.

scarpflymcp_type_text#Type text at the current cursor position in the active cloud browser session.1 param

Type text at the current cursor position in the active cloud browser session.

NameTypeRequiredDescription

textstringrequiredText to type

scarpflymcp_web_get_page#Quickly fetch a URL with sensible defaults and return the page content. Best for simple one-shot page retrieval.10 params

Quickly fetch a URL with sensible defaults and return the page content. Best for simple one-shot page retrieval.

NameTypeRequiredDescription

powstringrequiredProof-of-work token for anti-bot bypass. Use scraping_instruction_enhanced to get guidance on the correct value.

urlstringrequiredTarget URL to fetch or interact with.

capture_flagsarrayoptionalScreenshot flags to use for the screenshot.

capture_pagebooleanoptionalIf true, also capture the page as a screenshot.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

extraction_modelstringoptionalPre-built AI extraction model to apply. Accepted values: article, event, food_recipe, hotel, product, job_posting, organization, and more.

formatstringoptionalOutput format for the scraped content. Accepted values: markdown, text, clean_html, json, raw.

format_optionsarrayoptionalAdditional options (only available for markdown and text formats)

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

scarpflymcp_web_scrape#Fetch a URL with full control over headers, JS rendering, proxy country, anti-scraping protection, and output format.26 params

Fetch a URL with full control over headers, JS rendering, proxy country, anti-scraping protection, and output format.

NameTypeRequiredDescription

powstringrequiredProof-of-work token for anti-bot bypass. Use scraping_instruction_enhanced to get guidance on the correct value.

urlstringrequiredTarget URL to fetch or interact with.

aspbooleanoptionalEnable Anti Scraping Protection.

bodystringoptionalRequest body for POST/PUT/PATCH requests.

cachebooleanoptionalEnable caching of the response.

cache_clearbooleanoptionalIf true, bypass & clear cache for this URL.

cache_ttlintegeroptionalCache TTL in seconds when cache is true.

cookiesstringoptionalCookies to send with the request.

countrystringoptionalTwo-letter ISO 3166-1 alpha-2 country code for the proxy exit node (e.g. US, DE, FR).

extraction_modelstringoptionalPre-built AI extraction model to apply. Accepted values: article, event, food_recipe, hotel, product, job_posting, organization, and more.

extraction_promptstringoptionalCustom AI prompt for extracting specific data from the page. Avoid if the model can process the data directly.

formatstringoptionalOutput format for the scraped content. Accepted values: markdown, text, clean_html, json, raw.

format_optionsarrayoptionalAdditional options (only available for markdown and text formats)

headersobjectoptionalHTTP headers to send.

jsstringoptionalJavaScript code to execute on the page after load.

js_scenarioarrayoptionalA schema for validating a sequence of browser actions (JS Scenario) for the Scrapfly API.

langarrayoptionalLanguages to use for the request (Accept-Language header). Empty for auto-detection/Proxy Location alignment

methodstringoptionalHTTP method for the request. Accepted values: GET, POST, PUT, PATCH, OPTIONS, HEAD.

proxy_poolstringoptionalProxy pool to route the request through. Accepted values: public_datacenter_pool, public_residential_pool.

render_jsbooleanoptionalEnable JavaScript rendering with a headless browser.

rendering_waitintegeroptionalMilliseconds to wait after JS rendering before returning the response.

retrybooleanoptionalIf false, disable automatic retry on transient errors.

screenshot_flagsarrayoptionalScreenshot flags to use for the screenshot.

screenshotsstringoptionalScreenshots with target (fullpage, selector). Example: [{ 'name': 'my_screenshot', 'target': 'fullpage' }, { 'name': 'my_screenshot2', 'target': 'selector', 'css_selector': '#price' }]

timeoutintegeroptionalServer-side timeout in milliseconds. Use alongside rendering_wait for JS-heavy pages.

wait_for_selectorstringoptionalCSS selector to wait for before returning the response. Use when the target content loads asynchronously.

Scarpfly MCP connector

Scarpfly MCP connector

Install the SDK

Set your credentials

Authorize and make your first call

What you can do

Tool list