Overview
The Browser Agent Node provides intelligent web browser automation through Browserbase with Playwright and Stagehand tools. This node enables you to:- Navigate websites and interact with web elements
- Extract data from web pages
- Perform complex browsing workflows
- Handle authentication and forms automatically
- Solve captchas and avoid detection
Configuration Parameters
The Browser Agent uses Browserbase for remote browser sessions with advanced stealth and automation capabilities.Node Configuration
-
Task Instructions:
Describe the browsing task for the agent to accomplish.
Search for Browserbase tutorials and summarize the first result
Login to the dashboard, navigate to settings, and extract the API key
Go to the pricing page and extract all plan details
Advanced Settings
Advanced Settings
- Keep Session Running: Leave the browser session running after completion for reuse.
- Model: Select the language model for the agent.
- System Prompt: Customize the agent’s browsing behavior.
- Max Steps: Maximum reasoning steps the agent can take (1-50).
- Enable Model Fallbacks: Allow automatic model fallbacks if needed.
Browserbase Configuration
Browserbase Configuration
- Region: Preferred Browserbase region for new sessions.
- Session Timeout: Override session timeout in seconds (minimum 30s).
- Browser Context ID: Reuse a saved browser context.
- Viewport Size: Set viewport width and height in pixels.
- Log/Record Sessions: Enable session logging and rrweb recording.
- Solve Captchas: Toggle built-in captcha solver.
- Block Ads: Enable ad blocking.
- Advanced Stealth Mode: Control stealth protections.
- Proxy Options: Configure proxy routing and geolocation.
Expected Inputs and Outputs
-
Inputs:
- input: Optional templating inputs for the prompt
- session_id: Optional Browserbase session ID to reuse
-
Outputs:
- output: Agent’s response summary
- conversation: Full agent conversation log
- browser: Browserbase session metadata (includes live view URL)
Use Case Examples
- Web Data Extraction: Use the agent to navigate complex websites and extract structured data without writing scraping code.
- Automated Testing: Configure the agent to test web applications by performing user actions and verifying results.
- Form Automation: Let the agent fill out and submit forms across multiple websites automatically.
- Competitive Intelligence: Automate browsing competitor websites to gather pricing, features, and product information.
Error Handling and Troubleshooting
- Session Issues: If the browser session fails, the node will handle cleanup automatically. Check Browserbase configuration if issues persist.
- Element Not Found: The agent uses natural language tools (Stagehand) that are more reliable than precise selectors. If precise tools fail, the agent will fall back to natural language.
- Iframe Content: When accessing iframe content, the agent will try to get the embedded page URL and navigate directly to it.
- Page Crashes: If the browser session crashes (ERR_ABORTED), the agent will stop and report the error.