Retour au marché
playwright-scraper-skill
Playwright-based web scraping OpenClaw Skill with anti-bot protection, supporting various use cases and customization options.
11,114téléchargements91installations20étoiles
v1.2.0
Overview
Playwright Scraper Skill is a powerful web scraping OpenClaw Skill that leverages Playwright for anti-bot protection. It supports various use cases, from simple sites to complex Cloudflare-protected ones.
Key Features
- Supports various use cases, including regular sites, dynamic sites, and Cloudflare-protected sites
- Includes anti-bot protection techniques, such as hiding automation markers and using realistic User-Agent
- Offers customization options, including environment variables for screenshot path, wait time, and User-Agent
- Supports headful mode for debugging
How It Works
The skill uses Playwright to scrape websites, with different scripts for different use cases. The web_fetch tool is used for simple sites, while playwright-simple.js and playwright-stealth.js are used for dynamic and Cloudflare-protected sites, respectively.
Use Cases
- Regular sites: Use OpenClaw’s built-in
web_fetchtool - Dynamic sites: Use
playwright-simple.js - Cloudflare-protected sites: Use
playwright-stealth.js - YouTube video transcripts: Use
deep-scraper(install separately) - Reddit: Use
reddit-scraper(install separately)
Installation
- Clone the repository
- Run
npm installandnpx playwright install chromium
Quick Start
- Use OpenClaw’s built-in
web_fetchtool for simple sites - Use
playwright-simple.jsfor dynamic sites - Use
playwright-stealth.jsfor Cloudflare-protected sites - Use
deep-scraperfor YouTube video transcripts - Use
reddit-scraperfor Reddit
Avis
Aucun avis pour le moment.