The API to search, scrape, and interact with the web at scale. 🔥
Updated Jun 22, 2026 - TypeScript
Python scraper based on AI
A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.
Python client library for Diffbot APIs
Export Safari reading list to JSON or CSV
The official Node.js SDK for Spidra.
Web Data Frames
The web data layer for AI agents — fetch, search, crawl, extract, screenshot, and monitor the web with 50+ domain extractors and MCP.
Agent skill that gives an AI agent hands in the browser, letting it use the browser the way a user would. 25+ automation tools to navigate, extract data, execute scripts, and intercept APIs, all in the user's own Chrome with their login sessions. No passwords needed. All data is processed locally.
Amazon product data analysis with Python & Jupyter. Includes cleaning, stats, and visualizations of categories, prices, ratings, and availability.
Analyze and parse HTML responses, programmatically scrape web data, and utilize Pandas DataFrames to store, transform, and merge tables.
High-performance web scraping engine that converts any web page into clean markdown, with a 3-layer fallback (Cheerio -> Playwright -> Abrasio) and AI-powered structured extraction
Let AI agents fetch live social media and web data with the official Social Fetch MCP server.
Synoppy MCP server — give your AI agent the whole web. Read, crawl, map, extract, classify & enrich live pages from Claude, Cursor, or any MCP client.
Official Python SDK for Synoppy — the web-data layer for AI agents. Read, crawl, map, extract, classify & enrich any website on a single API key.
Extract text from images using a robust OCR model designed for accuracy and efficiency in varied visual contexts.