web-crawler · GitHub Topics · GitHub Crawlee—A web scraping and browser automation library for Node js to build reliable crawlers In JavaScript and TypeScript Extract data for AI, LLMs, RAG, or GPTs Download HTML, PDF, JPG, PNG, and other files from websites Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP Both
webcrawler · GitHub Topics · GitHub GitHub is where people build software More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects
webcrawler · GitHub Topics · GitHub Webcrawler que capta noticias sobre games do site comboinfinito com br e guarda dados em banco SQL Server sqlserver webcrawler Updated Feb 12, 2021
Crawl4AI: Open-source LLM Friendly Web Crawler Scraper. Crawl4AI is the #1 trending GitHub repository, actively maintained by a vibrant community It delivers blazing-fast, AI-ready web crawling tailored for LLMs, AI agents, and data pipelines Open source, flexible, and built for real-time performance, Crawl4AI empowers developers with unmatched speed