Date: Oct 31, 2023
select.rs is a robust HTML/XML scraping library for Rust. This cheat sheet covers its features, including installation, loading documents, selecting nodes, traversing nodes, extracting/modifying nodes, creating/inserting/removing nodes, output formats, caching and persistence, headless browsers, validation, encoding, advanced selectors, caching and performance, common recipes, troubleshooting, and ecosystem libraries.
Date: Oct 31, 2023
Cheerio is a fast, flexible web scraping library for Node.js. This cheat sheet provides a comprehensive reference of its syntax and capabilities.
Date: Jan 9, 2024
Learn how to scrape Reddit posts using Java, web scraping, HTML parsing, selectors, and user-agent headers.
Date: Sep 25, 2023
Rust is a great language for web scraping with ChatGPT's help. It involves sending HTTP requests, extracting data, and using selectors. ChatGPT can provide explanations and generate code snippets. A web scraping API like Proxies API can be used for more robust solutions.
Date: Jan 9, 2024
Scrape Reddit posts using Kotlin script, send HTTP requests, parse HTML, and extract key data using selectors.
Date: Oct 15, 2023
Web scraping in Rust using reqwest and selectors crates to extract data from multiple pages. Use proxies for scaling up scraping.
Date: Oct 1, 2023
Learn how to scrape Craigslist apartment listings using Rust and the reqwest and selectors crates.
ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!