Stories from the Web Crawling Trenches: Tree Traversal

The Ultimate Floki Cheatsheet for Elixir

Author: Mohan Ganesan

Date: Oct 31, 2023

Floki makes it easy to parse and query HTML documents in Elixir using CSS selectors and tree traversal.

The Ultimate HTML::Parser Perl Cheat Sheet

Author: Mohan Ganesan

Date: Oct 31, 2023

HTML::Parser is a Perl module for parsing HTML/XML documents and extracting/manipulating their content.
