Stories from the Web Crawling trenches in DOM traversal

The Ultimate JSoup Kotlin Cheatsheet

Author: Mohan Ganesan

Date: Oct 31, 2023

JSoup is a Java library for working with real-world HTML. It provides a convenient API for extracting and manipulating data from HTML documents.

The Ultimate JSoup Scala Cheatsheet

Author: Mohan Ganesan

Date: Oct 31, 2023

JSoup is a Java library for working with real-world HTML. It provides a convenient API for extracting and manipulating data from HTML documents.

Scraping Reddit Posts In C++

Author: Mohan Ganesan

Date: Jan 9, 2024

Web scraping C++ program that extracts post data from Reddit using HTML parsing and curl library.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!