Stories from the Web Crawling Trenches: Validation

The Ultimate KSoup Cheatsheet for Kotlin

Author: Mohan Ganesan

Date: Oct 31, 2023

KSoup is an HTML parser for Kotlin that provides a convenient DSL for extracting and manipulating data from HTML documents.

Extracting URLs from Text in Python

Author: Mohan Ganesan

Date: Feb 20, 2024

When working with text data in Python, you can use regular expressions and the urllib module to detect and validate URLs. This article provides examples and tips for effectively detecting links in text.
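As a rough illustration of the approach this teaser describes, here is a minimal sketch that finds candidate links with a regular expression and then checks each one with `urllib.parse`. The pattern, the `extract_urls` helper name, and the sample text are assumptions made for this example and are not taken from the article; the regex is deliberately simple and will miss edge cases such as bare domains without a scheme.

```python
import re
from urllib.parse import urlparse

# Deliberately simple pattern (an assumption for this sketch):
# match http:// or https:// followed by characters that are not
# whitespace, angle brackets, or quotes.
URL_PATTERN = re.compile(r"https?://[^\s<>\"']+")


def extract_urls(text):
    """Return the http(s) URLs found in text, in order of appearance."""
    urls = []
    for candidate in URL_PATTERN.findall(text):
        # Drop trailing punctuation that usually belongs to the sentence,
        # not to the URL itself.
        cleaned = candidate.rstrip(".,;:!?)")
        parsed = urlparse(cleaned)
        # Keep only candidates with a supported scheme and a host part.
        if parsed.scheme in ("http", "https") and parsed.netloc:
            urls.append(cleaned)
    return urls


sample = (
    "Docs live at https://example.com/docs, "
    "and the API is at http://api.example.com/v1."
)
print(extract_urls(sample))
```

Splitting detection (the regex) from validation (`urlparse`) keeps the pattern short and lets the validation rule be tightened later without touching the regex.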

Working with Request Parameters in aiohttp

Author: Mohan Ganesan

Date: Mar 3, 2024

aiohttp makes it easy to get request parameters. Adding validation middleware helps create robust APIs and catch issues early.

Submitting Form Data with aiohttp in Python

Author: Mohan Ganesan

Date: Mar 3, 2024

Access and validate form data in aiohttp, store and process it, and handle errors to provide user feedback.
