Stories from the Web Crawling trenches in modules

How to Build a Super Simple HTTP Proxy in Perl in just 20 lines of code

Author: Mohan Ganesan

Date: Oct 1, 2023

Build a basic HTTP proxy server in Perl using less than 20 lines of code. Use rotating proxy service to avoid IP blocking.

Automating Downloads in Python with urllib and wget

Author: Mohan Ganesan

Date: Feb 8, 2024

Python provides modules like urllib and wget for programmatically downloading files and web content. urllib is part of Python's standard library and provides more control, while wget is a feature-rich command line tool with advanced capabilities. Both can be used together for different downloading tasks.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!