Stories from the Web Crawling trenches in benchmarking

What is the fastest XML parser in Python?

Author: Mohan Ganesan

Date: Feb 5, 2024

Choosing the right XML parsing library is crucial for performance. lxml is the fastest option, taking only 0.35 seconds compared to over 2 seconds with xml.etree.ElementTree. It's well worth the extra setup.

Benchmarking aiohttp Web Performance

Author: Mohan Ganesan

Date: Feb 22, 2024

The Python aiohttp library provides powerful async HTTP client/server functionality. Benchmarking quantifies metrics like requests per second, latency distributions, and resource usage to guide optimization and capacity planning.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!