
What is the fastest XML parser in Python?

Author: Mohan Ganesan

Date: Feb 5, 2024

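Choosing the right XML parsing library matters for performance. In a benchmark parse, lxml was the fastest option, taking only 0.35 seconds compared with over 2 seconds for the standard library's xml.etree.ElementTree. lxml is a third-party package that has to be installed separately, but the speedup is well worth the extra setup.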
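To check the comparison on your own machine, a minimal timing sketch might look like the one below. It is not the benchmark that produced the numbers above: the synthetic catalog document, the helper names (build_sample_xml, time_parser, run_benchmark), the document size, and the repeat count are all assumptions made for illustration, so expect different absolute timings. lxml is optional here and is skipped if it is not installed.

```python
import timeit
import xml.etree.ElementTree as ET

try:
    # Third-party package; install with `pip install lxml`.
    from lxml import etree as LET
except ImportError:
    LET = None  # Fall back to timing only the standard library parser.


def build_sample_xml(n_items: int = 10_000) -> bytes:
    """Build a synthetic XML document (hypothetical data) with n_items <item> elements."""
    rows = "".join(
        f'<item id="{i}"><name>Item {i}</name><price>{i * 0.5:.2f}</price></item>'
        for i in range(n_items)
    )
    return f"<catalog>{rows}</catalog>".encode("utf-8")


def time_parser(parse, data: bytes, repeats: int = 5) -> float:
    """Return the best wall-clock time in seconds over `repeats` single parses."""
    return min(timeit.repeat(lambda: parse(data), number=1, repeat=repeats))


def run_benchmark(n_items: int = 10_000) -> dict:
    """Time each available parser on the same in-memory document."""
    data = build_sample_xml(n_items)
    results = {"xml.etree.ElementTree": time_parser(ET.fromstring, data)}
    if LET is not None:
        results["lxml"] = time_parser(LET.fromstring, data)
    return results


if __name__ == "__main__":
    for name, seconds in run_benchmark().items():
        print(f"{name}: {seconds:.4f} s")
```

Taking the minimum over several runs reduces noise from other processes. Both parsers receive the same bytes in memory, so the sketch measures parsing only, not disk or network I/O.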
