Stories from the Web Crawling trenches in HTML tables

Downloading Images from a Website with PHP and DOM

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to use PHP and the DOM extension to download images from a Wikipedia page and extract data from HTML tables. Use Proxies API for scraping at scale.

Parsing HTML Tables with BeautifulSoup

Author: Mohan Ganesan

Date: Oct 6, 2023

BeautifulSoup is a useful library for extracting data from HTML tables in Python. With a few simple lines of code, you can parse an HTML table and convert it into a pandas DataFrame for further analysis.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!