Stories from the Web Crawling trenches in website

Speed Up Your Website: Measuring Page Load Times in Python

Author: Mohan Ganesan

Date: Feb 3, 2024

Measure page load times in Python using the requests module to provide a good user experience. Fetch a webpage and calculate the duration it takes to fully load.

How To Find All URLs On A Domain's Website

Author: Mohan Ganesan

Date: Apr 30, 2024

Find all URLs on a website for SEO analysis, broken link detection, competitive analysis, web scraping, and website migration.

Can I crawl any website?

Author: Mohan Ganesan

Date: Feb 20, 2024

When creating a web crawler, it is important to respect websites' permissions and crawl ethically. The Robots Exclusion Protocol and proper identification of the crawler are key factors. Legal risks can be avoided by obtaining explicit permission from website owners.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!