Stories from the Web Crawling trenches in Exception handling

The Ultimate Jsoup Cheatsheet in Java

Author: Mohan Ganesan

Date: Oct 31, 2023

Jsoup is a Java library for parsing and manipulating HTML using DOM, CSS, and jquery-like methods.

Handling URL Errors Gracefully in Python urllib

Author: Mohan Ganesan

Date: Feb 6, 2024

Handle errors when working with URLs in Python using the urllib module. Catch HTTPError and URLError exceptions, and apply targeted handling and retries where applicable.

Handling Errors with aiohttp ClientResponseError

Author: Mohan Ganesan

Date: Feb 22, 2024

Handle aiohttp ClientResponseError in Python for robust and user-friendly applications.

Asyncio task exception handling

Author: Mohan Ganesan

Date: Mar 25, 2024

Asynchronous programming with asyncio in Python has advantages and challenges. Proper exception handling is key to creating robust asyncio code.

Handling Errors Gracefully with Asyncio Exceptions

Author: Mohan Ganesan

Date: Mar 25, 2024

Asyncio provides an asynchronous programming framework in Python for non-blocking I/O code. Exception handling in asyncio requires special care, including handling CancelledError and propagating exceptions from tasks.

Debugging urllib Issues

Author: Mohan Ganesan

Date: Feb 8, 2024

Using urllib module for HTTP requests in Python can run into issues. Tips for debugging: validate URL, handle exceptions, use logging, inspect request details.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!