Stories from the Web Crawling trenches in troubleshooting

Authenticating Python Requests: A Practical Guide to Using Tokens for API Access

Author: Mohan Ganesan

Date: Dec 6, 2023

API tokens are critical for securing web APIs. Learn how to obtain and use tokens for authenticated API calls in Python, and troubleshoot common token-related issues.

Troubleshooting 403 Errors when Web Scraping in Python Requests

Author: Mohan Ganesan

Date: Dec 6, 2023

Learn how to troubleshoot and prevent 403 Forbidden errors in web scraping. Understand common causes, diagnose the root cause, and implement solutions using Python. Use techniques like retrying requests, analyzing HTTP traffic, simplifying requests, and verifying authentication. Prevent future errors by using proxies, randomizing user agents, solving CAPTCHAs, and throttling requests. Consider using a professional proxy service like Proxies API for large-scale scraping.

Troubleshooting aiohttp ServerDisconnectedError

Author: Mohan Ganesan

Date: Feb 22, 2024

If you're using Python's aiohttp library for asynchronous HTTP requests and getting ServerDisconnectedErrors, here are some troubleshooting tips to handle the response inside the context manager and check for connectivity issues.

Using Proxies With C++ httplib in 2024

Author: Mohan Ganesan

Date: Jan 9, 2024

Using a proxy with C++ httplib is easy. Set up authentication, chain multiple proxies, customize settings, and troubleshoot issues. Proxies API offers a better solution for unblockable scraping.

Making the Most of aiohttp's TCPConnector for Asynchronous HTTP Requests

Author: Mohan Ganesan

Date: Mar 3, 2024

Carefully configuring aiohttp's TCPConnector is key to get the most out of asynchronous HTTP in Python.

Troubleshooting "ImportError: No module named requests" in Python

Author: Mohan Ganesan

Date: Feb 3, 2024

When working with Python, you may encounter the error ImportError: No module named requests. Here are some troubleshooting tips to resolve this issue.

Debugging Empty Responses from HTTP Requests in Python

Author: Mohan Ganesan

Date: Feb 3, 2024

Frustrated with empty response bodies in Python HTTP requests? Check response body format, content encoding, decode response bytes, log full response details, test in Postman.

The Definitive Guide to Handling Proxies in Go in 2024

Author: Mohan Ganesan

Date: Jan 9, 2024

Dealing with proxies in Go for web scraping: setup, security, privacy, performance, and troubleshooting. Proxies API offers a solution for developers.

Troubleshooting Slow and Failing Requests in Python

Author: Mohan Ganesan

Date: Feb 3, 2024

Making HTTP requests in Python applications can sometimes be problematic. This article provides tips for troubleshooting slow or failing requests, including checking for network/server issues, setting sensible timeouts, inspecting the request object, and profiling long requests.

Troubleshooting Aiohttp Connecting to the Wrong Host

Author: Mohan Ganesan

Date: Mar 3, 2024

When using the aiohttp library in Python, you may occasionally see errors where aiohttp attempts to connect to the wrong host. There are a few things you can try to resolve it: check your DNS configuration, specify the host explicitly, use IP addresses instead of hostnames, and add server name indication (SNI) for HTTPS connections.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!