Stories from the Web Crawling trenches in URL

Accessing Your Local Web Server from Python Requests

Author: Mohan Ganesan

Date: Feb 3, 2024

Accessing a development server on localhost is easy with Python requests: use http://localhost or http://127.0.0.1, add the port your server listens on (for example :8000), suppress SSL warnings when using HTTPS, then import requests and call get or post as usual.

Debugging HTTP Requests in Python with Request Logging

Author: Mohan Ganesan

Date: Feb 3, 2024

Add comprehensive logging to Python requests for visibility into issues when making HTTP requests.

How to fix MissingSchema error in Python requests

Author: Mohan Ganesan

Date: Oct 22, 2023

The MissingSchema error occurs when making a request to a URL without specifying the protocol. This article explains the causes of the error and provides various ways to fix and handle it properly.
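One way to avoid the error is to normalize a URL before passing it to requests. The sketch below uses only the standard library. The helper name ensure_scheme and the choice of https as the default scheme are assumptions made for illustration, not something the article prescribes.

```python
import re

# Matches a leading scheme such as "http://" or "ftp://".
_SCHEME_RE = re.compile(r"^[A-Za-z][A-Za-z0-9+.-]*://")


def ensure_scheme(url: str, default: str = "https") -> str:
    """Return url unchanged if it already starts with a scheme, else prepend one.

    Hypothetical helper: the default scheme is an assumption and may not
    suit every server (a local dev server often speaks plain http).
    """
    url = url.strip()
    if _SCHEME_RE.match(url):
        return url
    # Protocol-relative URLs ("//host/path") only need the scheme and a colon.
    if url.startswith("//"):
        return f"{default}:{url}"
    return f"{default}://{url}"


print(ensure_scheme("example.com/page"))
print(ensure_scheme("http://example.com"))
```

The normalized string can then be passed to requests.get in place of the raw input.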

Passing Data in URLs with urllib Query Parameters in Python

Author: Mohan Ganesan

Date: Feb 8, 2024

Pass data through URLs using query parameters in Python's urllib module for HTTP requests.
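As a minimal sketch of the idea, urllib.parse.urlencode turns a dictionary into a query string, and parse_qs reverses the step. The endpoint and parameter names here are placeholders chosen for illustration.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

base_url = "https://example.com/search"  # placeholder endpoint
params = {"q": "web scraping", "page": 2}  # illustrative parameters

# urlencode escapes each value and joins the pairs with "&".
query = urlencode(params)
full_url = f"{base_url}?{query}"
print(full_url)

# Reading the parameters back: parse_qs returns a list of strings per key.
decoded = parse_qs(urlsplit(full_url).query)
print(decoded)
```

Note that every decoded value comes back as a string, so numeric parameters such as page need an explicit int() conversion on the receiving side.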

What is the difference between URL and API?

Author: Mohan Ganesan

Date: May 7, 2024

URLs handle locating web assets, while APIs handle application integration. Both serve important, yet distinct roles in modern web technologies.

Accessing Specific Paths with the Python Requests Library

Author: Mohan Ganesan

Date: Feb 3, 2024

Make HTTP requests in Python and access specific paths on a server using the Requests library and URL encoding.

Troubleshooting Python Requests Get When Webpage Isn't Loading

Author: Mohan Ganesan

Date: Feb 3, 2024

When using Python's Requests library to load a webpage, troubleshoot by checking the URL, status code, and response headers.

Sending Data in GET Requests with Python Requests

Author: Mohan Ganesan

Date: Feb 3, 2024

The Python Requests library provides a simple way to send GET requests with data using the requests.get() method. It encodes the data into a query string that is appended to the URL, making it perfect for sending non-sensitive data like filters or pagination options.

Accessing Websites in Python with urllib.request.urlopen

Author: Mohan Ganesan

Date: Feb 6, 2024

The urllib.request module in Python 3 provides a simple way to access and download data from websites via HTTP and HTTPS.

Accessing Array Data in URLs with Python's urllib

Author: Mohan Ganesan

Date: Feb 6, 2024

Python's urllib provides simple utilities to encode array data into URLs and restore it on the other end.
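One common convention, sketched below, is to repeat the key once per list item. Passing doseq=True to urlencode produces that form, and parse_qs collects the repeated keys back into a list. The tag and limit keys are illustrative only, and other conventions (such as comma-separated values) exist.

```python
from urllib.parse import parse_qs, urlencode

filters = {"tag": ["python", "urllib"], "limit": 10}  # illustrative data

# doseq=True expands each sequence into repeated key=value pairs.
query = urlencode(filters, doseq=True)
print(query)

# parse_qs groups repeated keys into a single list of strings.
restored = parse_qs(query)
print(restored)
```

Without doseq=True, urlencode would encode the string form of the whole list as a single value, which the receiving side could not split back into items with parse_qs.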

urllib attribute error

Author: Mohan Ganesan

Date: Feb 6, 2024

The urllib module in Python provides functions for fetching data from the web. Common errors include attribute errors and invalid URLs. Handling redirects and errors is important.

Fetching Data from the Web with urllib's read()

Author: Mohan Ganesan

Date: Feb 8, 2024

Python's urllib module provides a simple way to retrieve data from the internet using the read() method. It handles network I/O and allows you to focus on working with the downloaded data.
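The sketch below shows read() returning the response body as bytes, which are then decoded to text. To stay runnable offline it starts a throwaway server on 127.0.0.1, so the handler class and its response body are assumptions made for the demo, not part of the article.

```python
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import urlopen


class HelloHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that answers every GET with a fixed text body."""

    def do_GET(self):
        body = b"hello from localhost"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep the demo output quiet


# Port 0 asks the operating system for any free port.
server = HTTPServer(("127.0.0.1", 0), HelloHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()

url = f"http://127.0.0.1:{server.server_port}/"
with urlopen(url, timeout=5) as response:
    status = response.status
    raw = response.read()  # the whole body, as bytes

text = raw.decode("utf-8")  # decode explicitly to get a str
print(status, text)

server.shutdown()
server.server_close()
```

For large downloads, calling read() with a size argument in a loop avoids holding the entire body in memory at once.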
