Stories from the Web Crawling trenches in CSharp

Downloading Images from URLs in CSharp

Author: Mohan Ganesan

Date: May 5, 2024

Learn how to download images from URLs using C# with WebClient, HttpClient, RestSharp, Flurl, and System.Drawing.

How to Build a Simple HTTP Proxy in CSharp in just 25 lines of code

Author: Mohan Ganesan

Date: Oct 1, 2023

Build a basic proxy server in C# on the .NET Framework using the HttpListener and WebClient classes. Avoid IP blocking with a rotating proxy service.

Web Scraping in CSharp - The Ultimate Guide

Author: Mohan Ganesan

Date: Mar 24, 2024

Learn web scraping with C# using powerful libraries like HtmlAgilityPack and AngleSharp. Understand the importance of XPath and CSS selectors for extracting data from HTML. Overcome challenges like dynamic content and anti-scraping measures. Rotate user agents and headers to mimic human behavior and avoid detection.

The Complete HTML Agility Pack Cheat Sheet in VB

Author: Mohan Ganesan

Date: Oct 31, 2023

HTML Agility Pack is an HTML parser for .NET that allows easy manipulation and data extraction from HTML documents.

Scrape Any Website with OpenAI Function Calling in C++

Author: Mohan Ganesan

Date: Sep 25, 2023

Web scraping with OpenAI in C++ allows for resilient data extraction from websites using function calling.

Web Scraping with C++ & ChatGPT

Author: Mohan Ganesan

Date: Sep 25, 2023

Combine C++ with ChatGPT for web scraping. Use libraries like libcurl for HTTP requests and libxml2 for HTML parsing, and have ChatGPT explain concepts and generate code snippets.

Downloading Images from a Website with CSharp and HtmlAgilityPack

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to use C# and HtmlAgilityPack to download images from a Wikipedia page and extract data from HTML tables.

Web Scraping Wikipedia with CSharp

Author: Mohan Ganesan

Date: Dec 6, 2023

Learn how to scrape data from Wikipedia using C# and the HtmlAgilityPack library. Extract information from websites for data collection, analysis, and automation.

Scrape Any Website with OpenAI Function Calling in CSharp

Author: Mohan Ganesan

Date: Sep 25, 2023

Web scraping with OpenAI in C# allows for resilient data extraction from websites using natural language processing.

Scraping all the Images from a Website using CSharp

Author: Mohan Ganesan

Date: Dec 13, 2023

Learn how to scrape data and images from a website using C# and the HtmlAgilityPack library. Extract data from a webpage, check the HTTP status code, store the data, and download images.

Scraping Reddit Posts in CSharp

Author: Mohan Ganesan

Date: Jan 9, 2024

Download and parse a Reddit page using AngleSharp in C# to extract information from posts.

Web Scraping with CSharp & ChatGPT

Author: Mohan Ganesan

Date: Sep 25, 2023

Web scraping in C# using ChatGPT and HtmlAgilityPack for data extraction and code generation.

Scraping New York Times News Headlines in CSharp

Author: Mohan Ganesan

Date: Dec 6, 2023

Automate data extraction from websites using C# and HTML Agility Pack. Use an HTTP client to make requests and XPath to select HTML elements.

Scraping Real Estate Listings from Realtor with CSharp

Author: Mohan Ganesan

Date: Jan 9, 2024

Scrape real estate listing data from Realtor.com using C# and the HtmlAgilityPack library. Extract information like broker name, price, beds, baths, sqft, lot size, and address.

Scraping Multiple Pages in CSharp with HtmlAgilityPack

Author: Mohan Ganesan

Date: Oct 15, 2023

Use HtmlAgilityPack in C# to extract data from multiple pages. Use proxies to scale up and avoid IP blocks.

Scraping Craigslist Listings with CSharp

Author: Mohan Ganesan

Date: Oct 1, 2023

Learn how to scrape Craigslist apartment listings using C# and HtmlAgilityPack. Avoid IP blocking with a rotating proxy server.
