Stories from the Web Crawling trenches in goquery

The Ultimate Goquery Cheatsheet

Author: Mohan Ganesan

Date: Oct 31, 2023

Goquery is a Go library for easy HTML manipulation and extraction using jQuery-style syntax. Great for web scraping and building web apps.

Web Scraping Wikipedia Data in Go

Author: Mohan Ganesan

Date: Dec 6, 2023

Web scraping is the process of automatically collecting structured data from websites. This tutorial demonstrates how to scrape a Wikipedia table using Golang and goquery library.

Scraping Multiple Pages in Go with net/http and goquery

Author: Mohan Ganesan

Date: Oct 15, 2023

Web scraping in Go using net/http and goquery to extract data from multiple pages. Use a base URL pattern with %d placeholder and loop through pages to construct each page URL. Send request and parse HTML with goquery to find and extract data. Print or store scraped data.

Downloading Images from a Website with Go and goquery

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to use Go and goquery to download images from a Wikipedia page, extract data from HTML tables, and scrape websites. Use Proxies API for IP rotation and CAPTCHA solving.

Scraping Craigslist Listings with Go

Author: Mohan Ganesan

Date: Oct 1, 2023

Learn how to scrape Craigslist apartment listings using Go and goquery. Avoid IP blocking with a rotating proxy server.

How to Scrape Reddit Posts in Go

Author: Mohan Ganesan

Date: Jan 9, 2024

Learn how to scrape Reddit using Go with a step-by-step guide. Extract information about posts using HTML parsing and HTTP requests.

Scraping Real Estate Listings From Realtor with Go

Author: Mohan Ganesan

Date: Jan 9, 2024

Learn how to scrape real estate listing data from Realtor.com using Go and the goquery library. Use web scraping to collect and analyze housing data.

Tired of getting blocked while scraping the web?

ProxiesAPI handles headless browsers and rotates proxies for you.
Get access to 1,000 free API credits, no credit card required!