
Scraping New York Times News Headlines with Java

Author: Mohan Ganesan

Date: Dec 6, 2023

Scrape New York Times headlines with Java and the Jsoup library: fetch the page while sending a browser-like User-Agent string, then extract each headline and its link.
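As a rough preview of that approach, here is a minimal sketch using Jsoup. Several details are assumptions made for illustration and do not come from the New York Times markup itself: the class name `NytHeadlines`, the User-Agent value, the 10-second timeout, and especially the selector `a[href]:has(h3)`, which simply assumes each headline is an `h3` wrapped in a link. Inspect the live page and adjust the selector before relying on it.

```java
import java.io.IOException;
import java.util.LinkedHashMap;
import java.util.Map;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class NytHeadlines {

    // Assumed browser-like User-Agent; any current desktop browser string should work.
    static final String USER_AGENT =
            "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
            + "(KHTML, like Gecko) Chrome/119.0.0.0 Safari/537.36";

    // Downloads and parses the page, identifying as a regular browser.
    static Document fetch(String url) throws IOException {
        return Jsoup.connect(url)
                .userAgent(USER_AGENT)
                .timeout(10_000) // milliseconds; arbitrary choice for this sketch
                .get();
    }

    // Returns headline text mapped to its absolute URL, in page order.
    // The selector is a guess at the markup, not a confirmed structure.
    static Map<String, String> extractHeadlines(Document doc) {
        Map<String, String> headlines = new LinkedHashMap<>();
        for (Element link : doc.select("a[href]:has(h3)")) {
            Element heading = link.selectFirst("h3");
            String title = heading.text().trim();
            String url = link.absUrl("href"); // resolves relative links against the page URL
            if (!title.isEmpty() && !url.isEmpty()) {
                headlines.putIfAbsent(title, url); // keep the first link per headline
            }
        }
        return headlines;
    }

    public static void main(String[] args) throws IOException {
        Document doc = fetch("https://www.nytimes.com/");
        extractHeadlines(doc).forEach((title, url) ->
                System.out.println(title + " -> " + url));
    }
}
```

Keeping the extraction in its own method, separate from the network call, means it can be tried against a saved HTML file or a string parsed with `Jsoup.parse` without hitting the site on every run.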
