Scraping New York Times News Headlines in Scala

Author: Mohan Ganesan

Date: Dec 6, 2023

Web scraping is a technique for extracting data from websites automatically. This article explains how to scrape article titles and links from The New York Times homepage using Scala and the Jsoup library.
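
As a rough preview of the idea, here is a minimal sketch of pulling titles and links out of HTML with Jsoup. It parses an inline sample string rather than the live page, so it runs without network access. The helper name `extractHeadlines`, the sample markup, and the `a[href]:has(h3)` selector are illustrative assumptions, not the actual structure of The New York Times homepage. It also assumes Scala 2.13 or later with the Jsoup library on the classpath.

```scala
import org.jsoup.Jsoup
import scala.jdk.CollectionConverters._

object HeadlineSketch {
  // Hypothetical helper: returns (title, absolute link) pairs for every
  // anchor that wraps an <h3>. The selector is an assumption made for this
  // sketch and is not taken from the real New York Times markup.
  def extractHeadlines(html: String, baseUri: String): List[(String, String)] = {
    // baseUri lets Jsoup resolve relative hrefs into absolute URLs.
    val doc = Jsoup.parse(html, baseUri)
    doc.select("a[href]:has(h3)").asScala.toList.map { a =>
      (a.select("h3").text(), a.absUrl("href"))
    }
  }

  def main(args: Array[String]): Unit = {
    // Made-up sample markup standing in for a news homepage.
    val sampleHtml =
      """<section>
        |  <a href="/2023/12/06/example-one.html"><h3>Example headline one</h3></a>
        |  <a href="/2023/12/06/example-two.html"><h3>Example headline two</h3></a>
        |  <a href="/about">About</a>
        |</section>""".stripMargin

    // Print each headline with its resolved link; the "About" link has no
    // heading inside it, so the selector skips it.
    extractHeadlines(sampleHtml, "https://www.nytimes.com/").foreach {
      case (title, link) => println(s"$title -> $link")
    }
  }
}
```

To try this against a live page, the HTML string could come from `Jsoup.connect(url).get()` instead of the sample, with the selector adjusted to whatever markup the page actually serves.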