This collection contains crawl data collected daily from a seed
list of US-based local news websites. Between 2005 and 2023, almost 2900
local newspapers shut down. This collection aims to archive existing
news sites should they shut down too.
Seed list used as of 2025-07-17 located here. The majority of seeds are provided courtesy of Northwestern University's State of Local News Project.
TIMESTAMPS
The Wayback Machine - https://web.archive.org/web/20250505055655/https://github.com/signup
You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
Create your free account
Explore GitHub's core features for individuals and organizations.
See what's included
Access to GitHub CopilotIncrease your productivity and accelerate software development.
Unlimited repositoriesCollaborate securely on public and private projects.
Integrated code reviewsBoost code quality with built-in review tools.
Automated workflowsSave time with CI/CD integrations and GitHub Actions.
Community supportConnect with developers worldwide for instant feedback and insights.