Check if a Deleted Web Page Still Exists on the Wayback Machine
Patrick Bushe
October 25, 2025 · 5 min read
You click a link and get a 404. Or a domain that's been parked. Or a
blank page where something used to be. The content is gone — or is it?
The Internet Archive's Wayback Machine has been crawling the web since
1996. If a page existed for any meaningful length of time and received
regular traffic, there's a decent chance it was crawled at least once.
Knowing how to check that quickly is a genuinely useful skill.
When Pages Disappear
Pages get removed for a lot of reasons. Companies delete old blog posts.
Startups shut down and let domains expire. Government agencies archive
content or move it without redirects. Journalists unpublish articles.
Forum threads get moderated. None of that means the content is gone
forever — it just means you need to know where to look.
The most common scenario: you followed a link from somewhere, it's dead
now, but you need the content that was there. Maybe it was documentation,
a tutorial, a product page, or a press release.
How to Check Manually
The direct URL approach is the most reliable:
1. Take the dead URL (the one returning 404)
2. Go to web.archive.org
3. Paste the URL into the Wayback Machine search
4. If it was ever crawled, you'll see a calendar with snapshot dates
5. Click the most recent highlighted date before the page disappeared
If there are no results, try stripping the URL back to the domain and
checking the site's archive more broadly — sometimes the URL changed
but the content was moved rather than deleted.
Faster Method With Wayback Quick Access
Here's the problem: the URL you want to check is dead, so you can't
just navigate to it and click a button — there's nothing to navigate to.
But if you're coming from a page with a broken link, you can:
1. Right-click the dead link
2. Copy the link address
3. Open a new tab, paste the URL in the address bar (don't press Enter
yet, or press Enter knowing it'll 404)
4. Once the URL is in your address bar, click the Wayback Quick Access
button — it'll pick up whatever URL is currently in the address bar,
even a 404 page
This is faster than the manual flow because you skip the step of going
to archive.org and pasting. The extension opens the archive lookup
directly.
What to Do When You Find a Snapshot
Once you find an archived version of a deleted page:
- Read it inline in the Wayback Machine interface
- Copy the content you need
- Save the archived URL — it's a permanent link you can share
- Use the Download button (on some archive pages) to save the HTML
For research purposes, citing an archived URL is acceptable in most
academic and journalistic contexts as a source for content that no
longer exists at its original location.
When the Page Wasn't Archived
- Sometimes a page simply wasn't crawled. This happens with:
- Very new pages taken down quickly
- Pages behind login walls
- Pages that blocked archive.org's crawler in robots.txt
- Pages on very low-traffic sites
If the Wayback Machine doesn't have it, check Google Cache (though
Google's cache is often short-lived), Bing's cached pages, or try
searching for quoted text from the page title to find mirrors or
sites that may have reproduced the content.
Conclusion
A 404 doesn't always mean gone. The Wayback Machine is the first place
to check, and the less friction between you and that lookup, the more
often you'll actually use it. Most researchers and developers who use
archive lookup regularly end up installing a quick-access extension
precisely because the default workflow is just slow enough to make
them skip the check when they're in a hurry.