Cloudflare and Internet Archive have joined forces to archive more of the public web, touting it would make the web more reliable. As part of this joint effort, websites that use Cloudflare's Always Online service will be able to allow the web infrastructure company to share their hostname and URLs with Internet Archive's Wayback Machine so their website can be automatically archived. When a site is down, Cloudflare will then be able to retrieve the most recently archived version from Internet Archive so that a site's content can be accessed by users. "The Internet Archive's Wayback Machine has an impressive infrastructure that can archive the web at scale," Cloudflare CEO and co-founder Matthew Prince said. "By working together, we can take another step toward making the internet more resilient by stopping server issues for our customers and in turn from interrupting businesses and users online." According to Internet Archive, more than 468 billion web pages are available via the Wayback Machine to date.
The web is decentralized and fluid by design, but all that chaos and ephemerality can make it difficult to keep a site up and online without interruption. That's what has made the Internet Archive's Wayback Machine feature so invaluable over the years, maintaining a history of long-forgotten pages. Now its deep memory will help make sure the sites you visit never go down, through a partnership with the internet infrastructure company Cloudflare. Since 2010, Cloudflare has offered a feature called Always On, which caches a static version of sites that it can serve to visitors in case of downtime. Always On was one of CloudFlare's original offerings; John Graham-Cumming, the company's chief technology officer, says the infrastructure powering it was due to be rearchitected.
Even the Wayback Machine is getting into fact-checking now. In a blog post on its website, the Internet Archive announced it was rolling out fact-checking annotations on certain webpages archives by its Wayback Machine. According to Mark Graham, director of the Wayback Machine, the organization felt the need for this feature after noticing a number of fact-checking groups linking to archived versions of pages. "We are attempting to preserve our digital history but recognize the issues around providing access to false and misleading information coming from different sources," Graham wrote in the post. "By providing convenient links to contextual information we hope that our patrons will better understand what they are reading in the Wayback Machine."
It can be hard to find sites that have disappeared from the Internet. But the Internet Archive's Wayback Machine is on the verge of rolling out a feature that will make tracking down dead websites much easier, according to Internet Archive founder Brewster Kahle. The Wayback Machine has been helping people see past Internet sites over the past 15 years, but searchers always needed to know the URL of a website to find the archived copies. Soon, however, you'll be able to use keyword searches to find old websites -- in fact, you can already test it out through a public beta. The new search feature is not quite like Google, where all the text on each page on a website is indexed to help with searches.
You know that thing where you're reading something interesting but then you see a super interesting link and you're like, yeah, I'm gonna click the shit out of this, and then you do, and your computer fritzes, blinks at you, and then you get a 404 error? Internet Archive has built a tool to make that stop happening. Internet Archive has long chronicled deceased web pages with the Wayback Machine, allowing users to access stuff that is no longer readily available. But, of course, users would have to know about the Wayback Machine and how to use it. Now Internet Archive has built a Wayback Machine Chrome extension.