互联网(Internet)已经存在了二十多年。但是在过去的几年里,很多资源一直在仔细地完整地归档互联网。(Internet)WayBack Machine是最受欢迎的服务之一,可让您浏览过去的万维网(World Wide Web)。除了它已经归档了超过 4450 亿个网页这一事实之外,奇怪的是它从未发布过它所归档的网站的清单或它用来确定要捕获什么以及何时捕获的算法。
回程机
随着互联网(Internet)达到了可供机构研究的成熟时代,这些档案现在比以往任何时候都更加重要。尽管在Wayback Machine(Wayback Machine)上存档了 4450 亿个网页,但肯定有很多松散的结局。例如,BBC 的存档始于 1996 年,但正确对齐的图像仅在 2012 年之后才开始出现。Wayback Machine发布所有存储网页的网站的工作方式略有不同。它仅发布来自 70 个主要国家/地区的前 100 万个网站的网页,由Alexa排名。
“The WayBack Machine is used by hundreds of thousands of people every day, presenting snapshots, back in time, from more than 1.5 billion websites,” says Mark Graham, director of the Wayback Machine.
错误页面的解决方案(SOLUTION TO ERROR PAGES)
Wayback Machine的另一个功能是 Chrome插件可以识别您在浏览您喜欢的网站时遇到 404 或任何其他网页错误。然后它会继续检查并查看该站点是否有存档版本。因此,无论是否有一个网页被可疑地从Internet 上(Internet)删除,或者该网站太烂而无法继续运行,Wayback都有档案供您调查。简单来说,它是一种对抗链接失效威胁的方法。
政府记录(GOVERNMENT RECORDS)
不过,互联网档案馆(Internet Archive)对这款新产品有着更高的抱负。据报道,在奥巴马(Obama)政府执政期间,几乎 83% 的信息文件和 49% 的最高法院(Supreme Court)记录都从互联网(Internet)上丢失。这就是Wayback Machine正在寻求解决的问题。臭名昭著的链接失效问题日益引起人们的关注,在线档案对于保存大量重要数据至关重要。
有趣的经历(INTERESTING EXPERIENCES)
在接受Entrepreneur Magazine(Entrepreneur Magazine)采访时,导演Mark Graham分享了该服务用户的有趣体验。
“On July 17, 2014, Igor (Strelkov) Girkin, a Ukrainian separatist leader, claimed responsibility online for the downing of what he thought was a Ukrainian military transport plane near the rebel-held Ukrainian city of Donetsk. When reports that Malaysian Airlines Flight MH17, with 295 passengers, had been shot down in the same area, his post was removed. But not before it had been preserved several times by the Wayback Machine, where it is available today.”
USP 与未来(USP AND THE FUTURE)
Wayback Machine最大的特点是网站抓取所有这些数十亿和数万亿网页以获取信息和快照的方式。他们超过 5 万亿网络捕获的库存不是单个连续爬取过程的结果,而是多年来由数千人定义的数百万次单独爬取的结果。该公司的目标是建立整个互联网(Internet)的终极数据库,让所有好奇想要访问的人永久可用。
因此,您可以使用WayBack 机器(WayBack Machine)查看 Internet上的存档或缓存网页(view Archived or Cached web pages on the Internet),也可以保存网页作为它首先出现在 Internet 上的证据。
WayBack 机器 Chrome 扩展程序
WayBack Machine发布了一款出色的浏览器扩展,可以减少烦人的 404 页面。此扩展程序将检测错误代码 404、408、410、451、500、502、503、504、509、520、521、523、524、525 和 526,并提供显示存档版本。你可以在这里(here)下载。(here.)
WayBack 机器替代品
如果您正在寻找Wayback Machine替代品,请查看archive.is和 screenshots.com。
WayBack Machine: Chrome extension & Alternative Internet Archive sites
The Internet hаs been around for more than a couple of decades now. But a lot of resoυrces have been carefully archiving the Internet in its entirety ovеr the past years. Onе of the most popular services that let you browsе the yestеryears of the World Wіde Web is WayBack Machine. Apart from the fact that it has archived more than 445 billion web pages, the weird part is that it has never published an inventory of the websites it archives or the algorithms it uses to determine what to capture and when.
WayBack Machine
With the Internet reaching a mature age for institutions to research on, these archives are now more important than ever. Despite the 445 billion web pages archived on Wayback Machine, there are certainly a lot of loose ends. For instance, BBC’s archive started in 1996, but the properly aligned images started appearing only after 2012. And the website where Wayback Machine posts all the stored web pages works in a slightly different manner. It posts only the web pages from top 1 million websites in 70 major countries, as ranked by Alexa.
“The WayBack Machine is used by hundreds of thousands of people every day, presenting snapshots, back in time, from more than 1.5 billion websites,” says Mark Graham, director of the Wayback Machine.
SOLUTION TO ERROR PAGES
Another feature of Wayback Machine is that the Chrome plugin recognizes whenever you come across a 404 or any other web page error while browsing your favorite sites. It then proceeds to check and see if there’s an archived version of that site. So, whether there is a web page that has been suspiciously removed from the Internet or the site is just too rotten to continue functioning, Wayback has the archive for you to investigate just that. In simpler terms, it is a way of fighting the menace of link rot.
GOVERNMENT RECORDS
The Internet Archive has a much nobler ambition for this new product, though. According to reports, almost 83% of the information documents under the Obama administration, and 49% of all Supreme Court records are missing from the Internet. And this is the problem that the Wayback Machine is looking to solve. The infamous link rot is a growing concern, and online archives are vital to preserving a vast plethora of important data.
INTERESTING EXPERIENCES
In an interview with Entrepreneur Magazine, director Mark Graham shared an interesting experience from the service’s users.
“On July 17, 2014, Igor (Strelkov) Girkin, a Ukrainian separatist leader, claimed responsibility online for the downing of what he thought was a Ukrainian military transport plane near the rebel-held Ukrainian city of Donetsk. When reports that Malaysian Airlines Flight MH17, with 295 passengers, had been shot down in the same area, his post was removed. But not before it had been preserved several times by the Wayback Machine, where it is available today.”
USP AND THE FUTURE
The biggest feature of Wayback Machine is the way the site crawls all these billions and trillions of web pages for information and snapshots. What their inventory of more than half a trillion web captures is not the result of a single continuous crawling process but rather millions of separate crawls, defined by thousands of people, over the years. The company is aiming to build the ultimate database of the entire Internet that is permanently available to everyone that is curious enough to want access.
Thus you can use the WayBack Machine to view Archived or Cached web pages on the Internet as well to save a web page as proof that it appeared first on the Internet.
WayBack Machine Chrome extension
WayBack Machine has released an excellent browser extension that can reduce annoying 404 pages. This extension will detect error codes 404, 408, 410, 451, 500, 502, 503, 504, 509, 520, 521, 523, 524, 525, and 526 and offer to display the archived version. You can download it here.
WayBack Machine alternative
If you are looking for Wayback Machine alternatives, then check out archive.is and screenshots.com.