互联网存档(Internet archiving)服务是维护开放透明互联网的重要组成部分。网页和社交媒体帖子不能保证永远在线。但是使用存档服务可以让您高枕无忧,因为您知道每个重要的文档、推文或任何其他网络媒体都会在您需要的时候出现。
这些归档服务执行一项简单的任务。您输入要存档的任何网页的网址,存档服务会对其进行爬网并将所有图像和其他外部文件保存到自己的服务器上,以便有效地镜像页面。(web address)
归档网页可以通过多种方式发挥作用。最明显的是保留网页以防万一以后被删除,但它也可以用于从源复制内容,同时避免某些障碍。
一个例子是避免给网站带来广告收入(ad revenue)。记者经常存档包含他们不想宣传但仍要报道的内容的网页,这样他们就不会直接向该网站发送流量。
无论出于何种原因,您需要存档网页,都有一些服务优于其他服务。让我们谈谈最好的网站。
互联网档案
Internet Archive通常被称为Wayback Machine,是网络上领先的归档服务。如果您对十年或更长时间前的网站感到怀旧,并且想重新访问它,那么Internet Archive很有可能拥有它的一些快照。
但是,您知道您可以分配 Internet Archive 的 Web 服务器来创建您感兴趣的任何页面的即时快照吗?通过导航到此页面(this page),您可以开始此过程。
在立即保存页面(Save Page Now)列下,只需输入您希望Internet Archive抓取和保存的任何页面的URL 。在按下 Enter(Enter)键或SAVE PAGE按钮后,您将被带到正在归档的页面。
在几秒钟内,根据页面的大小,Internet Archive将创建一个永久快照。
存档.今天
Archive.today正在迅速成为一种流行的存档服务,可能是因为它的使用和搜索非常简单。
您需要的一切都在一个页面上,其中包括一个用于归档页面的字段(顶部)和另一个用于搜索已保存快照的字段(底部)。
选择archive.today 作为您的归档服务的最佳理由之一是它的书签。如果您将archive.today小(archive.today) 书签按钮(bookmarklet button)拖放到浏览器的书签栏中,您可以导航到要创建快照的任何页面,然后单击小书签。
这将打开一个新页面并立即开始保存过程。
存档.st
Archive.st通过其服务将归档服务的最小化提升到一个新的水平。
如果reCAPTCHA 复选框没有(reCAPTCHA checkbox doesn)让您失望,Archive.st是一种快速简便的归档解决方案。它不仅会创建页面的镜像,还会为您生成整页截图。
对于
已归档的URL , (URLs)Archive.st将显示错误并为您提供指向其最新快照的链接。但是,您只需再次单击
存档(Archive)按钮即可强制保存新快照。
在线(Online)
存档服务作为防止内容消失的额外措施非常有用。虽然它们并非万无一失(t foolproof),但使用其中几个是确保您需要的 Web 数据能够长期存在的好方法。
如果您正在寻找一种更本地化的方式来保存Web 内容(web content),请查看我们最近关于如何将网页保存到Word文档的文章。
The 3 Best Sites To Use For Archiving Webpages
Internеt archiνing services are a very important part of preserving an open and transparent internet. Webpages and social media posts aren’t guaranteеd to remain online forever. But using an archiving service рrovidеs peace of mind in knowing that each important document, tweet, or any other piece of web media will be around when you nеed it.
These archiving services perform a simple task. You enter the web address of any webpage that you want to be archived, and the archiving service crawls it and saves all images and other external files to their own servers so that the page is effectively mirrored.
Archiving a webpage can be useful in several ways. The most obvious is to preserve a webpage in case it’s later deleted, but it can also be used to copy content from a source while avoiding certain barriers.
One example is to avoid giving websites ad revenue. Journalists often archive webpages that include content they don’t want to promote, yet still report on, so that they aren’t sending traffic directly to the site.
For whatever reason you need to archive a webpage, there are a few services that stand above the rest. Let’s talk about the best sites.
Internet Archive
Commonly referred to as the Wayback Machine, Internet Archive is the leading archiving service on the web. If you’re ever feeling nostalgic about a website from a decade or longer ago, and you want to revisit it, there’s a good chance that Internet Archive has a few snapshots of it.
However, did you know that you can assign Internet Archive’s web servers to create an instant snapshot of any page you’re interested in? By navigating to this page, you can begin this process.
Under the Save Page Now column, simply input the URL of any page that you’d like for Internet Archive to crawl and save. After hitting the Enter key or SAVE PAGE button, you’ll be taken to the page while it’s being archived.
Within a few seconds, depending on how large the page is, Internet Archive will create a permanent snapshot.
archive.today
Archive.today is quickly becoming a popular archive service, likely because of how simple it is to use and search.
Everything
you’ll need is on a single page, which includes a field to archive
a page (top) and another to search through saved snapshots (bottom).
One of the best reasons to choose archive.today as your archiving service is because of its bookmarklet. If you drag and drop the archive.today bookmarklet button into your browser’s bookmarks bar, you can navigate to any page you want to create a snapshot of and simply click on the bookmarklet.
This will open up a new page and begin the saving process instantly.
Archive.st
Archive.st takes the minimalization of archiving services to the next level with its service.
If
the reCAPTCHA checkbox doesn’t turn you away, Archive.st is a quick
and easy archiving solution. Not only will it create a mirror of the
page, but it also generates a full-page screenshot for you.
For
URLs already archived, Archive.st will display an error and give you
a link to its latest snapshot. However, you can simply click the
Archive button again to force-save a fresh snapshot.
Online
archiving services work great as an extra measure against
disappearing content. While they aren’t foolproof, using several of
them is a great way to ensure that the web data you need will be
around for a long time.
If you’re looking for a more localized way to save web content, check out our recent article on how to save webpages to Word documents.