MS Word比您想象的更强大。流行的文字处理工具(word processing tool)用于各种任务。不是每个人都知道这一点,但它也可以用来打开HTML 格式(HTML format)的网页。
在这篇文章中,您将了解如何在Word中打开(Word)HTML文件并将它们保存为一种受支持的文件格式。
将网页保存到 Word
最简单但通常结果最差的方法是简单地将网页保存到您的计算机,然后尝试在Word中打开它。
首先,打开您的浏览器并转到您要保存的网页。在下面的示例中,我们使用的是Google Chrome,但该过程应该适用于任何网络浏览器。
右键单击(Right-click)页面
上的任意位置,然后单击(page and click) 另存为(Save As)。
这将提示另存为框(Save As box)弹出。根据自己的喜好编辑文件名。在Save As Type下,选择Webpage, HTML Only。单击保存(Save)。
启动MS Word,然后打开要编辑的HTML 文件。(HTML file)
HTML 文件(HTML file)将在Word中打开。但是,文档本身不会像在线那样显示。对于初学者,格式将被破坏。例如,导航栏(navigation bar)将被分解,菜单将显示在单独的行中。
您将不得不手动编辑文档以使页面更具凝聚力。
删除无用的链接和网页元素将使文章更易于阅读。使用 Word 的Web 布局(Web Layout)视图使清理过程变得更加容易。
经过一些编辑后,您的文档应该有点类似于它的源代码。
当您对编辑感到满意时,请转到文件(File)
>另存为(Save As)。选择.docx作为您的文件格式。
您现在应该能够在Microsoft Word中打开该文件。
使用替代解决方案
如果您对第一个解决方案不满意,请知道还有其他方法可用于保存网页并在Word中打开它们。
有Google Chrome扩展程序,例如将网页另存为 Word 文档(Save Webpage As Word Document),可让您将HTML 文件(HTML file)下载为Word 文档(Word document)。我还在下面提到了一个在线转换工具(conversion tool)。
虽然最终产品(end product)看起来完全一样,但该工具使您不必将网页下载为HTML 文件(HTML file)。安装扩展并激活它。转到要转换的页面。单击扩展并选择(extension and select) Complete Page。
然后该工具会将整个网页下载为Word 文档(Word document)。
提示:(Tip:)您还可以突出显示页面的一部分并单击(page and click) 当前选择(Current
Selection)。这将允许您下载特定部分而不是整个页面。
您也可以尝试复制整个网页并将其粘贴到Word中。
但是,您不必像往常一样粘贴文件,而是必须使用Paste Special。右键(Right-click)单击
Word。在Paste下,应该有几个选项。选择保留源格式(Keep Source
Formatting)。
此粘贴选项(paste option)
可让您复制HTML 代码(HTML code),同时保持大部分格式不变。
但是请注意,没有完美的解决方案。一旦文档在Word中,您可能需要进行一些编辑以使文档与源材料(source material)相似。
完成所有必要的更改后,转到文件菜单(file menu)并将文档另存为 .docx。
在线转换工具
最后一种方法,有时比上面两种方法可以得到更好的结果,就是使用在线工具。给我带来最好结果的是Convertio。
您可以选择下载的HTML 文件(HTML file),也可以单击最右侧的链接图标,简单地粘贴任何网站的(link icon)URL。如果您愿意,将弹出一个框,您可以在其中输入多个URL。单击(Click)提交,它将(Submit)分析您输入的URL 。(URLs)
完成后,确保为文件类型选择了(file type)Doc,然后单击大转换(Convert)按钮。
转换完成需要一点时间,具体取决于网页的大小。就我而言,它花了一分钟多的时间,因为我想将Wikipedia上的Mona Lisa 页面(Mona Lisa page)转换为Word。
如您所见,它确实看起来像Word中的网页,但在这种情况下,它只是将网页转换为图像并将其放入Word中。没有文本是可编辑的或任何东西。
对于较小的网页,我对其进行了测试,它确实将文本作为可编辑文本引入。所以这真的取决于网页的复杂程度和规模。
您还可以尝试其他一些工具,看看是否能获得更好的结果:
https://www.onlineconverter.com/html-to-docx
https://cloudconvert.com/html-to-doc
https://www.coolutils.com/online/HTML-to-DOC
希望上述方法之一已经为您提供了您想要的结果。这不是一个非常漂亮的过程,不幸的是,并没有太多好的工具来实现这一点,因为这不是人们经常做的事情。
How to Save Web Page to Word Document Fast and Easy
MS Word is more powerful than you may think. The popular word processing tool iѕ usеd for all sorts of tasks. Not everyonе knows this but it can also be used to open web pages in HTML fоrmat.
In this post,
you’ll discover how you can open HTML files in Word and save them in one of the
supported file formats.
Saving
Web Pages to Word
The easiest method, but with the worst results usually, is to simply save the webpage to your computer and then try to open it in Word.
First, open your browser and go to the webpage you want to save. In the example below, we’re using Google Chrome but the process should work for any web browser.
Right-click
anywhere on the page and click Save As.
This will prompt
the Save As box to pop up. Edit the filename to your liking. Under Save As Type, select Webpage, HTML Only. Click Save.
Launch MS Word, then open the HTML file you want to edit.
The HTML file will open in Word. However, the document itself will not display as it does online. For starters, the formatting will be ruined. The navigation bar, for example, will be broken up and menus are displayed in separate lines.
You will have to
edit the document manually to make the page more cohesive.
Removing useless
links and web elements will make articles easier to read. Using Word’s Web Layout view makes the cleanup
process that much easier.
After some edits,
your document should somewhat resemble its source.
When you’re
satisfied with your edits, go to File
> Save As. Select .docx as your file format.
You should now be
able to open the file in Microsoft Word.
Using
Alternative Solutions
If you’re not
satisfied with the first solution, know that there are other methods available
for saving web pages and opening them in Word.
There are Google Chrome extensions like Save Webpage As Word Document that lets you download the HTML file as a Word document. I also mention a online conversion tool down below.
While the end product looks exactly the same, this tool spares you from having to download the webpage as an HTML file. Install the extension and activate it. Go to the page you want to convert. Click the extension and select Complete Page.
The tool will
then download the entire webpage as a Word document.
Tip: You can also highlight a
section of the page and click Current
Selection. This will allow you to download a specific section instead of
the whole page.
You can also try
copying the entire webpage and pasting it in Word.
But instead of
pasting the file as normal, you’d have to use Paste Special. Right-click on
Word. Under Paste, there should be
several options. Select Keep Source
Formatting.
This paste option
lets you copy the HTML code while keeping most of the formatting intact.
Do note however
that there is no perfect solution. Once the document is in Word, you’d likely have
to make a couple of edits to make the document resemble the source material.
Once you’re finished making all the necessary changes, go to the file menu and save the document as .docx.
Online Conversion Tools
The last method, which can sometimes get better results than the two method above is to use an online tool. The one that got me the best results was Convertio.
You can either choose an HTML file that you downloaded or you can click the link icon at the far right to simply paste in a URL for any website. A box will pop up where you can enter more than one URL if you like too. Click on Submit and it will analyze the URLs you entered.
Once complete, make sure Doc is selected for the file type and then click on the big Convert button.
It will take a little time for the conversion to complete, depending on how large the web page is. In my case, it took over a minute because I wanted to convert the Mona Lisa page on Wikipedia into Word.
As you can see, it really does look like the webpage in Word, but in this case, it simply converted the webpage into images and put those into Word. None of the text is editable or anything.
For a smaller webpage, I tested it and it does bring in the text as editable text. So it really depends on how complex and large the webpage is.
You can also try out a couple of other tools to see if you get better results:
https://www.onlineconverter.com/html-to-docx
https://cloudconvert.com/html-to-doc
https://www.coolutils.com/online/HTML-to-DOC
Hopefully, one of the methods above has gotten you the result you desire. It’s not a very pretty process and unfortunately there aren’t really that many good tools to accomplish this since it’s not something people do very often.