光学字符识别(Optical Character Recognition)( OCR ) 技术允许您编辑图形图像或扫描文档中的文本。它被广泛用于将书籍和文档转换为电子文件,将办公室的记录保存系统计算机化,或在网站上发布文本。
OCR使编辑文本、搜索单词或短语以及应用机器翻译、文本到语音和文本挖掘等技术成为可能。
Microsoft Office包括Microsoft Document Imaging OCR。但是,还有其他几种免费软件替代品也可以完成这项工作,包括扫描仪或一体机附带的应用程序软件。
本质上,要使用OCR(OCR)扫描图像并将其转换为文本,您需要遵循标准步骤:
- 将文档放在扫描仪床上。
- 在计算机上,使用扫描仪的应用软件和OCR作为类型。
- 选择要进行OCR扫描的预期区域,然后检查预览。
- 扫描并保存文本文件,您可以在任何文字处理应用程序中进行编辑。
免费在线 OCR 网站和服务
虽然您可以使用Google Drive 将图像转换为文本 (OCR),但网络上还有其他一些免费的在线OCR服务允许您在浏览器中对图像或扫描的文档执行OCR 。
- NewOCR.com是一项免费的在线OCR服务。NewOCR.com分析您上传的任何图像文件中的文本,然后将图像中的文本转换为文本。该服务为您提供无限上传和布局分析(多列文本识别)。友好的服务不需要注册,并支持 29 种语言,尽管该网站非常基础和业余。该服务采用任何JPEG、PNG、GIF、BMP和最多 5 MB 的多页TIFF以及最多 20 MB 的多页 PDF 文档。
- Free-OCR.com是另一个免费的在线OCR工具。您可以上传图片文件(JPG、GIF、TIFF BMP或PDF),大小不超过 2MB,宽度不超过 5000 像素,并且每小时限制上传 10 张图片。Free - OCR可以处理具有多列文本的图像,并且还支持 29 种语言。一个奇怪的限制是它在执行OCR时只占用(OCR)PDF文档的第一页。
- OnlineOCR.net服务允许您从PDF扫描文档(包括多页文件)、照片和数码相机捕获的图像中识别文本和字符。它允许用户选择 32 种语言来识别多语言文档并直接转换为PDF、DOC、XLS、RTF、HTML和TXT等多种格式。您可以选择将OCRed文件在线存储在您的安全工作区或将它们下载到您的 PC。
其他一些网站是:
- Totext.net
- 将图像转换为文本.net
- OCRconvert.com
- Ocronline.com
- 将PDF转换为Word.net
- I2ocr.com
- FineReaderOnline.com。
您还可以选择安装应用程序来执行OCR,而不是安装扫描仪的默认程序。
FreeOCR可以从大多数TWAIN和WIA扫描仪进行扫描,还可以打开扫描的 PDF 和多页TIFF图像。免费OCR包括Windows编译的Tesseract免费OCR引擎,也称为Tesseract GUI。FreeOCR是免费软件,包括用于商业用途。FreeOCR需要 .NET Framework v2.0。可在paperfile.net下载。
提示(TIP):有更多适用于 Windows 10 的免费 OCR 软件(free OCR software for Windows 10)可用。
我确信这个列表并不详尽。如果您知道更多信息,为什么不将更多此类服务和应用程序添加到此帖子的评论中?(I’m sure this list isn’t exhaustive. Why don’t you add more such services and applications as comments to this post, if you know of any more?)
Free Online OCR sites and services to extract Text from Image
Optical Character Recognition (OCR) technology allows you to edit text in the graphic image or scanned document. It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website.
OCR makes it possible to edit the text, search for a word or phrase, and apply techniques such as machine translation, text-to-speech and text mining to it.
Microsoft Office includes Microsoft Document Imaging OCR. However, several other freeware alternatives exist which do the job too, including the application software that came with your scanner or all-in-one device.
Essentially, to scan and then convert an image to text using OCR, you need to follow standard steps:
- Place the document on the scanner bed.
- On the computer, use the scanner’s application software and OCR as type.
- Select the intended area to do an OCR scan, and check the preview.
- Scan and then save the text file, and you can edit the same in any word-processing application.
Free Online OCR sites and services
While you can use use Google Drive to convert Images to Text (OCR), there are several other free online OCR services on the web that allow you to perform an OCR on an image or scanned document within your browser.
- NewOCR.com is a free online OCR service. NewOCR.com analyses the text in any image file that you upload, and then convert the text from the image into text. The service gives you unlimited uploads and layout analysis (multi-column text recognition). The friendly service requires no registration, and supports 29 languages, although the site is pretty basic and amateurish. The service takes any JPEG, PNG, GIF, BMP, and multipage TIFF up to 5 MB as well as multipage PDF documents up to 20 MB.
- Free-OCR.com is another free online OCR tool. You can upload your image files (JPG, GIF, TIFF BMP or PDF) not larger than 2MB, no wider or higher than 5000 pixels and there is a limit of 10 image uploads per hour. Free-OCR can handle images with multi-column text and also supports 29 languages. One odd restriction is that it takes only the first page of a PDF document while performing OCR.
- OnlineOCR.net service allows you to recognize text and characters from PDF scanned documents (including multipage files), photographs and digital camera captured images. It allows users to select 32 languages to recognize multilingual documents and converts directly to several formats like PDF, DOC, XLS, RTF, HTML, and TXT. You can choose to store the OCRed files online in your secure workspace or download them to your PC.
Some other sites are:
- Totext.net
- ConvertImagetoText.net
- OCRconvert.com
- Ocronline.com
- ConvertPDFtoWord.net
- I2ocr.com
- FineReaderOnline.com.
You can also choose to install an application to perform OCR instead of the scanner’s default program.
FreeOCR can scan from most TWAIN and WIA scanners and can also open scanned PDF’s and multipage TIFF images. Free OCR includes the Windows compiled Tesseract free OCR engine also known as a Tesseract GUI. FreeOCR is freeware, including for commercial use. FreeOCR requires the .NET Framework v2.0. It is available for download at paperfile.net.
TIP: There are more free OCR software for Windows 10 available.
I’m sure this list isn’t exhaustive. Why don’t you add more such services and applications as comments to this post, if you know of any more?