Scanning paper into digital documents is an essential part of building an environmentally friendly workplace. But it raises the problem of how to edit those scanned documents. Luckily, Google Drive OCR offers a convenient way to work with scanned files.
In this article, we will cover several facts about Google Drive OCR and walk through the steps to OCR PDFs, images, and even handwritten files with Google OCR. We also offer solutions for when Google Drive OCR is not working.
But what is Google OCR? The answer begins with a brief explanation of what OCR is.
OCR, short for Optical Character Recognition, refers to the mechanical or electronic conversion of image-based files into machine-encoded text.
Google Drive is the free cloud storage service offered by Google for storing and sharing photos, videos, documents, and other files. It works like a flash drive or hard disk, except that it is located in the cloud.
Google Docs is the online platform in Google Drive that helps users manage files online: users can open, view, edit, share, and sync files in Google Docs. When you open a scanned PDF or image, Google Docs performs OCR on the file to make it editable and searchable.
So Google OCR, also called Google Drive OCR or Google Docs OCR, refers to the OCR performed by Google Docs, the web-based office suite in Google Drive, where Google lets users store and sync files. It helps users convert image-based files to editable formats.
Tesseract, meanwhile, is an open-source OCR engine sponsored by Google, and it is believed that Google Drive/Docs OCR uses Tesseract to provide its online OCR service.
To understand how Google Drive OCR can help, here are 6 facts you need to know before using it.
Yes, Google Drive OCR is 100% free.
Unlike many other online OCR services, Google Drive OCR is free no matter how many files or pages you want to process. The only exception is when you reach the storage limit of the free Google Drive plan, in which case you will need to pay for more storage space for your scanned files. However, this rarely happens.
Even the Tesseract engine is open source, and it is widely used by software developers to build their own OCR tools.
Supported Input---Scanned PDF, Images (JPG, GIF, PNG, BMP), Handwritten files.
In fact, Google Docs accepts uploads in various formats, but its OCR only works when you import image-based files, such as scanned PDF, JPG, GIF, PNG, BMP, and handwritten files. Other image formats, such as TIFF or PSD, are not supported by Google Docs OCR.
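As a rough illustration of the input rule above, the following Python sketch checks a file name against the formats listed here before you upload it. The function name `is_ocr_candidate` and the extension set are assumptions drawn from this article's list, not an official Google specification. Note that an extension check cannot tell a scanned PDF from a native one.

```python
from pathlib import Path

# Extensions taken from the list in this article (an assumption, not an
# official Google specification). ".jpeg" is included as an alias of JPG.
OCR_INPUT_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".gif", ".png", ".bmp"}


def is_ocr_candidate(filename: str) -> bool:
    """Return True if the file extension is one the article lists as OCR input."""
    return Path(filename).suffix.lower() in OCR_INPUT_EXTENSIONS


if __name__ == "__main__":
    # Print a quick verdict for a few sample file names.
    for name in ["invoice.PDF", "note.png", "scan.tiff", "poster.psd"]:
        print(name, is_ocr_candidate(name))
```

The comparison is case-insensitive, so `invoice.PDF` is accepted, while `scan.tiff` and `poster.psd` are rejected in line with the list above.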
Supported Output---Docx, ODT, RTF, PDF, TXT, HTML, ePub
That is to say, you can use Google Drive OCR to save an image-based file in any of these 7 formats. You can also copy and paste the text into your preferred word processor if you want to export to another format.
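If you ever script the download rather than using the menu, each of these 7 formats corresponds to a standard MIME type. The sketch below is a hypothetical helper (the names `EXPORT_MIME_TYPES` and `export_mime_type` are ours, not Google's). The MIME strings are the commonly used ones for these formats, but it is worth checking them against the current Google Drive export documentation before relying on them.

```python
# Map the 7 output formats named in this article to their usual MIME types.
# Whether Google Drive accepts each one for export is an assumption to verify.
EXPORT_MIME_TYPES = {
    "docx": "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
    "odt": "application/vnd.oasis.opendocument.text",
    "rtf": "application/rtf",
    "pdf": "application/pdf",
    "txt": "text/plain",
    "html": "text/html",
    "epub": "application/epub+zip",
}


def export_mime_type(fmt: str) -> str:
    """Return the MIME type for a format name such as 'docx', '.PDF' or 'ePub'."""
    key = fmt.strip().lower().lstrip(".")
    try:
        return EXPORT_MIME_TYPES[key]
    except KeyError:
        supported = ", ".join(sorted(EXPORT_MIME_TYPES))
        raise ValueError(f"unsupported format {fmt!r}; expected one of: {supported}") from None
```

Any format outside the list, for example `xlsx`, raises a `ValueError`, which mirrors the copy-and-paste fallback described above.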
Google Drive OCR can read more than 100 languages. That is to say, as long as you live on Earth, it is very likely that your files can be recognized by Google OCR.
But according to users, accuracy differs between languages. If you are running OCR on an English file, the accuracy is generally high and little manual correction is needed; but if you are working with Chinese or Japanese, the accuracy is relatively lower and you will need to double-check the OCR results carefully.
Yes, it is good news that Google Drive OCR supports handwritten files. However, its handwriting recognition still needs improvement.
I uploaded a handwritten file to Google Drive and opened it with Google Docs. The OCR result is displayed as follows:
As you can see, although Google Drive OCR recognizes most of the words, it puts them in the wrong place and retains no formatting. To get the most out of Google Drive OCR on handwritten files, here are some tips:
Tips: A native PDF is a PDF that contains editable and searchable text, not just a picture whose text cannot be selected or searched.
This is quite important, since Google Drive OCR only works when you import an image-based file for conversion. It can only recognize text if your file is a scan or an image of text.
If you upload a native PDF that contains some images, Google Docs won't recognize the text in those images. It omits all the images and displays only the text:
So if you want to OCR an image inside a native PDF, you should extract the image or save the PDF as an image first, then upload it for Google Drive OCR.
Before the step-by-step tutorials on Google OCR, you will need to go to the Google sign-up page and create a Google Account. Once you have an account, let's get started with the following 2 tutorials on using Google Drive OCR.
Find the uploaded scanned PDF or image in the right panel, right-click it, and open it with Google Docs.
Wait for Google OCR to process the file automatically; the scanned PDF or image will then be editable in Google Docs.
As you may notice, Google Docs OCR keeps both the original image and the newly generated text, so you need to delete the image and edit the text as needed.
Go to File>Download as and export the scanned PDF to Word, ePub, Text, or any other output format Google OCR supports.
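The same steps can in principle be automated. The sketch below assumes the Google Drive API v3 through the third-party `google-api-python-client` package and an already authorized `service` object; neither is part of this tutorial, and authorization is left out entirely. `ocr_upload_params` and `ocr_to_text` are hypothetical helper names. The idea is that uploading an image or PDF while asking Drive to convert it to a Google Docs document triggers OCR, with `ocrLanguage` passed as an optional language hint.

```python
import mimetypes
from pathlib import Path

# MIME type that asks Drive to convert the upload into a Google Docs document.
GOOGLE_DOC_MIME = "application/vnd.google-apps.document"


def ocr_upload_params(path: str, language: str = "en") -> dict:
    """Build the arguments for a Drive v3 files.create call that converts on upload."""
    source_mime, _ = mimetypes.guess_type(path)
    if source_mime is None:
        raise ValueError(f"cannot guess the MIME type of {path!r}")
    return {
        "body": {"name": Path(path).stem, "mimeType": GOOGLE_DOC_MIME},
        "source_mime": source_mime,
        "ocrLanguage": language,  # language hint for OCR, e.g. "en" or "ja"
    }


def ocr_to_text(service, path: str, language: str = "en") -> str:
    """Upload a scan, let Drive OCR it, and export the result as plain text.

    `service` is assumed to come from
    googleapiclient.discovery.build("drive", "v3", credentials=...).
    """
    # Third-party import kept local so the helper above works without it.
    from googleapiclient.http import MediaFileUpload

    params = ocr_upload_params(path, language)
    media = MediaFileUpload(path, mimetype=params["source_mime"])
    created = service.files().create(
        body=params["body"],
        media_body=media,
        ocrLanguage=params["ocrLanguage"],
        fields="id",
    ).execute()
    exported = service.files().export(
        fileId=created["id"], mimeType="text/plain"
    ).execute()
    # The export is normally returned as bytes; decode it if so.
    return exported.decode("utf-8-sig") if isinstance(exported, bytes) else exported
```

With an authorized service in hand, a call such as `ocr_to_text(service, "scan.png")` would correspond to the open, wait, and download steps above. The image that Google Docs keeps alongside the text is not part of a plain-text export.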
If you want to use Google Drive OCR to search within a file, you can turn the file into a searchable PDF with Google online. The steps to make a PDF searchable with Google Drive OCR are quite similar to the steps above for converting PDFs or images to editable formats. Here they are in simplified form:
As you may have noticed, the accuracy of Google OCR can vary greatly across different types of scanned documents. Compare the original file with the editable file produced by Google OCR:
In a case like the one above, you will need to manually fix the OCR errors and formatting problems in Google Docs. There are even situations where Google Drive OCR won't work at all. What to do? Here are some tips to try before looking for an alternative to Google Drive OCR:
To get around the unsolved issues of Google OCR, Mac users can try Cisdem PDF Converter OCR, a dedicated OCR PDF converter for Mac that exports PDFs to 16 formats. It supports batch OCR and maintains the original file formatting.
If you are a Windows user, Readiris 16 can serve as a good Google OCR alternative thanks to its OCR performance: accurate results and retention of the original file resolution and formatting.
Online2pdf is a free online OCR service that converts scanned PDFs and images into PDF, Docx, ODT, XLSX, PPTX, TXT, RTF, ODS, and image formats. But it only recognizes 6 languages (English, German, French, Spanish, Italian, Portuguese).
Google OCR really does make it convenient to manage scanned documents online. Though Google OCR is limited in features, the free solution it offers for OCR on PDFs is worth a try for everyone. And if it falls short, you still have Google OCR alternatives to OCR your PDFs or handwritten files with the results you expect.
Connie has been writing about Mac productivity and utility apps since 2009. Each review and solution is based on her practical tests, and she is always energetic and trustworthy in this field.