Converting scans to word with OCR
Thread poster: Roy Williams
Roy Williams
Roy Williams  Identity Verified
Austria
Local time: 17:14
German to English
Jan 17, 2012

Hi Everyone,

I have a hard copy of a document that I would like to scan and convert to a word file with OCR software.
The document is several pages long and as I've never tried this before, I'm wondering what is the best was to do this without ending up with a separate word file for converted each page.

Any suggestions?

Thanks in advance,

Roy

[Edited at 2012-01-17 13:53 GMT]


 
Sergei Leshchinsky
Sergei Leshchinsky  Identity Verified
Ukraine
Local time: 18:14
Member (2008)
English to Russian
+ ...
Press F1 in you OCR application Jan 17, 2012

...

 
Roy Williams
Roy Williams  Identity Verified
Austria
Local time: 17:14
German to English
TOPIC STARTER
Test version Jan 20, 2012

Hi,

Thanks for responding. Im current using Abby Finereader in trail mode and therefore can save more than one page.

I've tried some free OCRs but the results have been disappointing to say the least. With OCR do you use?


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 17:14
Member (2006)
English to Afrikaans
+ ...
ABBYY FineReader 8.0 Pro Jan 20, 2012

Roy Williams wrote:
The document is several pages long and as I've never tried this before, I'm wondering what is the best was to do this without ending up with a separate word file for converted each page.


I have ABBYY FineReader 8.0 Pro, and when you save the batch, it gives you the option of saving the entire batch as a single file, to save the pages of each source file in a single file named after the source file, or to save the individual pages using a name scheme.

If you're scanning and OCR'ing in one operation, then the OCR program might OCR only the currently scanned page. My suggestion is to scan all the pages to JPG and then use the OCR program to process all of them at once.

I've tried some free OCRs but the results have been disappointing to say the least.


There are no *acceptably good* free OCR systems that I know of. Certain versions of MS Word has it built-in (not sure about that) and some CAT tools also have it (I think WFA has it).


 
Tomás Cano Binder, BA, CT
Tomás Cano Binder, BA, CT  Identity Verified
Spain
Local time: 17:14
Member (2005)
English to Spanish
+ ...
ABBYY FineReader will do nicely Jan 20, 2012

We use ABBYY FineReader in the office.

It works fine for simple documents, but if you have documents with tons of little cells, tables, and diagrammes... I am affraid no tool will yield a perfect result if you don't want to do formatting work yourself.

ABBYY FineReader allows you to save different types of Word documents. Try them all and see which one works best for you. At times, it is best to OCR simple, unformatted text and format it yourself if you know how to use
... See more
We use ABBYY FineReader in the office.

It works fine for simple documents, but if you have documents with tons of little cells, tables, and diagrammes... I am affraid no tool will yield a perfect result if you don't want to do formatting work yourself.

ABBYY FineReader allows you to save different types of Word documents. Try them all and see which one works best for you. At times, it is best to OCR simple, unformatted text and format it yourself if you know how to use Microsoft Word.

If your target language is usually longer than the source language, you might have to enlarge the boxes ABBYY creates, and that always means manual work.

If the document is very complex with many bits and pieces, images, stamps, and images spread over the page, trying to deliver a document that looks like the original will prove to be quite cumbersome, so make sure you include an extra formatting charge in your invoice/quotation.

[Edited at 2012-01-20 12:55 GMT]
Collapse


 
Anna Villegas
Anna Villegas
Mexico
Local time: 09:14
English to Spanish
Microsoft Office Document Imaging Jan 20, 2012

Totally free if you have MS Office:

Scan images and save them, one by one, in the TIFF format (selecting "Save as" from the "File" menu and name it with a "TIFF" format).

Navigate to the "Start" menu and select "Programs," "Microsoft Office Tools" and "Microsoft Office Document Imaging."

From the "File" menu, select "Open" to open your scanned document that has been saved in the *.TIFF format. You can import each image, one by one, until completing the full
... See more
Totally free if you have MS Office:

Scan images and save them, one by one, in the TIFF format (selecting "Save as" from the "File" menu and name it with a "TIFF" format).

Navigate to the "Start" menu and select "Programs," "Microsoft Office Tools" and "Microsoft Office Document Imaging."

From the "File" menu, select "Open" to open your scanned document that has been saved in the *.TIFF format. You can import each image, one by one, until completing the full batch.

From the "Tools" menu, select "Send Text to Word." Or, you can select manually the text to be converted. Click "OK" to confirm. Depending on your computer's speed, the process will take anywhere from a few moments to a minute or two.

When the process is done, Microsoft Word will automatically load your document(s), which you can edit and format as you please.

Collapse


 
Roy Williams
Roy Williams  Identity Verified
Austria
Local time: 17:14
German to English
TOPIC STARTER
Can MemoQ? Jan 31, 2012

Can MemoQ merge documents?

 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Converting scans to word with OCR






Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »