Translating OCRed source files
When translating OCRed source files, make sure to choose the new filter for OCR files called ‘Ms Word OCR’, which will suppress nearly all unnecessary tags.
Example segments in CafeTran, using the regular MS Word DOCX filter:




Same segments in CafeTran, using the special MS Word OCR DOCX filter:




Some real-world examples:
Regular Word filter:


Special OCR filter:


See also: Using Condense to OCR snippets