The recognition process starts with a PDF file from the displayed in the Adobe Acrobat Professional window.
Pull down Paper Capture window.→ → . This brings up the
Under Pages, select the radiobutton for , or , or a range of pages by page number.
Under Settings, you will see a
short list, and the second line of that list will
display the “PDF Output Style”. If this
element does not say “
Style: Formatted Text & Graphics”, click the
button to bring up the Paper Capture
Settings Menu; click the pulldown menu
Formatted Text &
Graphics”; and click .
Clickto start the recognition process. This will take at least a few seconds. Once it is complete, your page will be redisplayed.
At this point, depending on how well the software has coped with your original, it may have recognized all the text, some of the text, or no text at all. When the software cannot recognize an area of the original, its fallback strategy is to represent that area as an image, instead of text. So in the general case what you now have is a mixture of text and images. You will find out how well it worked after you have saved the document in your desired format.
Click Save As window.→ . This brings up the
At the top of this window is the usual “
U: drive to save the file in your TCC
.doc for MS-Word
.txt for text files, and
In thepulldown menu, select the file format you want to save in. Recommend choices include:
HTML 3.2 (*.htm) HTML 4.01 with CSS 1.00 (*.htm) Microsoft Word Document (*.doc) Rich Text Format (*.rtf) Text (Plain) (*.txt)
Check the file produced by this process, comparing the result to the original to see how well it worked. There will probably be some cleanup necessary. For example, if you exported the file to MS-Word, you may see some parts of the document represented as images. You'll need to retype those. Also, MS-Word representations of the text may be full of strange paragraph formatting and spurious changes to text fonts, styles and size.
If all else fails, you may need to retype the document. Unless your original is quite clean, the author's experience has been that most decent typists can retype it in less time than it will take you to scan the document and clean it up.