Talk:422: Difference between revisions

Jump to navigation Jump to search
(Created page with "The PDFs that were marked "bad OCR" were done with ABBYY FineReader and have major problems. FineReader overzealously "corrected" what it thinks is skew. See page 54 of 070-0434-02.pdf for an example. In general FineReader mangles the images rather than just adding an invisible text layer. I don't know whether the mangling can be turned off. I stopped using it. Also, see page 4 of 070-0895-00.pdf. Again, it incorrectly "corrected" the skew. Also see the figures on page 1...")
 
m (ah - thanks)
Line 7: Line 7:
and it works fine most of the time. And if it encounters and error, it prints error messages rather than silently mangling your document, like ABBYY FineReader. It isn't really practical to babysit OCR software to make sure it didn't mangle the page images. It needs to be reliable.
and it works fine most of the time. And if it encounters and error, it prints error messages rather than silently mangling your document, like ABBYY FineReader. It isn't really practical to babysit OCR software to make sure it didn't mangle the page images. It needs to be reliable.
[[User:Kurt|Kurt]] ([[User talk:Kurt|talk]]) 11:26, 10 January 2024 (PST)
[[User:Kurt|Kurt]] ([[User talk:Kurt|talk]]) 11:26, 10 January 2024 (PST)
:Thanks, Kurt - now I understand :-) I'll re-OCR them and see what I get... [[User:Qfissler|Qfissler]] ([[User talk:Qfissler|talk]]) 06:53, 11 January 2024 (PST)