|
|
Log in / Subscribe / Register

Faces of Open Source

Faces of Open Source

Posted Feb 18, 2022 11:40 UTC (Fri) by Wol (subscriber, #4433)
In reply to: Faces of Open Source by Ross
Parent article: Lorinda Cherry RIP

Changing topic slightly, I'm just mucking about with OCR. Why can't OCR process a PLAIN TEXT FILE CORRECTLY?

My document looks like it came out of nroff, with nicely aligned sections and indents. I don't know how it was OCR'd (probably with one of Google's Artificial Stupidity tools), but the OCR software has decided that the document is formatted in columns (no it isn't), so all the text fragments are all over the place and it's a fing nightmare trying to put it all back together again!

If it's a simple document, ffs OCR it as a simple document! Left-to-right, top-to-bottom, DON'T assume fancy formatting that isn't there!

Cheers,
Wol


to post comments


Copyright © 2026, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds