LWN.net Logo

Google's Tesseract OCR engine is a quantum leap forward (Linux.com)

Nathan Willis looks at the Tesseract Optical Character Recognition (OCR) engine on Linux.com. "The Tesseract code was written at Hewlett-Packard in the 1980s and '90s. In 1995, it was one of the top-tier performers at UNLV's OCR competition, but when HP withdrew from the OCR software marketplace, the code languished. Then in 2005, HP handed off the code to UNLV's Information Science Research Institute (ISRI), an academic center doing ongoing research into OCR and related topics. ISRI discovered that original Tesseract developer Ray Smith was now an employee at Google, and asked the search engine giant if it was interested in the code. Google spent a few months updating the code to compile on modern operating systems, and released it on SourceForge.net."
(Log in to post comments)

quantum leap

Posted Sep 28, 2006 18:14 UTC (Thu) by dark (subscriber, #8483) [Link]

Okay... anyone who knows what a tesseract is should also know that a quantum leap is the smallest possible increment.

quantum leap

Posted Sep 28, 2006 18:26 UTC (Thu) by gravious (subscriber, #7662) [Link]

agreed - ever since this was pointed out to me by mr. bill bryson in his incomporable book troublesome words the ql phrase has become a source of irritation and mirth for me. the older i get the more i realise how mangled the english langwidge is for common usage and you may as well enrol in advanced cat-herding 501 for all the good it will do you pointing this stuff out to people. btw - iana (i am not anal).

respeck, Anto

quantum leap

Posted Sep 28, 2006 18:44 UTC (Thu) by avik (guest, #704) [Link]

That's okay, considering the code wasn't touched for ten years.

quantum leap --- Akin to the leap from classical to quantum physics

Posted Sep 28, 2006 18:50 UTC (Thu) by AnswerGuy (guest, #1256) [Link]

As amusing as this observation might be, I think it's reasonably well-understood by those who use and understand the phrase that "quantum leap" is idiomatic (rather than idiotic).

A leap of only one quanta would, of course, be an incremental advance. However to say that this is a "leap" that's akin to the one that lead from classical physics into quantum physics. Also there is the implication that the result could not possibly have been attained by smaller increments ... that their is, figuratively, some chasm that most be lept across --- a paradigm shift (where they old pattern of progress no longer applies).

Of course terms like "quantum leap" and "paradigm shift" are used rather loosely. Often they are hype rather than the result of any real insights into the process by which any particular points of progress have been achieved.

JimD

quantum leap --- Akin to the leap from classical to quantum physics

Posted Sep 28, 2006 21:26 UTC (Thu) by drag (subscriber, #31333) [Link]

Heh.

Personally I am more interested in how the stupid software performs and when I can expect a plugin for Xsane or OpenOffice.org whatever!

Also a plugin for spamassasin to catch those fools trying to get past spam filters by using jpegs instead of text.

:)

quantum leap

Posted Sep 28, 2006 19:27 UTC (Thu) by cpm (guest, #3554) [Link]

Yeah,
Kinda like Penultimate.

Furthermore

Posted Sep 28, 2006 22:20 UTC (Thu) by felixfix (subscriber, #242) [Link]

Anyone who knows what a quantum leap is AND has even rudimentary language skills (in any language) is will readily understand that it is not the size of the leap under consideration. It is a reference to nothing smaller being possible and of the importance of the change. If I say it is heads and shoulders above the competition, would any of you actually condescend to point out that OCR programs are not humans? If I say it is light years ahead of the competition, how many of you will belabor the obvious, that prorgams don't have physical distances between them?

Ice changes to water over a miniscule temperature rise, but anyone who knows what a phase change is will understand the importantance of that final small temperature difference compared to the previous same change. Anyone who has stood at the edge of a deck or roof or mountain will understand how important that last step is compared to the step before.

Only the truly arrogant and misinformed quibble about the meaning of "quantum leap" in every day usage, as opposed to a quantum mechanics environment. It is smugness at its worst.

Have you looked in a mirror lately?

Posted Sep 29, 2006 12:17 UTC (Fri) by dark (subscriber, #8483) [Link]

My arrogance meter went "ping" on my post. But it actually exploded and caught fire when I tried to measure yours.

quantum leap

Posted Sep 29, 2006 10:04 UTC (Fri) by anandsr21 (guest, #28562) [Link]

Thats like saying noses should not run ;-). Well its English deal with it.

Google's Tesseract OCR engine is a quantum leap forward (Linux.com)

Posted Sep 28, 2006 23:54 UTC (Thu) by stephenjudd (subscriber, #3227) [Link]

"Google spent a few months updating the code to compile on modern operating systems"

Hah. I tried to compile it last weekend and gcc complained about bad code almost straight away. Figuring out why is on my to-do list now.

Compiling

Posted Sep 29, 2006 1:01 UTC (Fri) by gjmarter (subscriber, #5777) [Link]

Someone on the distributed proofreaders forum is also working on this. It compiled but apparently did not work

Copyright © 2006, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds