[Libre-soc-bugs] [Bug 602] low performance bare minimum functionality SIMD emulator required

bugzilla-daemon at libre-soc.org bugzilla-daemon at libre-soc.org
Mon Jun 7 09:12:36 BST 2021


https://bugs.libre-soc.org/show_bug.cgi?id=602

--- Comment #14 from Jacob Lifshay <programmerjake at gmail.com> ---
(In reply to Luke Kenneth Casson Leighton from comment #11)
> yes, we are however using OCR being developed by richard before
> doing it by hand.  this will save massive amounts of time.

I ended up looking through Wikipedia's list of OCR programs, and I noticed
Tessarect (and several others) supports outputting to hOCR format, an
HTML-based format, which seems like it would be waay easier to parse than
trying to manually roll-your-own text column/row/formatting detector based on
Octave and FFTs...

hOCR:
http://kba.cloud/hocr-spec/1.2/

-- 
You are receiving this mail because:
You are on the CC list for the bug.


More information about the libre-soc-bugs mailing list