Doc⚡split
september 2010 by berberich
Docsplit is a command-line utility and Ruby library for splitting documents into their component parts: searchable UTF-8 plain text (via OCR if necessary), page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages, ...).
ruby
ocr
library
document
image
tesseract-ocr - Google Code
july 2007 by berberich
Open source OCR conversion engine.
ocr
opensource
google
software
code
development
documentation
engine
ocropus - Google Code
june 2007 by berberich
Open source document analysis and OCR system.
ocr
google
opensource
software
code
tools