boywhoroared + ocr 2
DocSplit
october 2010 by boywhoroared
Looks great for a little project involving web comics that I've always wanted to do. Will have a look see :)
Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text via OCR if necessary, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
ruby
library
ocr
text
images
documents
Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text via OCR if necessary, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
october 2010 by boywhoroared
pytesser - Project Hosting on Google Code
february 2010 by boywhoroared
OCR for Python. :)
ocr
python
library
february 2010 by boywhoroared