olleolleolle + textproc   1

Doc⚡split
command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text via OCR if necessary, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)
ruby  tools  textproc 
december 2010 by olleolleolle

related tags

ruby  textproc  tools 

Copy this bookmark:



description:


tags: