sstrudeau + document   4

Doc⚡split
"Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)"
ruby  split  document  parse  search  utility  library  pdf  thumbnail  metadata  text 
december 2009 by sstrudeau
ThurDB
reportedly a SimpleDB-like document-oriented storage DB; site is currently 504
database  document  simpledb  scale  web  cluster  facebook  ec2  scalability  scaling  storage 
june 2008 by sstrudeau

Copy this bookmark:



description:


tags: