jd + data   12

Ookaboo RDF data dump
metadata for nearly 1,000,000 public domain and Creative Commons images of more than 500,000 precise topics such as places, people and organism classifications linked to DBpedia and Freebase.
labelled  image  data 
january 2012 by jd
Data Science Toolkit
toolkit includes file to text (image, pdf, office docs), html to text, text to people, text to times (Chronic Ruby gem), geocoding, etc. tools are hosted, but you can self-host too. results aren't perfect, but interesting nonetheless.
data  machinelearning  analytics 
january 2012 by jd
Protovis - A graphical approach to visualization
high level language, generates SVG for the brwoser
data  visualization 
november 2011 by jd
Welcome | ScraperWiki
collection of scrapers, plus online toolkit for building them
datamining  scraping  data 
march 2011 by jd
generatedata.com
generates data in html, xml, csv, excel, or sql. has a good deal of flexibility. also, have to pay $20 to get over 200 items, and the max is 5000 (software is free, if you want to download and install it)
data 
march 2010 by jd
Bulk Data Downloads: A Breakthrough in Government Transparency - O'Reilly Radar
Tim O'Reilly editorial on open access to government data, with good links
government  data 
march 2009 by jd

Copy this bookmark:



description:


tags: