pwang + data   58

Changing the World with Open Data | Engine Yard Blog
List of links to sources of open and government data
data 
november 2011 by pwang
Advanced In-Database Analytics | Vertica
Vertica is an Analytics Platform with columnar storage, massively parallel processing, and extensions to SQL for in-database analytics and User-Defined Functions
database  data  parallel 
february 2011 by pwang
Needlebase
Online database scraper tool from ITA - scrape & publish tables - free datasets available
data  database  app  web 
february 2011 by pwang
Mattan Erez : Research
Researcher behind Merrimac stream processor
stream  data 
january 2011 by pwang
Apache Pig Tutorial
a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
data  stream  parallel 
january 2011 by pwang
KDE = P2P desktop or Join the global magnetic field!
Using distributed torrents to seamlessly provide and share data; using magnet URIs the same way as a traditional file
data  internet  programming 
january 2011 by pwang
