donturn + sql   12

Welcome to Apache Pig!
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets.
apache  data  hadoop  mapreduce  opensource  sql  database  datascience 
september 2011 by donturn
Computing at Scale » Blog Archive » From SQL To Parallel
good to see more people discussing this. as a long time sql developer but also a skeptic when needed, we need to admit that it's not always the best solution.
database  sql  mapreduce  hadoop  flatfile  rdbms  dbms 
march 2008 by donturn

Copy this bookmark:



description:


tags: