donturn + database   30

Welcome to Hive!
Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time, this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.
data  database  hadoop  opensource  nosql 
november 2011 by donturn
Welcome to Apache Pig!
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turn enables them to handle very large data sets.
apache  data  hadoop  mapreduce  opensource  sql  database  datascience 
september 2011 by donturn
Needlebase
merge data, crawl web data, then chart and explore it
api  data  database  datascience  kdd  etl  stats  charts  map 
february 2011 by donturn
Computing at Scale » Blog Archive » From SQL To Parallel
Good to see more people discussing this. As a long-time SQL developer, but also a skeptic when needed, I think we need to admit that SQL is not always the best solution.
database  sql  mapreduce  hadoop  flatfile  rdbms  dbms 
march 2008 by donturn
MailSteward
SQL or SQLite database for organizing and importing email
app  archive  backup  database  email  mac  mail  mail.app  mis  pim  search  tags  utilities 
february 2008 by donturn
Ruby on Rails
Web db and interface development for almost mere mortals.
database  php  rails  ruby  semantic_web  web2 
january 2006 by donturn
