Welcome to Hive!
november 2011 by donturn
Hive is a data warehouse system for Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop compatible file systems. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.
data
database
hadoop
opensource
nosql
november 2011 by donturn
Welcome to Apache Pig!
september 2011 by donturn
Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization, which in turns enables them to handle very large data sets.
apache
data
hadoop
mapreduce
opensource
sql
database
datascience
september 2011 by donturn
Needlebase
february 2011 by donturn
merge data, crawl web data, then chart and explore it
api
data
database
datascience
kdd
etl
stats
charts
map
february 2011 by donturn
The 70 Online Databases that Define Our Planet - Technology Review
december 2010 by donturn
The 70 Online Databases That Define Our Planet - Technology Review
database
data
research
datascience
content
from twitter
december 2010 by donturn
MapReduce: A major step backwards - The Database Column
march 2008 by donturn
a data architecture flame war begins - excellent!
algorithm
database
dbms
dev
mapreduce
google
programming
sql
march 2008 by donturn
Computing at Scale » Blog Archive » From SQL To Parallel
march 2008 by donturn
good to see more people discussing this. as a long time sql developer but also a skeptic when needed, we need to admit that it's not always the best solution.
database
sql
mapreduce
hadoop
flatfile
rdbms
dbms
march 2008 by donturn
Ruby on Rails
january 2006 by donturn
Web db and interface development for almost mere mortals.
database
php
rails
ruby
semantic_web
web2
january 2006 by donturn
related tags
algorithm ⊕ analysis ⊕ analytics ⊕ apache ⊕ api ⊕ app ⊕ archive ⊕ austin ⊕ backup ⊕ bbc ⊕ business ⊕ charts ⊕ cms ⊕ collaborative_filtering ⊕ content ⊕ crawler ⊕ crm ⊕ dashboard ⊕ data ⊕ database ⊖ datascience ⊕ data_mart ⊕ data_mining ⊕ data_warehouse ⊕ dbms ⊕ design ⊕ dev ⊕ docs ⊕ email ⊕ etl ⊕ flatfile ⊕ folksonomy ⊕ google ⊕ graph ⊕ hadoop ⊕ hardware ⊕ ia ⊕ index ⊕ indexing ⊕ information_architecture ⊕ information_retrieval ⊕ informetrics ⊕ interface ⊕ intro ⊕ ip ⊕ ir ⊕ java ⊕ kdd ⊕ km ⊕ kms ⊕ lamp ⊕ linkeddata ⊕ logs ⊕ lucene ⊕ mac ⊕ mail ⊕ mail.app ⊕ map ⊕ mapreduce ⊕ metrics ⊕ mis ⊕ mysql ⊕ nosql ⊕ olap ⊕ ontology ⊕ opensource ⊕ optimization ⊕ organization ⊕ patents ⊕ pearson ⊕ performance ⊕ personalization ⊕ php ⊕ pim ⊕ programming ⊕ quant ⊕ rails ⊕ rdbms ⊕ rdf ⊕ reports ⊕ research ⊕ ruby ⊕ search ⊕ semantic_web ⊕ sematic ⊕ socialgraph ⊕ sql ⊕ startup ⊕ statistics ⊕ stats ⊕ tagging ⊕ tags ⊕ text ⊕ utilities ⊕ ux ⊕ viz ⊕ web ⊕ web2 ⊕ wikipedia ⊕ xml ⊕Copy this bookmark: