al3x + hadoop   13

aboisvert/revolute
"a rich SQL-like querying language (embedded DSL) inspired by Apache Hive and Scala Query" [...] "essentially a thin layer on top of Cascading"
hadoop  scala  library  cascading  bigdata  query 
6 weeks ago by al3x
Hama
"a distributed scientific package on Hadoop for massive matrix and graph data"
hadoop  parallel  graph  matrix 
april 2010 by al3x
Matei Zaharia - Spark
"a simple cluster computing framework built on top of Nexus"
scala  cluster  distributed  hadoop 
march 2010 by al3x
Dumbo
"a Python module that allows you to easily write and run Hadoop streaming programs"
mapreduce  python  hadoop 
june 2008 by al3x
SHadoop
Beat to the punch!
scala  hadoop  mapreduce  java 
may 2008 by al3x
hadoop-streaming
"a collection of libraries which make writing Hadoop Streaming tasks easy". Coda understands my needs.
hadoop  python  library 
april 2008 by al3x
Cascading
Intends to make Hadoop easier. "Cascading uses a "pipe and filters" model for defining data processes. It efficiently supports splits, joins, grouping, and sorting. These are the only processing concepts the developer needs to "think" in."
hadoop  mapreduce 
april 2008 by al3x
Perspectives - Services
James Hamilton's notes from the Hadoop Summit, 2008.
hadoop  conference  blog 
march 2008 by al3x
JAQL
"a new query language being developed for JSON data"
hadoop  json  query  language 
december 2007 by al3x

Copy this bookmark:



description:


tags: