Edwin Chen's Blog: Movie recommendations and more via MapReduce and Scalding
february 2012 by tlockney
Excellent post on recommendation techniques with #Scalding (Scala + Cascading)
scala
hadoop
mapReduce
scalding
How to process a million songs in 20 minutes « Music Machinery
september 2011 by tlockney
Cool! How to process a million songs in 20 minutes #mapreduce #cloudanalytics
cloudanalytics
mapreduce
from twitter_favs
Parallel Processing Using the Map Reduce Programming Model :: myNoSQL
august 2011 by tlockney
Parallel Processing Using the Map Reduce Programming Model #MapReduce #Hadoop
MapReduce
Hadoop
from twitter_favs
University of Washington: Problem Solving on Large Scale Clusters - Google Code University - Google Code
april 2009 by tlockney
Additional reading and labs that go a bit beyond what Cloudera shows in its tutorials -- definitely MUCH more difficult to implement.
hadoop
mapreduce
distributed
clustering
FrontPage - Lucene-hadoop Wiki
july 2007 by tlockney
Hadoop is a framework for running applications on large clusters built of commodity hardware.
mapreduce
distributed
hadoop
cluster
concurrency
related tags
algorithms, amazon, analytics, appengine, architecture, aws, bigdata, bigtable, browser, cascading, cassandra, chukwa, clojure, cloudanalytics, cloudera, cloudmapreduce, cluster, clustering, concurrency, CouchDB, data, datamining, dataprocessing, datawarehouse, db, distributed, dumbo, ec2, elasticmapreduce, erlang, eventmachine, filesystem, gfs, hadoop, hdfs, hive, ids, java, javascript, jdbc, jruby, linearalgebra, logging, lzo, mapreduce, math, mongodb, mule, networking, nosql, OCaml, pig, protobuf, python, r, research, riak, ruby, s3, scala, scalability, scalding, scaling, scribe, snort, statistics, thrift, twitter, virtualbox