Edwin Chen's Blog: Movie recommendations and more via MapReduce and Scalding
february 2012 by tlockney
Excellent post on recommendation techniques with #Scalding (Scala + Cascading)
scala
hadoop
mapReduce
scalding
february 2012 by tlockney
Artem Yankov - Hadoop 1.0 + MongoDB: the Beginning
january 2012 by tlockney
Hadoop 1.0 + MongoDB: the Beginning
hadoop
mongodb
from twitter_favs
january 2012 by tlockney
sysadvent: Day 10 - Analyzing Logs with Pig and Elastic MapReduce
december 2011 by tlockney
Analyzing logs with Pig and Elastic Map Reduce .. - a great read ..
pig
hadoop
aws
from twitter_favs
december 2011 by tlockney
NICTA/scoobi - GitHub
october 2011 by tlockney
Announcing Scoobi - a #scala productivity framework for #hadoop
hadoop
scala
october 2011 by tlockney
Parallel Processing Using the Map Reduce Programming Model :: myNoSQL
august 2011 by tlockney
Parallel Processing Using the Map Reduce Programming Model #MapReduce #Hadoop
MapReduce
Hadoop
from twitter_favs
august 2011 by tlockney
The Practical Quant: Large-scale Named Entity Recognition in the Cloud
august 2011 by tlockney
Amazing factoid from GraphLab talk I tweeted earlier: entity extraction much, much faster than #hadoop
hadoop
from twitter_favs
august 2011 by tlockney
PageRank implementation in Pig « Julien's tech blog
july 2011 by tlockney
blog post showing how to implement PageRank with Pig embedded in Python. (with the link!)
pig
hadoop
python
from twitter_favs
july 2011 by tlockney
Untitled (http://theory.stanford.edu/~sergei/papers/spaa11-matchings.pdf)
may 2011 by tlockney
RT @jakehofman: RT @vsergei: [5 New] Graph algorithms for MapReduce with @SilvioL, Ben Moseley and @ssuri #hadoop
hadoop
from twitter_favs
may 2011 by tlockney
cwensel/bash-emr - GitHub
april 2011 by tlockney
some simple bash functions for manipulating Amazon Elastic MapReduce clusters from the command line #hadoop #aws
aws
hadoop
from twitter_favs
april 2011 by tlockney
tc's cascading-scala-maven-example at master - GitHub
september 2010 by tlockney
Posted an example project for using cascading with scala and maven #hadoop
#hadoop
hadoop
from twitter_favs
september 2010 by tlockney
rjurney's Cloud-Stenography at master - GitHub
june 2010 by tlockney
Cloud Stenography was to be a graphical interface to Hadoop, on a dataflow and SQL abstraction. The idea was to compose ad-hoc datasets for analysis and visualization in Pig SQL using a graphical dataflow interface leveraging ILLUSTRATE to preview data, run jobs and then easily export them out to Excel.
pig
hadoop
dataflow
visualization
analytics
from delicious
june 2010 by tlockney
related tags
#chef ⊕ #db ⊕ #hadoop ⊕ #hbase ⊕ ai ⊕ algorithms ⊕ amazon ⊕ analytics ⊕ aws ⊕ bigdata ⊕ bigtable ⊕ Brisk ⊕ cascading ⊕ cascalog ⊕ cassandra ⊕ chef ⊕ chukwa ⊕ clojure ⊕ cloudcomputing ⊕ cloudera ⊕ cloudfront ⊕ cloudmapreduce ⊕ cluster ⊕ clustering ⊕ concurrency ⊕ data ⊕ dataflow ⊕ datamining ⊕ dataprocessing ⊕ datawarehouse ⊕ db ⊕ debian ⊕ distributed ⊕ dumbo ⊕ ec2 ⊕ eclipse ⊕ elasticmapreduce ⊕ facebook ⊕ filesystem ⊕ graph ⊕ hadoop ⊖ hbase ⊕ hdfs ⊕ hive ⊕ Hypertable ⊕ ids ⊕ java ⊕ jdbc ⊕ jruby ⊕ jvm ⊕ linkedin ⊕ logging ⊕ lzo ⊕ machinelearning ⊕ mahout ⊕ mapreduce ⊕ mongodb ⊕ nosql ⊕ Oozie ⊕ pig ⊕ protobuf ⊕ python ⊕ r ⊕ rails ⊕ research ⊕ Riak ⊕ ruby ⊕ s3 ⊕ scala ⊕ scalability ⊕ scalding ⊕ scaling ⊕ scribe ⊕ snort ⊕ Sqoop ⊕ statistics ⊕ storage ⊕ thrift ⊕ twitter ⊕ ubuntu ⊕ virtualbox ⊕ visualization ⊕Copy this bookmark: