tlockney + hadoop   106

Edwin Chen's Blog: Movie recommendations and more via MapReduce and Scalding
Excellent post on recommendation techniques with #Scalding (Scala + Cascading)
scala  hadoop  mapReduce  scalding 
february 2012 by tlockney
sysadvent: Day 10 - Analyzing Logs with Pig and Elastic MapReduce
Analyzing logs with Pig and Elastic MapReduce: a great read.
pig  hadoop  aws  from twitter_favs
december 2011 by tlockney
NICTA/scoobi - GitHub
Announcing Scoobi - a #scala productivity framework for #hadoop
hadoop  scala 
october 2011 by tlockney
The Practical Quant: Large-scale Named Entity Recognition in the Cloud
Amazing factoid from GraphLab talk I tweeted earlier: entity extraction much, much faster than
hadoop  from twitter_favs
august 2011 by tlockney
PageRank implementation in Pig « Julien's tech blog
Blog post showing how to implement PageRank with Pig embedded in Python. (with the link!)
pig  hadoop  python  from twitter_favs
july 2011 by tlockney
cwensel/bash-emr - GitHub
some simple bash functions for manipulating Amazon Elastic MapReduce clusters from the command line
aws  hadoop  from twitter_favs
april 2011 by tlockney
tc's cascading-scala-maven-example at master - GitHub
Posted an example project for using Cascading with Scala and Maven
#hadoop  hadoop  from twitter_favs
september 2010 by tlockney
rjurney's Cloud-Stenography at master - GitHub
Cloud Stenography was to be a graphical interface to Hadoop, on a dataflow and SQL abstraction. The idea was to compose ad-hoc datasets for analysis and visualization in Pig SQL using a graphical dataflow interface leveraging ILLUSTRATE to preview data, run jobs and then easily export them out to Excel.
pig  hadoop  dataflow  visualization  analytics  from delicious
june 2010 by tlockney