Counting Triangles - Social Graph Analysis
8 weeks ago by rcrowley
Biased comparison of graph-traversal solutions in raw Hadoop, Pig, and Vertica on a hilariously small data set.
hadoop
pig
vertica
graph
algorithms
Golden Orb
august 2011 by rcrowley
Open-source Pregel implementation based on HDFS.
goldenorb
pregel
hadoop
graph
Data Management, Facebook-style | 20bits
november 2008 by rcrowley
Hammer's talk about data at Facebook. 320 nodes for Hive, 120 nodes for Cassandra. Large data size and a (relatively) low write rate.
data
facebook
hive
cassandra
hadoop
log
logging
Running Hadoop On Ubuntu Linux (Multi-Node Cluster) - Michael G. Noll
august 2008 by rcrowley
No more package name guesswork.
ubuntu
linux
hadoop
hdfs
java
AmazonS3 - Lucene-hadoop Wiki
january 2008 by rcrowley
Using Hadoop's distributed filesystem with Amazon S3. Perhaps a better solution than SimpleDB for real data storage?
apache
lucene
hadoop
hash
table
filesystem
mapreduce
amazon
s3
java
dist