rcrowley + dist   31

Google: Achieving Rapid Response Times in Large Online Services
Things to consider when optimizing at the 99th percentile. Variance is key and is one of the biggest reasons EBS blows.

Counterintuitively, he suggests synchronizing variances across a cluster (for example, a Puppet run). Even though for that moment in time all requests are slow, it's better to do that than let every request potentially be slow because it fanned-out to a temporarily slow node.
ops  dist  variance  performance  scalability  optimization  google  jeffdean 
8 weeks ago by rcrowley
Apache Kafka
Design documentation of LinkedIn's Kafka queue broker. It's distributed but relies on Zookeeper for orchestration. The section on the log file's implementation is particularly interesting.
kafka  queue  mq  messaging  logging  dist 
november 2011 by rcrowley
The Plasma Project
OCaml, but a distributed POSIX- and NFS-compliant filesystem.
filesystem  mapreduce  fuse  storage  dist 
november 2011 by rcrowley
toddlipcon/gremlins - GitHub
(Dumping GitHub watches into Pinboard.)

Distributed fault-testing tools.
python  dist  testing 
october 2011 by rcrowley
jrydberg/nesoi - GitHub
Gossip-based eventually-consistent (but for a small cluster likely always consistent) configuration storage engine like Noah.
dist  python  twisted  gossip  base  noah  cm  truth 
august 2011 by rcrowley
BOOM -- Berkeley Orders of Magnitude -- Declarative Languages And Systems
Distributed computing project from Berkeley. "...build systems that are OOM bigger than are building today, with OOM less effort than traditional programming methodologies."
berkeley  bloom  boom  cloud  dist 
july 2011 by rcrowley
Desktop Data Grid - Ddg - LSD - Laboratório de Sistemas Distribuídos
Jonhnny Weslley just added support for this to BashReduce. Need to read to see if it's worth merging.
bashreduce  ddgfs  filesystem  dist 
july 2009 by rcrowley
up and running with cassandra :: snax
This intrigues me: "unlike most key/value stores, you can query for ordered ranges of keys." In my messing around with Cassandra, it's Thrift Interface seemed to actively hinder doing anything like this.
cassandra  java  thrift  kv  db  dist 
july 2009 by rcrowley
Plurk Open Source - LightCloud - Distributed and persistent key value database
I could bolt libketama on the front of Net::Tyrant, or I could translate this to PHP.
lightcloud  tokyocabinet  tokyotyrant  python  kv  db  dist 
june 2009 by rcrowley
Gluster - Home
Another POSIX-compliant cluster filesystem.
gluster  glusterfs  nfs  filesystem  dist 
june 2009 by rcrowley
libdht2 - Google Code
DHT in library form? From Neils Provos so very much not a joke.
dht  libevent  event  c  dist 
april 2009 by rcrowley
Some Notes on Distributed Key Stores « random($foo)
Leonard landed on TC with his own hashing/distribution. Someday we'll have eventual consistency.
lhl  kv  tokyocabinet  tokyotyrant  dht  voldemort  memcachedb  dist 
april 2009 by rcrowley
Defying Classification: The Dark Side Of Distributed Version Control
"Again, open source runs on its communities. Collections of people using the benefits of multiple brains to solve, either deliberately or by random walk, complex problems. "
versioncontrol  opensource  community  dist 
march 2009 by rcrowley
MogileFS
Well, Digg uses it, so it must be at least worth reading.
mogilefs  storage  dist 
september 2008 by rcrowley
ongoing · Wide Finder 2
Back at it, this time with a beast of a server, a dataset that won't fit in memory (joy of joys!) and (inexplicably) more OCaml.
widefinder  sun  timbray  concurrency  programming  dist  parallelism  threading 
july 2008 by rcrowley
AmazonS3 - Lucene-hadoop Wiki
Using Hadoop's distributed filesystem with Amazon S3. Perhaps a better solution than SimpleDB for real data storage?
apache  lucene  hadoop  hash  table  filesystem  mapreduce  amazon  s3  java  dist 
january 2008 by rcrowley

Copy this bookmark:



description:


tags: