Self-service, Prorated Super Computing Fun! - Open - Code - New York Times Blog
february 2008 by mncaudill
Very cool article of how NYTimes used S3, EC2, and Hadoop to churn through over a hundred years of PDFs to get them web-display ready.
ec2
s3
cluster
february 2008 by mncaudill