mcroydon + storage   123

Basho | The All New
My initial thought is "Enterprise Park Place" but this seems pretty cool. There's a little circle of life going on here as well: Riak was inspired by Dynamo, which has been quietly powering S3 for years.
cloud  storage  riak  basho 
9 weeks ago by mcroydon
OpenTSDB - A Distributed, Scalable Monitoring System
OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable.
analysis  architecture  bigdata  cloud  data  database  db  java  lgpl  hbase  hadoop  development  graph  distributed  monitoring  nosql  opensource  operations  scalability  scale  time  sysadmin  software  storage  series  opentsdb  rrd  stumbleupon  time-series  timeseries 
november 2010 by mcroydon
todos.js
A really concise example of backbone.js using docco for literate programming.
ajax  app  application  code  demo  development  javascript  html5  framework  examples  example  documentation  jquery  js  library  local  model  mvc  oop  programming  storage  todo  tutorial  backbone  backbone.js 
november 2010 by mcroydon
SHARD Triple-Store
"SHARD is a proof-of-concept use of high-performance, low-cost distributed computing technology to develop a highly scalable triple-store. SHARD is released as an open-source project on the BSD license."
database  db  cloud  distributed  hadoop  lubm  mapreduce  rdf  store  sparql  storage  shard  semweb  semanticweb  scalability  triple-store 
october 2010 by mcroydon
RDFLib
"RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information." I was totally here years ago.
api  database  code  dev  development  framework  lib  graph  python  programming  parsing  parser  owl  ontology  library  rdf  semantic  rdfa  semantic-web  semanticweb  semweb  xml  web2.0  web  tools  storage  sparql  software  n3  rdflib 
october 2010 by mcroydon
Lawnchair
Yay things to put things in.
database  db  html5  json  storage  webkit 
august 2010 by mcroydon
AWS Import/Export
A station wagon full of backup tapes now has an API.
amazon  aws  bigdata  beta  backup  cloud  cloud_computing  service  s3  large  import  export  ec2  data  cloudcomputing  storage  tools  carrier  sneakernet  transfer 
june 2010 by mcroydon
The Basho Blog
Very similar to the way that Cassandra does things under the hood.
erlang  riak  nosql  storage 
april 2010 by mcroydon
Lineland
Scroll through for lots and lots of HBase internals.
blog  distributed  hadoop  hbase  nosql  mapreduce  programming  systems  storage  reference 
march 2010 by mcroydon
cloudkick | blog: 4 Months with Cassandra, a love story
A very interesting look at Cassandra with an eye toward gotchas. Cloudkick are doing some interesting stuff with aggregation over time periods.
admin  via:jacobian  administration  architecture  article  cassandra  database  databases  opensource  nosql  mysql  monitoring  django  distributed  db  datawarehouse  python  scalability  scaling  storage  toread  webdev  programming  cloudkick  neat 
march 2010 by mcroydon
Queued Storage Backend for Django | Step and Numbers
This seems like a simple and elegant solution to the problem at hand.
programming  django  python  storage  s3  queue  django-apps  amqp  files  celery 
february 2010 by mcroydon
Avro: a Format for Big Data » Cloudera Hadoop & Big Data Blog
Another data interchange format (I think) like ProtocolBuffers and Thrift. I think one of the bigger problems that the Hadoop/big data community has is parallel internal implementations of building blocks that are later open-sourced.
data  database  storage  distributed  hadoop  apache  cloud  json  messaging  encoding  protocol  portable  cloudera  bigdata  data-structures  serialization  format  foss  thrift  buffers  introduction  avro 
november 2009 by mcroydon
LucidDB Home Page
"LucidDB is the first and only open-source RDBMS purpose-built entirely for data warehousing and business intelligence. It is based on architectural cornerstones such as column-store, bitmap indexing, hash join/aggregation, and page-level multiversioning."
programming  software  development  database  data  business  opensource  java  scalability  storage  distributed  cluster  databases  sql  db  datamining  olap  columndb  bi  datawarehouse  dbms  reporting  rdbms  luciddb  column  warehousing  column-store  data_warehouse  column-oriented  dwh 
october 2009 by mcroydon
http://markorodriguez.com/Lectures_files/risk-symposium2009.pdf
A pretty concise look at graph databases vs. traditional relational databases.
data  database  research  storage  performance  todo  presentation  pdf  db  rdf  graph  graphs  rdbms  nosql  semweb  sparql  relational 
october 2009 by mcroydon
Geeking with Greg: Advice from Google on large distributed systems
With links to slides from LADIS '09. This includes a refresh and update about how GFS, MapReduce, etc are working in Google's fault-filled environment.
programming  google  blog  scalability  storage  performance  architecture  distributed  advice  scaling  infrastructure  systems  datacenters 
october 2009 by mcroydon
Why I like Redis
Redis is indeed awesome (and a little different) due to its support for rich primitive types.
python  programming  data  storage  database  dev  distributed  article  databases  cache  db  memcached  caching  articles  convert  cli  nosql  experiments  redis  schemaless  repl 
october 2009 by mcroydon
Training to Climb an Everest of Digital Data
Big data is big and almost always requires a completely different mindset than the one that is taught in computer science programs.
data  database  processing  google  news  toread  ibm  energy  datasets  mining  search  research  science  internet  algorithms  storage  scaling  education  hadoop  analysis  computer-science  datacuration 
october 2009 by mcroydon
Riak - A Decentralized Database
"Riak combines a decentralized key-value store, a flexible map/reduce engine, and a friendly HTTP/JSON query interface to provide a database ideally suited for Web applications." Erlang under the hood.
programming  web  development  key-value  database  webdev  opensource  erlang  storage  scalability  distributed  rest  databases  http  mapreduce  json  db  couchdb  store  kvstore  datastore  keyvalue  nosql  document  cloudcomputing  riak  decentralized  basho  documentoriented  key-value-store 
october 2009 by mcroydon
Some perspective to this DIY storage server mentioned at Storagemojo - c0t0d0s0.org
This addresses several of the shortcomings of the Backblaze high density disk storage system. This analysis is in line with some of my grumps but at the same time the Backblaze system is a pretty impressive thing despite its shortcomings.
storage  sysadmin 
september 2009 by mcroydon
Petabytes on a budget: How to build cheap cloud storage | Backblaze Blog
A pretty fantastic DIY solution to massive storage. I kinda wonder what the failure rate is on a configuration such as this.
design  linux  hardware  database  opensource  technology  amazon  server  diy  computer  storage  sysadmin  cool  s3  aws  debian  open  cloud  infrastructure  custom  cheap  backblaze  hard  drive  harddrives 
september 2009 by mcroydon
4store - Scalable RDF storage
"At times holding and running queries over databases of 15GT, supporting a Web application used by thousands of people."
programming  web  software  open-source  development  data  database  opensource  scalability  storage  databases  gpl  rdf  store  db  c  semantic  semanticweb  repository  ontology  semweb  semantic-web  triplestore  sparql  garlik  4store  triple-store  rdfstore  triple  websemantique 
august 2009 by mcroydon
Getting Good IO from Amazon's EBS
How to get good performance out of this mysterious creature.
amazon  storage  performance  startup  aws  testing  ec2  optimization  benchmark  ebs  raid  disk  io 
august 2009 by mcroydon
The Pathologies of Big Data - ACM Queue
"Scale up your datasets enough and all your apps will come undone."
storage  algorithm  postgresql  databases  compsci  scale  big  data  acm 
july 2009 by mcroydon
Are Cloud Based Memory Architectures the Next Big Thing? | High Scalability
Quite a long and thoughtful post, worth skimming and pondering at the very least. This post is a little too enterprisey and a little less startups in the trenches, but still worth thinking about.
programming  database  tools  scalability  storage  architecture  distributed  performance  clustering  collaboration  memcached  grid  cloud  caching  db  concurrency  articles  communication  cloudcomputing  semanticweb 
march 2009 by mcroydon
redis - Google Code
"Redis is a key-value database. It is similar to memcached but the dataset is not volatile, and keys can be strings, exactly like in memcached, but also lists and sets with atomic operations to push/pop elements."
programming  data  google  code  database  opensource  hacking  storage  scalability  performance  databases  cache  memcached  caching  c  memcache  db  memory  key  key-value  persistence  redis  value  kvs 
february 2009 by mcroydon
tuulos's ringo at master — GitHub
immutable key/value store written on top of Disco, a map/reduce framework built by Nokia Research.
python  database  nokia  erlang  storage  scalability  scaling  distributed  cache  mogilefs  parallel  hash  caching  cloud  distribution  mapreduce  dbms  consistent  hashing  ringo  dht  dynamo 
december 2008 by mcroydon
Dropbox - Home - Secure backup, sync and sharing made easy.
Probably the coolest thing ever. I'm hoping that I can afford the addiction once it costs money.
backup  download  file  home  network  mac  online  osx  software  sharing  storage  sync  upload  tools  web 
april 2008 by mcroydon
Winclone
This looks useful. I wish I knew about this before I accidentally toasted my Windows partition.
apple  backup  deployment  freeware  osx  parallels  software  storage  tools  vista  vm  windows 
march 2008 by mcroydon
Eye-Fi Card Review: 1. Introduction: Digital Photography Review
It looks like you can associate it with multiple APs as long as you register them ahead of time.
camera  digital  card  gadget  photo  photography  photos  sd  review  storage  tech  toread  wireless 
march 2008 by mcroydon
The Economics of Online Backup - Inside Lightroom
Never underestimate the bandwidth of a station wagon full of backup tapes.
amazon  analysis  bandwidth  blogs  backup  business  internet  s3  storage  sysadmin 
february 2008 by mcroydon
« earlier      

related tags

4store  10.5  @toread  access  acm  ad  admin  administration  advice  ajax  algorithm  algorithms  allegro  allegrograph  alternative  amazon  amazon-s3  amqp  analysis  analytics  apache  api  app  apple  appliance  application  applications  apps  arch  architecture  article  articles  audio  avro  aws  backblaze  backbone  backbone.js  backend  backup  bandwidth  basho  batch  benchmark  benchmarking  benchmarks  beowulf  beta  bi  big  bigdata  bigtable  bioinformatics  bizgres  blob  blog  blogs  books  bsd  buffers  business  businessintelligence  c  cache  caching  camera  card  carrier  cassandra  catapult  cdn  celery  cheap  cifs  cli  click  cloud  cloud-computing  cloudcomputing  cloudera  cloudkick  cloud_computing  cluster  clustering  code  coding  collaboration  column  column-oriented  column-store  columndb  command  command-line  commandline  comment  communication  community  compare  comparison  components  compsci  computer  computer-science  computerscience  computing  concurrency  conference  consistent  consulting  convert  cool  couchdb  cs  curl  custom  damien  danga  data  data-structures  data-warehousing  database  databases  datacenters  datacuration  datamining  dataset  datasets  datastore  datastructures  datawarehouse  data_warehouse  db  dbms  debian  decentralized  demo  deployment  design  dev  development  dht  dictionary  digg  digital  disk  dist  distributed  distributed-computing  distribution  diy  django  django-apps  document  documentation  documentoriented  download  drive  dvd  dw  dwh  dynamo  dynomite  ebs  ec2  education  efficient  encoding  energy  engineering  english  enterprise  erlang  example  examples  experiments  export  facebook  faq  fast  file  files  filesystem  firefox  format  formatting  foss  framework  free  freebase  freebsd  freeware  friendfeed  fuse  gadget  garlik  good  google  gpl  graph  graphd  graphdb  graphics  graphs  greenplum  grid  guide  ha  hack  hacking  hadoop  hard  harddrive  harddrives  hardware  hash  hashing  haystack  hbase  hdfs  history  hive  home  hosting  howto  html5  http  hypertable  ibm  image  images  implementation  import  important  index  indexing  info  infrastructure  inspiration  intelligence  interesting  interface  internet  introduction  io  iscsi  it  itunes  java  javascript  job  johncompanies  jquery  js  json  key  key-value  key-value-store  keystore  keyvalue  kvs  kvstore  laptop  large  large-scale  last.fm  leopard  lesen  lgpl  lib  library  lightcloud  line  linkedin  linux  lisp  list  local  lua  lubm  lucene  luciddb  mac  macfuse  macosx  mail  management  map  mapreduce  media  memcache  memcached  memory  mercurial  messages  messaging  metadata  metaweb  mining  mnesia  model  modem  module  mogilefs  mongodb  monitoring  mosix  mozilla  mp3  mvc  mysql  n3  nas  nasa  neat  network  networking  networks  news  nfs  nokia  nosql  olap  olpc  online  online-storage  ontology  oop  open  open-source  openrdf  opensolaris  opensource  opentsdb  open_data  operations  optimization  os  oss  osx  owl  p2p  pagerank  parallel  parallels  parser  parsing  partition  paxos  pdf  performance  perl  persistence  personal  photo  photography  photos  pictures  plugin  plurk  podcast  podcasting  portable  postgres  postgresql  pregel  presentation  preview  processing  production  programming  protocol  protocols  publicdata  pvr  python  queue  rackspace  raid  rails  rdbms  rdf  rdfa  rdflib  rdfstore  read  reading  realtime  reddit  redis  reference  relational  release  repl  replication  reporting  repository  research  resource  resources  rest  retro  review  reviews  riak  ringo  ror  rrd  ruby  ruby-on-rails  rubyonrails  s3  samba  samples  sata  scalability  scalaris  scale  scaling  schema  schema-less  schemaless  science  script  sd  search  security  semantic  semantic-web  semanticweb  semantic_web  semweb  seo  serialization  series  server  servers  service  services  sesame  shard  sharding  sharing  shell  shopping  simpledb  slides  sneakernet  soa  soap  software  softwareengineering  solaris  solaris10  source  space  sparql  sql  sqs  ssd  standard  standards  startup  startups  statistics  stats  storage  store  streaming  stumbleupon  subversion  sun  support  svn  sync  sysadmin  system  systems  tape  tech  technology  testing  thesis  thrift  time  time-series  timeseries  tips  to  to-read  todo  tokyo-cabinet  tokyo-tyrant  tokyocabinet  tokyotyrant  tool  tools  toread  tracking  training  transfer  treemap  trends  triple  triple-store  triplestore  tuple  tuples  tuplespace  tutorial  tutorials  twitter  ubuntu  unix  upload  usage  utilities  utility  value  vcs  versioning  via:jacobian  video  videos  virtualization  vista  visualization  vm  voldemort  warehouse  warehousing  web  web-services  web2.0  web3.0  webdav  webdesign  webdev  webhosting  webkit  websemantique  webserver  webservices  wiki  wikipedia  windows  wireless  work  xen  xfs  xml  zfs  zynga 

Copy this bookmark:



description:


tags: