mcroydon + nosql   81

The Basho Blog: Secondary Indexes in Riak
It's great to see secondary indexes landed.
basho  blog  nosql  riak 
september 2011 by mcroydon
AccumuloProposal - Incubator Wiki
NoSQL with some properties similar to HBase with some interesting per-cel ACL. Born at the NSA.
apache  hadoop  nosql  nsa 
september 2011 by mcroydon
leveldb - a fast and lightweight key/value database library - Google Project Hosting
"LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values."
database  google  library  nosql 
august 2011 by mcroydon
Buzz by Google Research
"Dremel is a scalable, interactive ad-hoc query system for analysis of read-only nested data. By combining multi-level execution trees and columnar data layout, it is capable of running aggregation queries over trillion-row tables in seconds. The system scales to thousands of CPUs and petabytes of data, and has thousands of users at Google. In this paper, we describe the architecture and implementation of Dremel, and explain how it complements MapReduce-based computing. We present a novel columnar storage representation for nested records and discuss experiments on few-thousand node instances of the system."
google  data  nosql  research  paper 
march 2011 by mcroydon
NoSQL at Netflix
How Netflix uses various NoSQL datastores.
architecture  cassandra  database  netflix  nosql 
march 2011 by mcroydon
On moving from CouchDB to Riak
Thoughts on a move from CouchDB to Riak and some really good commercial support experiences.
couchdb  nosql  riak 
march 2011 by mcroydon
Basho Riak: Schema Design and the Transition from Relational Databases
A solid collection of introductory to deep dive material for folks used to having it easy and relational.
riak  nosql  sql  data  migration 
december 2010 by mcroydon
OpenTSDB - A Distributed, Scalable Monitoring System
OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable.
analysis  architecture  bigdata  cloud  data  database  db  java  lgpl  hbase  hadoop  development  graph  distributed  monitoring  nosql  opensource  operations  scalability  scale  time  sysadmin  software  storage  series  opentsdb  rrd  stumbleupon  time-series  timeseries 
november 2010 by mcroydon
Riak SmartMachine Benchmark: The Technical Details « Joyeur
Using Protocol Buffers is a must, but these numbers don't surprise me.
riak  nosql  joyent 
november 2010 by mcroydon
Flexible Event Tracking (Paul Gebheim)
Logging lots and lots of data with Mongo.
justin.tv  nosql  mongo 
may 2010 by mcroydon
Cassandra By Example | Rackspace Cloud Computing & Hosting
Cassandra is different enough that it's hard to make it accessible. This writeup does a pretty good job.
articles  cassandra  rackspace  python  database  relational  tutorial  databases  db  twitter  web  example  examples  webdev  webdevelopment  hackernews  howto  nosql  programming 
may 2010 by mcroydon
gist: 387775 - GitHub
A data-modeling exercise for Cassandra using Python and Tragedy.
cassandra  tragedy  python  data  data-model  nosql 
may 2010 by mcroydon
The Basho Blog
Very similar to the way that Cassandra does things under the hood.
erlang  riak  nosql  storage 
april 2010 by mcroydon
Cassandra Internals – Reading
The companion to his writing-oriented Cassandra tour.
architecture  cassandra  database  data  distributed  internals  java  nosql  sysadmin 
march 2010 by mcroydon
Lineland
Scroll through for lots and lots of HBase internals.
blog  distributed  hadoop  hbase  nosql  mapreduce  programming  systems  storage  reference 
march 2010 by mcroydon
Cassandra Internals – Writing
A little on Cassandra's internals as it pertains to writes.
cassandra  databases  db  development  key-value  nosql  performance 
march 2010 by mcroydon
Visual Guide to NoSQL Systems - Nathan Hurst's Blog
Nice overview and placing various NoSQL implementations on the CAP subway map.
blogs  database  databases  db  guide  hbase  nosql  visualization  sql 
march 2010 by mcroydon
Why Europe’s Largest Ad Targeting Platform Uses Hadoop « Cloudera » Apache Hadoop for the Enterprise
Moving from Postgres to HDFS + Pig and MapReduce for large data storage, analysis, and aggregation.
clojure  data  cloud  database  development  hadoop  mapreduce  web  nosql 
march 2010 by mcroydon
cloudkick | blog: 4 Months with Cassandra, a love story
A very interesting look at Cassandra with an eye toward gotchas. Cloudkick are doing some interesting stuff with aggregation over time periods.
admin  via:jacobian  administration  architecture  article  cassandra  database  databases  opensource  nosql  mysql  monitoring  django  distributed  db  datawarehouse  python  scalability  scaling  storage  toread  webdev  programming  cloudkick  neat 
march 2010 by mcroydon
The Basho Blog: Why Vector Clocks are Easy
Straightforward but very powerful message/value versioning conflict avoidance. This reminds me of git in a way since to avoid conflicts each message bust contain all predecessors in its vector mask.
programming  toread  tutorial  scalability  distributed  algorithm  concurrency  event  nosql  vector  versioning  via:chl  clock  dist  riak  basho  vectorclocks  distributed_systems  vector-clocks  clocks  vectorclock 
january 2010 by mcroydon
One year of Redis
I can't believe it's only been a year. Redis has always felt mature beyond its years.
web  software  development  tutorial  opensource  open  source  redis  nosql 
december 2009 by mcroydon
MongoDB - Using MongoDB for Real-time Analytics
Upserts look like they might be quite helpful with realtime analytics.
database  webdev  howto  statistics  stats  analytics  metrics  nosql  realtime  mongodb 
december 2009 by mcroydon
Journal of Eivind Uggedal: NoSQL East 2009 - Summary of Day 1
Some interesting bits and more of the same but I really like the dark-launch approach that Scribe allows.
data  database  toread  blog  scalability  internet  distributed  article  hadoop  scaling  db  cloud  couchdb  conference  papers  keyvalue  nosql  links  cassandra  2009  mongodb  dynomite  riak 
november 2009 by mcroydon
http://markorodriguez.com/Lectures_files/risk-symposium2009.pdf
A pretty concise look at graph databases vs. traditional relational databases.
data  database  research  storage  performance  todo  presentation  pdf  db  rdf  graph  graphs  rdbms  nosql  semweb  sparql  relational 
october 2009 by mcroydon
Why I like Redis
Redis is indeed awesome (and a little different) due to its support for rich primitive types.
python  programming  data  storage  database  dev  distributed  article  databases  cache  db  memcached  caching  articles  convert  cli  nosql  experiments  redis  schemaless  repl 
october 2009 by mcroydon
Riak - A Decentralized Database
"Riak combines a decentralized key-value store, a flexible map/reduce engine, and a friendly HTTP/JSON query interface to provide a database ideally suited for Web applications." Erlang under the hood.
programming  web  development  key-value  database  webdev  opensource  erlang  storage  scalability  distributed  rest  databases  http  mapreduce  json  db  couchdb  store  kvstore  datastore  keyvalue  nosql  document  cloudcomputing  riak  decentralized  basho  documentoriented  key-value-store 
october 2009 by mcroydon
« earlier      

related tags

2read  aa  acm  admin  administration  advice  algorithm  algorithms  alternative  amazon  amqp  analysis  analytics  apache  api  appengine  application  apps  arch  architecture  article  articles  async  availability  aviation  aws  basho  benchmark  benchmarking  benchmarks  big  bigdata  bigtable  blog  blogs  books  business  cache  caching  cap  case  casestudy  cassandra  cheatsheet  cli  clock  clocks  clojure  cloud  cloud-computing  cloudcomputing  cloudera  cloudkick  cluster  clustering  cms  code  coding  collection  compare  comparison  computer  computing  concurrency  conference  consistency  consistent  consulting  convert  cool  cost  couchbase  couchdb  damien  data  data-model  database  databases  datacenter  datamining  datastore  datastructures  datawarehouse  data_mining  date  db  decentralized  decisions  deployment  design  dev  develop  developer  developerworks  development  dht  digg  dist  distributed  distributed-computing  distributed_systems  django  doc  document  documentation  documentoriented  domain  dynomite  ec2  elastic  elasticmapreduce  elasticsearch  emr  engine  engineering  engineyard  erlang  event  evented  example  examples  experience  experiment  experiments  facebook  firefox  flockdb  forms  foss  framework  free  fulltext  functional  gae  geo  glossary  good  google  graph  graphs  grid  guide  ha  hackernews  hacks  hadoop  hardware  hash  hbase  hdfs  hosting  howto  http  hypertable  ibm  ideas  important  imported  index  indexing  info  infrastructure  inspiration  internals  internet  irb  issues  java  javascript  joyent  js  json  justin.tv  key  key-value  key-value-store  keystore  keyvalue  knowledge  kvs  kvstore  lgpl  library  libs  linkeddata  linkedin  links  linux  list  load  location  logs  lucene  mail  make  management  map  map-reduce  mapreduce  membase  memcache  memcached  messages  messaging  metrics  migration  mining  model  mongo  mongodb  mongoengine  mongomapper  monitoring  mozilla  mrjob  mysql  neat  netflix  news  node  node.js  nodejs  nosql  notes  nsa  nytimes  open  open-source  opensource  opentsdb  operations  optimization  orm  overview  paper  papers  partition  pdf  performance  persistence  personal  pig  post  presentation  presentations  processing  production  programming  promises  python  query  rackspace  rails  rdbms  rdf  read  reading  readlater  realtime  reddit  redis  reduce  reference  relational  repl  replication  repository  research  resource  resources  rest  restful  review  riak  rrd  ruby  rubyonrails  saas  samples  sata  scala  scalability  scalable  scale  scaling  schemaless  scribe  search  search-engine  searchengine  semantic  semanticweb  semantic_web  semweb  series  server  servers  serverside  service  services  shard  sharding  simpledb  simplegeo  slides  slideshare  social  socialmedia  socialnetworking  socialnetworks  social_networking  software  solr  source  sparql  sql  startup  statistics  stats  storage  store  stumbleupon  support  sysadmin  systems  technology  theorem  time  time-series  timeseries  tips  to-read  todo  tool  toread  to_read  tracking  tragedy  training  trends  triple  triplestore  tutorial  tutorials  twitter  type  types  usecases  user  vector  vector-clocks  vectorclock  vectorclocks  versioning  via:chl  via:jacobian  video  videos  visualization  voldemort  vs  web  web2.0  webdesign  webdev  webdevelopment  webservices  work  xml  yam  yelp  zynga 

Copy this bookmark:



description:


tags: