mcroydon + db   78

Castle: Reinventing Storage for Big Data: OSCON 2011 - O'Reilly Conferences, July 25 - 29, 2011, Portland, OR
"The standard Linux storage stack wasn’t designed for write-heavy big data workloads, nor is it well-suited to modern hardware: large, slow SATA disks, SSDs or many cores. Castle, an open-source project, is a ground-up overhauling of RAID, file systems, and the POSIX interface. It is released under the GPL and runs as part of the Linux kernel."
castle  db 
september 2011 by mcroydon
Cassandra @ twitter
A solid modern deck about Cassandra at twitter.
cassandra  db  programming  twitter 
july 2011 by mcroydon
OpenTSDB - A Distributed, Scalable Monitoring System
OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable.
analysis  architecture  bigdata  cloud  data  database  db  java  lgpl  hbase  hadoop  development  graph  distributed  monitoring  nosql  opensource  operations  scalability  scale  time  sysadmin  software  storage  series  opentsdb  rrd  stumbleupon  time-series  timeseries 
november 2010 by mcroydon
SHARD Triple-Store
"SHARD is a proof-of-concept use of high-performance, low-cost distributed computing technology to develop a highly scalable triple-store. SHARD is released as an open-source project on the BSD license."
database  db  cloud  distributed  hadoop  lubm  mapreduce  rdf  store  sparql  storage  shard  semweb  semanticweb  scalability  triple-store 
october 2010 by mcroydon
Foursquare and MongoDB: What If | Jeremiah Peschka
Interesting thoughts on various big data approaches.
cassandra  db  foursquare  hbase  mongodb  riak 
october 2010 by mcroydon
Lawnchair
Yay things to put things in.
database  db  html5  json  storage  webkit 
august 2010 by mcroydon
Cassandra By Example | Rackspace Cloud Computing & Hosting
Cassandra is different enough that it's hard to make it accessible. This writeup does a pretty good job.
articles  cassandra  rackspace  python  database  relational  tutorial  databases  db  twitter  web  example  examples  webdev  webdevelopment  hackernews  howto  nosql  programming 
may 2010 by mcroydon
Cassandra Internals – Writing
A little on Cassandra's internals as it pertains to writes.
cassandra  databases  db  development  key-value  nosql  performance 
march 2010 by mcroydon
Visual Guide to NoSQL Systems - Nathan Hurst's Blog
Nice overview and placing various NoSQL implementations on the CAP subway map.
blogs  database  databases  db  guide  hbase  nosql  visualization  sql 
march 2010 by mcroydon
cloudkick | blog: 4 Months with Cassandra, a love story
A very interesting look at Cassandra with an eye toward gotchas. Cloudkick are doing some interesting stuff with aggregation over time periods.
admin  via:jacobian  administration  architecture  article  cassandra  database  databases  opensource  nosql  mysql  monitoring  django  distributed  db  datawarehouse  python  scalability  scaling  storage  toread  webdev  programming  cloudkick  neat 
march 2010 by mcroydon
Johnny Cache — Johnny Cache v0.1 documentation
"Johnny Cache is a caching framework for django applications. It works with the django caching abstraction, but was developed specifically with the use of memcached in mind. Its main feature is a patch on Django’s ORM that automatically caches all reads in a consistent manner."
application  caching  cache  database  db  django  django-apps  middleware  library  memcache  memcached  framework  opensource  performance  orm  python  plugin  webdev  johnny  queryset  query 
march 2010 by mcroydon
zoie - Project Hosting on Google Code
Built on top of Apache Lucene and focuses on solving several problems around real-time search and indexing performance.
programming  development  google  library  opensource  free  java  search  architecture  tech  apache  db  lucene  solr  project  indexing  realtime  index  fulltext  mq  linkedin  zoie 
november 2009 by mcroydon
Journal of Eivind Uggedal: NoSQL East 2009 - Summary of Day 1
Some interesting bits and more of the same but I really like the dark-launch approach that Scribe allows.
data  database  toread  blog  scalability  internet  distributed  article  hadoop  scaling  db  cloud  couchdb  conference  papers  keyvalue  nosql  links  cassandra  2009  mongodb  dynomite  riak 
november 2009 by mcroydon
LucidDB Home Page
"LucidDB is the first and only open-source RDBMS purpose-built entirely for data warehousing and business intelligence. It is based on architectural cornerstones such as column-store, bitmap indexing, hash join/aggregation, and page-level multiversioning."
programming  software  development  database  data  business  opensource  java  scalability  storage  distributed  cluster  databases  sql  db  datamining  olap  columndb  bi  datawarehouse  dbms  reporting  rdbms  luciddb  column  warehousing  column-store  data_warehouse  column-oriented  dwh 
october 2009 by mcroydon
http://markorodriguez.com/Lectures_files/risk-symposium2009.pdf
A pretty concise look at graph databases vs. traditional relational databases.
data  database  research  storage  performance  todo  presentation  pdf  db  rdf  graph  graphs  rdbms  nosql  semweb  sparql  relational 
october 2009 by mcroydon
Why I like Redis
Redis is indeed awesome (and a little different) due to its support for rich primitive types.
python  programming  data  storage  database  dev  distributed  article  databases  cache  db  memcached  caching  articles  convert  cli  nosql  experiments  redis  schemaless  repl 
october 2009 by mcroydon
bobby-tables.com: A guide to preventing SQL injection
How to avoid little bobby tables in several languages. There are more subtle attacks but handling the basics right is the first step.
programming  development  database  webdev  howto  computer  rails  hack  security  coding  hacks  fun  databases  geek  hacking  mysql  tips  sql  db  bugs  c#  injection  tables  sqlinjection  sql-injection 
october 2009 by mcroydon
Riak - A Decentralized Database
"Riak combines a decentralized key-value store, a flexible map/reduce engine, and a friendly HTTP/JSON query interface to provide a database ideally suited for Web applications." Erlang under the hood.
programming  web  development  key-value  database  webdev  opensource  erlang  storage  scalability  distributed  rest  databases  http  mapreduce  json  db  couchdb  store  kvstore  datastore  keyvalue  nosql  document  cloudcomputing  riak  decentralized  basho  documentoriented  key-value-store 
october 2009 by mcroydon
4store - Scalable RDF storage
"At times holding and running queries over databases of 15GT, supporting a Web application used by thousands of people."
programming  web  software  open-source  development  data  database  opensource  scalability  storage  databases  gpl  rdf  store  db  c  semantic  semanticweb  repository  ontology  semweb  semantic-web  triplestore  sparql  garlik  4store  triple-store  rdfstore  triple  websemantique 
august 2009 by mcroydon
Charu Aggarwal
Some pretty hefty thoughts on data mining and ML stuffs.
data  database  research  books  algorithms  management  db  datamining  academic  ibm  papers  mining  people  lists  data-mining  data_mining  privacy 
august 2009 by mcroydon
What happened to Hot Standby?
On hot standby and the future of replication for PostgreSQL.
database  opensource  postgresql  postgres  db  oss  replication  dbms  tumble  dba 
june 2009 by mcroydon
NUCULAR fielded text searchable indexing: Documentation
Another lightweight Python full-text search engine with a silly if not unfortunate name.
python  software  django  development  database  tools  library  api  xml  opensource  search  text  application  lucene  db  concurrency  indexing  solr  oss  ir  searchengine  fulltext  whoosh  nucular 
june 2009 by mcroydon
Neo4j - a Graph Database that Kicks Buttox | High Scalability
The most common complaint about existing graph databases is performance. Hopefully a stable of good, performant graph databases will change that.
data  database  toread  visualization  java  opensource  network  scalability  cool  architecture  performance  graph  hadoop  databases  db  graphs  2009  arch  socialnetworking  socialmedia  dataviz  neo4j  graph_database  graph-database  relationship 
june 2009 by mcroydon
Sequoia
"Sequoia is a transparent middleware solution offering clustering, load balancing and failover services for any database."
database  java  performance  cluster  postgresql  mysql  clustering  sql  tool  db  replication  ha  jdbc  highavailability 
may 2009 by mcroydon
Are Cloud Based Memory Architectures the Next Big Thing? | High Scalability
Quite a long and thoughtful post, worth skimming and pondering at the very least. This post is a little too enterprisey and a little less startups in the trenches, but still worth thinking about.
programming  database  tools  scalability  storage  architecture  distributed  performance  clustering  collaboration  memcached  grid  cloud  caching  db  concurrency  articles  communication  cloudcomputing  semanticweb 
march 2009 by mcroydon
redis - Google Code
"Redis is a key-value database. It is similar to memcached but the dataset is not volatile, and keys can be strings, exactly like in memcached, but also lists and sets with atomic operations to push/pop elements."
programming  data  google  code  database  opensource  hacking  storage  scalability  performance  databases  cache  memcached  caching  c  memcache  db  memory  key  key-value  persistence  redis  value  kvs 
february 2009 by mcroydon
DataImportHandler - Solr Wiki
I bet this is super fast, but it's a fun intersection of XML and SQL that makes me a little nauseous.
programming  data  java  search  database  databases  mysql  lucene  sql  solr  indexing  find  db  query  import  rdbms 
february 2009 by mcroydon
MySQL :: Using the New MySQL Query Profiler
Maybe I've been using PostgreSQL too long, but I find EXPLAIN ANALYZE much easier than all of this. I guess this is handy if you're creating a whole bunch of temp tables or helping spot insanely inefficient queries.
programming  database  tools  scalability  management  performance  mysql  databases  optimization  sql  test  debugging  db  profiler  profile  query  debug 
october 2008 by mcroydon

related tags

2read  4store  6.830  @toread  academic  ad  admin  administration  algo  algorithm  algorithms  allegro  allegrograph  alternative  amazon  analysis  analytics  apache  api  application  apps  arch  architecture  article  articles  aws  backup  basho  batch  benchmark  benchmarking  benchmarks  bi  bigdata  bigtable  bit  bitmap  bizgres  blob  blog  blogs  books  bugs  business  businessintelligence  c  c#  cache  caching  cap  career  case  cassandra  castle  cd  census  chart  class  cli  click  cloud  cloudcomputing  cloudkick  cluster  clustering  code  coding  collaboration  collection  column  column-oriented  column-store  columndb  comment  commercial  communication  compare  comparison  components  compress  compression  computer  computerscience  computing  concurrency  conference  configuration_management  consistency  consulting  convert  cool  couchdb  course  courses  cs  damien  data  data-mining  data-warehousing  database  databases  datacenter  datamining  datasets  datastore  datastructures  dataviz  datawarehouse  data_mining  data_warehouse  date  db  dba  dbms  dbms.postgresql  dbpedia  dbs  debug  debugging  decentralized  demographics  deployment  design  dev  development  dht  dictionary  diff  differences  digg  dist  distributed  distributed-computing  django  django-apps  doc  document  documentation  documentoriented  download  dw  dwh  dynamo  dynomite  ec2  education  efficient  engineering  engineyard  english  erlang  evolution  example  examples  experience  experiments  facebook  farmville  fastbit  ferrett  find  firefox  flockdb  foss  foursquare  framework  free  freebase  friendfeed  fulltext  fun  gamedev  games  garlik  geek  genealogy  good  google  gov  government  gpl  graph  graph-database  graphd  graphdb  graphics  graphs  graph_database  greenplum  grid  guide  ha  hack  hackernews  hacking  hacks  hadoop  hadoopdb  hash  hashing  hbase  hdfs  high-availability  highavailability  hive  hosting  howto  html5  http  hypertable  ibm  ideas  implementation  import  imported  index  indexing  information  infrastructure  injection  inspiration  intelligence  internet  ir  irb  irc  java  javascript  jdbc  johnny  json  key  key-value  key-value-store  keynote  keyvalue  knn  knowledge  kvs  kvstore  large  large-scale  learning  lecture  lectures  lesen  lgpl  library  libs  license:bsd  life  lifehack  lightcloud  linkedin  links  linux  list  lists  load  lua  lubm  lucene  luciddb  management  map  map-reduce  mapreduce  market  mashup  math  mathematics  memcache  memcached  memory  metadata  metaweb  middleware  migration  mining  mit  mongo  mongodb  mongoengine  monitor  monitoring  mozilla  mq  multidimensional  munin  mysql  neat  neo4j  network  networking  networks  nosql  notes  nucular  olap  online  ontology  open  open-source  openrdf  opensource  opentsdb  operations  optimization  orm  oss  overview  owl  p2p  pagerank  paper  papers  paxos  pdf  people  performance  perl  persistence  personal  pgsql  php  pig  plugin  plurk  population  postgres  postgresql  pregel  presentation  presentations  privacy  production  profile  profiler  programming  project  puppet  python  query  queryset  rackspace  rails  rdbms  rdf  rdfstore  read  reading  readlater  realtime  recommendations  reddit  redis  reference  relational  relationaldb  relationship  repl  replication  reporting  repository  research  resource  resources  rest  review  riak  rrd  ruby  samples  scalability  scalable  scalaris  scale  scaling  schema  schema-less  schemaless  scribe  script  scripts  search  searchengine  security  semantic  semantic-web  semanticweb  semantic_web  semweb  seo  series  server  services  sesame  shard  sharding  simpledb  slides  slideshare  social  socialmedia  socialnetworking  socialnetworks  software  solr  sparql  sql  sql-injection  sqlinjection  startup  startups  statistics  stats  storage  store  stumbleupon  support  sysadmin  systems  ta  tables  talk  tech  technology  test  text  time  time-series  timeseries  tips  to-read  todo  tokyo-cabinet  tokyo-tyrant  tokyocabinet  tokyotyrant  tool  tools  top  toread  to_read  tracking  training  trends  triple  triple-store  triplestore  tumble  tuning  tuple  tuples  tuplespace  tutorial  tutorials  twitter  type  types  unix  usecases  user  value  via:jacobian  video  videos  visualization  vldb  voldemort  warehouse  warehousing  web  web2.0  web3.0  webdesign  webdev  webdevelopment  webkit  websemantique  whoosh  wiki  wikipedia  work  xml  yale  yam  zoie  zynga 

Copy this bookmark:



description:


tags: