rybesh + distributed 35
Makeflow = Make + Workflow
29 days ago by rybesh
Makeflow is a workflow engine for executing large complex workflows on clusters, clouds, and grids. Makeflow is very similar to traditional Make, so if you can write a Makefile, then you can write a Makeflow. You can be up and running workflows in a matter of minutes.
fbp
cloud
grid
distributed
29 days ago by rybesh
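As a rough illustration of the Make-style model described above, where each rule names a target, its sources, and a command, here is a minimal Python sketch. It is not Makeflow syntax and not how Makeflow is implemented; the rule table, file names, and the execution_order helper are all hypothetical. It only shows that rules can be ordered so that every target runs after the rules producing its inputs.

```python
from graphlib import TopologicalSorter

# Hypothetical rules: target -> (sources, command). File names are made up.
RULES = {
    "out.txt": (["part1.txt", "part2.txt"], "cat part1.txt part2.txt > out.txt"),
    "part1.txt": (["input.txt"], "head -n 10 input.txt > part1.txt"),
    "part2.txt": (["input.txt"], "tail -n 10 input.txt > part2.txt"),
}

def execution_order(rules):
    """Return targets in an order where every target follows its sources."""
    # Only sources that are themselves targets create a dependency edge;
    # plain input files such as input.txt are assumed to exist already.
    graph = {
        target: [s for s in sources if s in rules]
        for target, (sources, _cmd) in rules.items()
    }
    return list(TopologicalSorter(graph).static_order())

order = execution_order(RULES)
```

In this sketch part1.txt and part2.txt do not depend on each other, which is the kind of independence a workflow engine could use to run rules on different machines.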
RabbitMQ + Cloud Foundry: Cloud Messaging that Just Works | Cloud Foundry
october 2011 by rybesh
Our goal is to make it as easy as possible for you to use messaging to create and connect to cloud applications. This radically simpler portability of applications and integration of services is the essence of Cloud Foundry. To understand what this means for messaging as a service, let’s look at how people use RabbitMQ in the cloud and then see why we think offering RabbitMQ as a service makes sense.
distributed
messaging
october 2011 by rybesh
The Intelligent Transport Layer - zeromq
october 2011 by rybesh
Ø The socket library that acts as a concurrency framework.
Ø Faster than TCP, for clustered products and supercomputing.
Ø Carries messages across inproc, IPC, TCP, and multicast.
Ø Connect N-to-N via fanout, pubsub, pipeline, request-reply.
Ø Asynch I/O for scalable multicore message-passing apps.
Ø Large and active open source community.
Ø 30+ languages including C, C++, Java, .NET, Python.
Ø Most OSes including Linux, Windows, OS X.
Ø LGPL free software with full commercial support from iMatix.
distributed
messaging
framework
october 2011 by rybesh
substack/dnode - GitHub
october 2011 by rybesh
DNode is an asynchronous object-oriented RPC system for node.js that lets you call remote functions.
It works over network sockets and even in the browser with socket.io.
Plus, there are dnode implementations for perl, ruby, and java, so you can glue together all your backend processes swimmingly.
nodejs
distributed
rpc
october 2011 by rybesh
hookio/hook.io - GitHub
october 2011 by rybesh
hook.io creates a distributed node.js EventEmitter that works cross-process / cross-platform / cross-browser. Think of it like a real-time event bus that works anywhere JavaScript is supported.
You create custom i/o scenarios by picking and choosing from an extensive library of tiny, independent, autonomous "hooks" that seamlessly work together.
nodejs
distributed
messaging
october 2011 by rybesh
Thrift vs. Protocol Buffers | Floating Sun
september 2011 by rybesh
Most discussions of thrift and protobuf are polarized around performance and/or features, but I’m afraid the discussion is more nuanced than that. My hope with this post is to shed some more light on these two systems and how you should go about evaluating what is best for your needs.
thrift
protobuf
distributed
development
september 2011 by rybesh
hook.io video tutorials - blog.nodejitsu.com - scaling node.js applications one callback at a time.
september 2011 by rybesh
hook.io is a full featured i/o framework for node.js. hook.io allows you to create clouds of node.js processes which seamlessly communicate with each other to form fault-tolerant distributed systems.
nodejs
distributed
september 2011 by rybesh
Bloom Programming Language
april 2011 by rybesh
Bloom is a language …
… for disorderly distributed programming
because order is expensive
… with powerful consistency analysis
CALM guidance on coordination
… and concise, familiar syntax
based on data-centric languages
distributed
code
ruby
cloud
april 2011 by rybesh
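The "order is expensive" point above can be illustrated with a small Python sketch (Python rather than Bloom, and not Bloom's API). It assumes the usual reading of CALM: logic that only accumulates facts reaches the same result whatever order messages arrive in, so it needs no coordination. The merge function and the sample batches are hypothetical.

```python
from itertools import permutations

def merge(messages):
    """Accumulate message batches into a set; union is commutative and idempotent."""
    state = set()
    for batch in messages:
        state |= batch
    return frozenset(state)

# Three hypothetical message batches, delivered in every possible order.
batches = [{"a", "b"}, {"b", "c"}, {"d"}]
results = {merge(order) for order in permutations(batches)}
```

Every delivery order yields the same final set here. A non-monotonic step, such as acting on the absence of a fact, would not have this property.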
RabbitMQ, Celery and Django « Happy stream of thoughts
november 2010 by rybesh
RabbitMQ is an open source AMQP (Advanced Message Queuing Protocol) implementation. Celery, originally built for use with Django, is now a general, distributed task queue system based on an AMQP server, the Python multiprocessing module, and a persistent or non-persistent storage backend. Combining these three solves the problem of building highly-available task queues.
django
celery
howto
distributed
november 2010 by rybesh
The Apache Cassandra Project
march 2010 by rybesh
The Apache Cassandra Project develops a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model.
distributed
database
non-relational
hashtable
march 2010 by rybesh
Cascading
august 2009 by rybesh
Cascading is a feature-rich API for defining and executing complex, scale-free, and fault-tolerant data processing workflows on a Hadoop cluster.
hadoop
mapreduce
distributed
datawrangling
tools
august 2009 by rybesh
Supervisor
june 2009 by rybesh
Supervisor is a client/server system that allows its users to monitor and control a number of processes on UNIX-like operating systems.
unix
distributed
management
tools
python
june 2009 by rybesh
django-springsteen and Distributed Search @ Irrational Exuberance
february 2009 by rybesh
Provides a trivial wrapper for Yahoo! BOSS, but goes further and provides a simple framework for building distributed search networks.
search
python
django
yahoo
distributed
february 2009 by rybesh
Parallel Python - Home
february 2009 by rybesh
PP is a Python module which provides a mechanism for parallel execution of Python code on SMP (systems with multiple processors or cores) and clusters (computers connected via network).
python
tools
development
library
distributed
february 2009 by rybesh
persevere-framework - Google Code
july 2008 by rybesh
Persevere is an open source set of tools for persistence and distributed computing using intuitive standards-based JSON interfaces: HTTP REST, JSON-RPC, JSONPath, and HTTP Channels.
json
distributed
database
framework
web
rest
ajax
opensource
july 2008 by rybesh
Amazon EC2: Fedora 4 & Red5 v5-0.6.3
march 2008 by rybesh
This is a stock Amazon Fedora 4 image with JDK 1.6 and Red5 v5-0.6.3.
distributed
video
streaming
march 2008 by rybesh
CouchDB "Joins"
december 2007 by rybesh
Views map keys to values, where the key is automatically indexed and can be used to filter and/or sort results. Keys can be any JSON object, which enables a whole class of tricks that are rather non-obvious…
distributed
database
architecture
howto
javascript
december 2007 by rybesh
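One of the tricks that structured keys make possible can be sketched in Python. This is not CouchDB code (CouchDB views are normally written in JavaScript), and the documents, field names, and helper functions are assumptions for illustration. Posts are emitted under the key [post_id, 0] and comments under [post_id, 1], so a single key-range query returns a post followed by its comments. Python's list comparison stands in for CouchDB's own key ordering.

```python
# Hypothetical documents; field names are assumptions for illustration.
DOCS = [
    {"_id": "c2", "type": "comment", "post": "p1", "text": "Second"},
    {"_id": "p1", "type": "post", "title": "Hello"},
    {"_id": "c1", "type": "comment", "post": "p1", "text": "First"},
    {"_id": "p2", "type": "post", "title": "Other"},
]

def map_doc(doc):
    """Emit (key, value) pairs, in the spirit of a view's map function."""
    if doc["type"] == "post":
        yield [doc["_id"], 0], doc["title"]
    elif doc["type"] == "comment":
        yield [doc["post"], 1], doc["text"]

def build_view(docs):
    """Collect emitted rows and sort them by key, as a view index would."""
    rows = [row for doc in docs for row in map_doc(doc)]
    return sorted(rows, key=lambda row: row[0])

def query(view, startkey, endkey):
    """Return rows whose key falls in the inclusive range [startkey, endkey]."""
    return [(key, value) for key, value in view if startkey <= key <= endkey]

view = build_view(DOCS)
post_with_comments = query(view, ["p1", 0], ["p1", 1])
```

The post sorts first because its key ends in 0; its comments follow because theirs end in 1.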
Amazon Simple DB
december 2007 by rybesh
Amazon SimpleDB is a web service for running queries on structured data in real time.
distributed
database
webservices
december 2007 by rybesh
Amazon EC2 Technical Documentation
november 2007 by rybesh
Docs for Amazon's Elastic Compute Cloud.
distributed
computers
reference
documentation
webservices
november 2007 by rybesh
Hbase/RDF - Lucene-hadoop Wiki
october 2007 by rybesh
An Hbase subsystem for RDF, called HbaseRDF, which uses Hbase + MapReduce to store RDF data and execute queries (e.g., SPARQL) on them.
semweb
distributed
database
october 2007 by rybesh
organgrinder: A Terracotta Cluster on Amazon's Elastic Compute Cloud
september 2007 by rybesh
An application written for a single node can, using Terracotta, be transparently deployed on an EC2 cluster.
distributed
java
webservices
performance
september 2007 by rybesh
CouchDb Project Website
september 2007 by rybesh
It makes it simple to build collaborative applications that can be replicated offline by users, with full interactivity (query, add, update, delete), and later synced up.
web
distributed
database
code
rest
september 2007 by rybesh
Harmony: framework for reconciling disconnected updates to heterogeneous, replicated XML
august 2007 by rybesh
A major component of the proposed work concerns developing the foundations of bi-directional programming languages, in which every program denotes a pair of functions---one for extracting a view of some complex data structure, and another for ``putting ba
distributed
database
xml
code
theory
language
research
opensource
august 2007 by rybesh
Map-Reduce for Machine Learning on Multicore
august 2007 by rybesh
In this paper, we develop a broadly applicable parallel programming method, one that is easily applied to many different learning algorithms.
machinelearning
distributed
grid
research
august 2007 by rybesh
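A minimal Python sketch of the general idea follows, assuming the learning algorithm can be written as sums over the training data so that each worker handles one chunk. The one-parameter least-squares model, the sample data, and the function names are chosen here purely for illustration and are not taken from the paper. The built-in map runs sequentially; a parallel map could replace it without changing the result.

```python
from functools import reduce

def map_chunk(chunk):
    """Per-chunk partial sums for a one-parameter least-squares fit y ~ w * x."""
    sxy = sum(x * y for x, y in chunk)
    sxx = sum(x * x for x, _y in chunk)
    return (sxy, sxx)

def combine(a, b):
    """Reduce step: add two pairs of partial sums."""
    return (a[0] + b[0], a[1] + b[1])

def fit(chunks):
    """Closed-form slope w = sum(x*y) / sum(x*x) from the combined sums."""
    sxy, sxx = reduce(combine, map(map_chunk, chunks))
    return sxy / sxx

# Hypothetical data lying exactly on y = 2x, split into two chunks.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
chunks = [data[:2], data[2:]]
w = fit(chunks)
```

Because only the partial sums cross chunk boundaries, the answer does not depend on how the data is split.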
Hbase - Lucene-hadoop Wiki
august 2007 by rybesh
Data is organized into tables, rows and columns, but a query language like SQL is not supported. An Iterator-like interface is available for scanning through a row range (and of course there is an ability to retrieve a column value for a specific key).
distributed
grid
database
quantitative
research
analysis
tools
opensource
august 2007 by rybesh
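The access pattern described above, point lookups by row and column plus iterator-style scans over a row range, can be sketched in Python. This is not the Hbase API; the TinyTable class, its method names, and the half-open scan range are all assumptions made for the example.

```python
from bisect import bisect_left

class TinyTable:
    """Toy in-memory table: sorted row keys, each row a dict of columns."""

    def __init__(self):
        self._keys = []  # row keys, kept in sorted order
        self._rows = {}  # row key -> {column: value}

    def put(self, row, column, value):
        if row not in self._rows:
            self._keys.insert(bisect_left(self._keys, row), row)
            self._rows[row] = {}
        self._rows[row][column] = value

    def get(self, row, column):
        """Return one column value for a row key, or None if absent."""
        return self._rows.get(row, {}).get(column)

    def scan(self, start, stop):
        """Yield (row, columns) for start <= row < stop, in key order."""
        i = bisect_left(self._keys, start)
        while i < len(self._keys) and self._keys[i] < stop:
            key = self._keys[i]
            yield key, self._rows[key]
            i += 1

table = TinyTable()
for key in ["row3", "row1", "row2", "row4"]:
    table.put(key, "info:name", key.upper())
scanned = [key for key, _cols in table.scan("row2", "row4")]
```

Keeping the row keys sorted is what makes the range scan cheap: the scan seeks to the start key once and then walks forward.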
Pig | Yahoo! Research
august 2007 by rybesh
The highest abstraction layer in Pig is a query language interface, whereby users express data analysis tasks as queries, in the style of SQL or Relational Algebra.
distributed
grid
database
quantitative
research
analysis
tools
opensource
august 2007 by rybesh
Running Hadoop MapReduce on Amazon EC2 and Amazon S3
august 2007 by rybesh
This paper describes how to use Amazon Web Services and Hadoop to run an ad hoc analysis on a large collection of web access logs that otherwise would have cost a prohibitive amount in either time or money.
distributed
grid
quantitative
research
nlp
analysis
howto
august 2007 by rybesh
Andrea Grover: Phantom Captain
march 2007 by rybesh
With the cooperative intention of projects such as these, crowdsourcing as a method of artistic production appears to be heir to the throne of 1960s and 70s happenings and participatory art.
social
art
community
distributed
collaboration
march 2007 by rybesh
The Sheep Market
july 2006 by rybesh
Thousands of workers on Amazon's Mechanical Turk webservice were paid two cents to "draw a sheep facing to the left." Their sheep drawings were collected and printed on collectable stamps.
art
criticism
distributed
drawing
ucla
yahoo
work
july 2006 by rybesh
MultimediaN/E-Culture
july 2006 by rybesh
The objective of this project is the development of a set of e-culture demonstrators providing multimedia access to distributed collections of cultural heritage objects.
multimedia
research
museum
archives
distributed
semweb
interface
culture
july 2006 by rybesh
LID
july 2006 by rybesh
LID is a simple but powerful technology that empowers individuals to keep control over and manage their on-line digital identities.
identity
standards
web
distributed
services
webservices
reference
july 2006 by rybesh
Hadoop
june 2006 by rybesh
Hadoop is a collection of code libraries and programs useful for creating very large distributed systems.
distributed
grid
java
opensource
code
june 2006 by rybesh
NetKernel Open Source Community
june 2006 by rybesh
From the microkernel to the upper layers NetKernel uses a generalization of REST, the basis for the successful operation of the World Wide Web, and applies it to the finest granularity of resource oriented software composition.
web
architecture
rest
development
tools
distributed
webservices
june 2006 by rybesh
dotSUB.com
april 2006 by rybesh
dotSUB provides free browser based tools that allow anyone to translate films from one language into countless other languages.
annotation
collaboration
community
distributed
subtitle
tools
timetags
april 2006 by rybesh
memcached: a distributed memory object caching system
march 2006 by rybesh
memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load.
web
performance
opensource
code
distributed
cache
php
perl
python
ruby
java
march 2006 by rybesh
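How a distributed cache takes load off a database can be sketched in Python. This does not use a memcached client library: plain dicts stand in for cache nodes, and the server names, the CRC-based key placement, and the slow_db_lookup function are hypothetical. The pattern shown is to check the cache first and fall back to the database only on a miss.

```python
import zlib

SERVERS = ["cache-a", "cache-b", "cache-c"]  # hypothetical node names
caches = {name: {} for name in SERVERS}      # dicts stand in for cache nodes
db_reads = 0                                 # counts simulated database hits

def server_for(key):
    """Pick a cache node from a hash of the key (simple modulo placement)."""
    return SERVERS[zlib.crc32(key.encode("utf-8")) % len(SERVERS)]

def slow_db_lookup(key):
    """Stand-in for an expensive database query."""
    global db_reads
    db_reads += 1
    return f"value-for-{key}"

def get(key):
    """Return the cached value, loading it from the database on a miss."""
    cache = caches[server_for(key)]
    if key not in cache:
        cache[key] = slow_db_lookup(key)
    return cache[key]

first = get("user:42")
second = get("user:42")  # served from the cache, no second database read
```

Modulo placement is the simplest choice for a sketch. It remaps most keys whenever the server list changes, which is why real deployments often prefer consistent hashing.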