rybesh + distributed   35

Makeflow = Make + Workflow
Makeflow is a workflow engine for executing large complex workflows on clusters, clouds, and grids. Makeflow is very similar to traditional Make, so if you can write a Makefile, then you can write a Makeflow. You can be up and running workflows in a matter of minutes.
fbp  cloud  grid  distributed 
29 days ago by rybesh
RabbitMQ + Cloud Foundry: Cloud Messaging that Just Works | Cloud Foundry
Our goal is to make it as easy as possible for you to use messaging to create and connect to cloud applications. This radically simpler portability of applications and integration of services is the essence of Cloud Foundry. To understand what this means for messaging as a service, let’s look at how people use RabbitMQ in the cloud and then see why we think offering RabbitMQ as a service makes sense.
distributed  messaging 
october 2011 by rybesh
The Intelligent Transport Layer - zeromq
 Ø  The socket library that acts as a concurrency framework.
 Ø  Faster than TCP, for clustered products and supercomputing.
 Ø  Carries messages across inproc, IPC, TCP, and multicast.
 Ø  Connect N-to-N via fanout, pubsub, pipeline, request-reply.
 Ø  Asynch I/O for scalable multicore message-passing apps.
 Ø  Large and active open source community.
 Ø  30+ languages including C, C++, Java, .NET, Python.
 Ø  Most OSes including Linux, Windows, OS X.
 Ø  LGPL free software with full commercial support from iMatix.
distributed  messaging  framework 
october 2011 by rybesh
substack/dnode - GitHub
DNode is an asynchronous object-oriented RPC system for node.js that lets you call remote functions.

It works over network sockets and even in the browser with socket.io.

Plus, there are dnode implementations for perl, ruby, and java, so you can glue together all your backend processes swimmingly.
nodejs  distributed  rpc 
october 2011 by rybesh
hookio/hook.io - GitHub
hook.io creates a distributed node.js EventEmitter that works cross-process / cross-platform / cross-browser. Think of it like a real-time event bus that works anywhere JavaScript is supported.

You create custom i/o scenarios by picking and choosing from an extensive library of tiny, independent, autonomous "hooks" that seamlessly work together.
nodejs  distributed  messaging 
october 2011 by rybesh
Thrift vs. Protocol Buffers | Floating Sun
Most discussions around thrift and protobuf are polarized around performance and/or features, but I’m afraid the discussion is more nuanced than that. My hope with this post is to shed some more light on these two systems and how you should go about evaluating what is best for your needs.
thrift  protobuf  distributed  development 
september 2011 by rybesh
hook.io video tutorials - blog.nodejitsu.com - scaling node.js applications one callback at a time.
hook.io is a full featured i/o framework for node.js. hook.io allows you to create clouds of node.js processes which seamlessly communicate with each other to form fault-tolerant distributed systems.
nodejs  distributed 
september 2011 by rybesh
Bloom Programming Language
Bloom is a language …

… for disorderly distributed programming
because order is expensive

… with powerful consistency analysis
CALM guidance on coordination

… and concise, familiar syntax
based on data-centric languages
distributed  code  ruby  cloud 
april 2011 by rybesh
RabbitMQ, Celery and Django « Happy stream of thoughts
RabbitMQ is an open source AMQP (Advanced Message Queuing Protocol) implementation. Celery, originally built for use with Django, is now a general, distributed task queue system based on an AMQP server, the multiprocessing Python module, and a persistent or non-persistent storage backend. Combining these three solves the problem of building highly-available task queues.
django  celery  howto  distributed 
november 2010 by rybesh
The Apache Cassandra Project
The Apache Cassandra Project develops a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model.
distributed  database  non-relational  hashtable 
march 2010 by rybesh
Cascading
Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.
hadoop  mapreduce  distributed  datawrangling  tools 
august 2009 by rybesh
Supervisor
Supervisor is a client/server system that allows its users to monitor and control a number of processes on UNIX-like operating systems.
unix  distributed  management  tools  python 
june 2009 by rybesh
django-springsteen and Distributed Search @ Irrational Exuberance
Provides a trivial wrapper for Yahoo! BOSS, but goes further and provides a simple framework for building distributed search networks.
search  python  django  yahoo  distributed 
february 2009 by rybesh
Parallel Python - Home
PP is a Python module which provides a mechanism for parallel execution of Python code on SMP (systems with multiple processors or cores) and clusters (computers connected via network).
python  tools  development  library  distributed 
february 2009 by rybesh
persevere-framework - Google Code
Persevere is an open source set of tools for persistence and distributed computing using intuitive standards-based JSON interfaces of HTTP REST, JSON-RPC, JSONPath, and HTTP Channels.
json  distributed  database  framework  web  rest  ajax  opensource 
july 2008 by rybesh
Amazon EC2: Fedora 4 & Red5 v5-0.6.3
This is a stock Amazon Fedora 4 image with JDK 1.6 and Red5 v5-0.6.3.
distributed  video  streaming 
march 2008 by rybesh
CouchDB "Joins"
Views map keys to values, where the key is automatically indexed and can be used to filter and/or sort results. Keys can be any JSON object, which enables a whole class of tricks that are rather non-obvious…
distributed  database  architecture  howto  javascript 
december 2007 by rybesh
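The idea in the CouchDB "Joins" excerpt (views map keys to values, and keys can be structured JSON) can be illustrated with a rough sketch in plain Python. This is not CouchDB's API: the documents, `map_doc`, `build_view`, and `query` are all hypothetical names, and Python list ordering stands in for CouchDB's own key collation, which differs in detail.

```python
# Toy model of a view: a map function emits (key, value) rows, and the
# rows are kept sorted by key so that a key range can be selected.
# All names and documents here are illustrative assumptions.
docs = [
    {"_id": "post1", "type": "post", "title": "Hello"},
    {"_id": "c1", "type": "comment", "post": "post1", "text": "First"},
    {"_id": "c2", "type": "comment", "post": "post1", "text": "Second"},
    {"_id": "post2", "type": "post", "title": "Other"},
]


def map_doc(doc):
    """Emit composite keys so that a post sorts just before its comments."""
    if doc["type"] == "post":
        yield [doc["_id"], 0], doc["title"]
    elif doc["type"] == "comment":
        yield [doc["post"], 1], doc["text"]


def build_view(documents):
    """Run the map function over every document and sort rows by key."""
    rows = [row for doc in documents for row in map_doc(doc)]
    return sorted(rows, key=lambda row: row[0])


def query(view, startkey, endkey):
    """Return rows whose key lies in the inclusive range [startkey, endkey]."""
    return [(key, value) for key, value in view if startkey <= key <= endkey]


view = build_view(docs)
# One range query returns the post followed by its comments: a "join".
post1_rows = query(view, ["post1", 0], ["post1", 1])
for key, value in post1_rows:
    print(key, value)
```

The composite key `[post_id, 0 or 1]` is what makes the trick work: related rows end up adjacent in the sorted index, so a single range lookup retrieves them together.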
Amazon Simple DB
Amazon SimpleDB is a web service for running queries on structured data in real time.
distributed  database  webservices 
december 2007 by rybesh
Hbase/RDF - Lucene-hadoop Wiki
An Hbase subsystem for RDF, called HbaseRDF, which uses Hbase + MapReduce to store RDF data and execute queries (e.g., SPARQL) on them.
semweb  distributed  database 
october 2007 by rybesh
organgrinder: A Terracotta Cluster on Amazon's Elastic Compute Cloud
An application written for a single node can, using Terracotta, be transparently deployed on an EC2 cluster.
distributed  java  webservices  performance 
september 2007 by rybesh
CouchDb Project Website
It makes it simple to build collaborative applications that can be replicated offline by users, with full interactivity (query, add, update, delete), and later synced up.
web  distributed  database  code  rest 
september 2007 by rybesh
Harmony: framework for reconciling disconnected updates to heterogeneous, replicated XML
A major component of the proposed work concerns developing the foundations of bi-directional programming languages, in which every program denotes a pair of functions---one for extracting a view of some complex data structure, and another for ``putting ba
distributed  database  xml  code  theory  language  research  opensource 
august 2007 by rybesh
Map-Reduce for Machine Learning on Multicore
In this paper, we develop a broadly applicable parallel programming method, one that is easily applied to many different learning algorithms.
machinelearning  distributed  grid  research 
august 2007 by rybesh
Hbase - Lucene-hadoop Wiki
Data is organized into tables, rows and columns, but a query language like SQL is not supported. An Iterator-like interface is available for scanning through a row range (and of course there is an ability to retrieve a column value for a specific key).
distributed  grid  database  quantitative  research  analysis  tools  opensource 
august 2007 by rybesh
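The access pattern described in the Hbase excerpt (no SQL, just lookups by key and an iterator over a row range) might look roughly like the following in-memory sketch. `SortedTable` and its methods are hypothetical and are not the Hbase client API; the exclusive stop row is an assumption of this sketch.

```python
import bisect


class SortedTable:
    """Toy table: rows kept in key order, each row a dict of columns."""

    def __init__(self):
        self._keys = []   # row keys, always sorted
        self._rows = {}   # row key -> {column: value}

    def put(self, row_key, column, value):
        if row_key not in self._rows:
            bisect.insort(self._keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key][column] = value

    def get(self, row_key, column):
        """Retrieve a single column value for a specific key."""
        return self._rows.get(row_key, {}).get(column)

    def scan(self, start_row, stop_row):
        """Iterate rows with start_row <= key < stop_row, in key order."""
        lo = bisect.bisect_left(self._keys, start_row)
        hi = bisect.bisect_left(self._keys, stop_row)
        for key in self._keys[lo:hi]:
            yield key, self._rows[key]


table = SortedTable()
for i, name in enumerate(["a", "b", "c", "d"], start=1):
    table.put(f"row{i}", "info:name", name)

for key, columns in table.scan("row2", "row4"):
    print(key, columns)
```

Because rows are stored in key order, a range scan only needs two binary searches and a slice, which is why key design matters so much in this style of store.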
Pig | Yahoo! Research
The highest abstraction layer in Pig is a query language interface, whereby users express data analysis tasks as queries, in the style of SQL or Relational Algebra.
distributed  grid  database  quantitative  research  analysis  tools  opensource 
august 2007 by rybesh
Running Hadoop MapReduce on Amazon EC2 and Amazon S3
This paper describes how to use Amazon Web Services and Hadoop to run an ad hoc analysis on a large collection of web access logs that otherwise would have cost a prohibitive amount in either time or money.
distributed  grid  quantitative  research  nlp  analysis  howto 
august 2007 by rybesh
Andrea Grover: Phantom Captain
With the cooperative intention of projects such as these, crowdsourcing as a method of artistic production appears to be heir to the throne of 1960s and 70s happenings and participatory art.
social  art  community  distributed  collaboration 
march 2007 by rybesh
The Sheep Market
Thousands of workers on Amazon's Mechanical Turk webservice were paid two cents to "draw a sheep facing to the left." Their sheep drawings were collected and printed on collectable stamps.
art  criticism  distributed  drawing  ucla  yahoo  work 
july 2006 by rybesh
MultimediaN/E-Culture
The objective of this project is the development of a set of e-culture demonstrators providing multimedia access to distributed collections of cultural heritage objects.
multimedia  research  museum  archives  distributed  semweb  interface  culture 
july 2006 by rybesh
LID
LID is a quite simple, but powerful technology that empowers individuals to keep control over and manage their on-line digital identities.
identity  standards  web  distributed  services  webservices  reference 
july 2006 by rybesh
Hadoop
Hadoop is a collection of code libraries and programs useful for creating very large distributed systems.
distributed  grid  java  opensource  code 
june 2006 by rybesh
NetKernel Open Source Community
From the microkernel to the upper layers NetKernel uses a generalization of REST, the basis for the successful operation of the World Wide Web, and applies it to the finest granularity of resource oriented software composition.
web  architecture  rest  development  tools  distributed  webservices 
june 2006 by rybesh
dotSUB.com
dotSUB provides free browser based tools that allow anyone to translate films from one language into countless other languages.
annotation  collaboration  community  distributed  subtitle  tools  timetags 
april 2006 by rybesh
memcached: a distributed memory object caching system
memcached is a high-performance, distributed memory object caching system, generic in nature, but intended for use in speeding up dynamic web applications by alleviating database load.
web  performance  opensource  code  distributed  cache  php  perl  python  ruby  java 
march 2006 by rybesh
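The way a cache such as memcached alleviates database load is commonly the "cache-aside" pattern: check the cache first and fall back to the database only on a miss. Below is a minimal sketch of that pattern, with an in-process `TinyCache` standing in for a real memcached client; `TinyCache`, `slow_db_lookup`, and `get_user` are hypothetical names, not part of memcached or any client library.

```python
import time


class TinyCache:
    """In-process stand-in for a cache server, with optional expiry."""

    def __init__(self):
        self._items = {}  # key -> (value, expiry time or None)

    def get(self, key):
        entry = self._items.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if expires_at is not None and time.monotonic() >= expires_at:
            del self._items[key]  # expired: treat as a miss
            return None
        return value

    def set(self, key, value, ttl=None):
        expires_at = None if ttl is None else time.monotonic() + ttl
        self._items[key] = (value, expires_at)


db_calls = {"count": 0}  # counts how often the "database" is hit


def slow_db_lookup(user_id):
    """Pretend database query (hypothetical)."""
    db_calls["count"] += 1
    return {"id": user_id, "name": f"user-{user_id}"}


def get_user(cache, user_id):
    """Cache-aside read: try the cache, then the database, then fill the cache."""
    key = f"user:{user_id}"
    user = cache.get(key)
    if user is None:
        user = slow_db_lookup(user_id)
        cache.set(key, user, ttl=60)
    return user


cache = TinyCache()
first = get_user(cache, 7)
second = get_user(cache, 7)  # served from the cache, no database call
print(db_calls["count"])
```

With a real deployment the cache lives in separate processes shared by many web servers, but the read path in `get_user` would keep the same shape.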
