x86-64/frozen - GitHub
november 2011 by adragomir
Frozen is data processing daemon driven by configuration files. It contain several modules for input, output and data processing. It can be used to construct your own database, simple web server, data collector, logger and so on.
Architecture of frozen based on Unix philosophy of simple parts and clean interfaces. This allows usage of existing modules in various places. If some functionality not present, you can write it and connect it to frozen. There is no limit on language to use - if it can compile to shared library - it is very likely to work properly.
data
analytics
c
server
Architecture of frozen based on Unix philosophy of simple parts and clean interfaces. This allows usage of existing modules in various places. If some functionality not present, you can write it and connect it to frozen. There is no limit on language to use - if it can compile to shared library - it is very likely to work properly.
november 2011 by adragomir
google-refine - Project Hosting on Google Code
october 2010 by adragomir
Google Refine is a power tool for cleaning up raw data, making it consistent,
linking it to data registries like Freebase,
augmenting it with more data from other data sources, transforming it into the
required format for other tools to consume, and contributing it back to some data
sources like Freebase. Google Refine is not
a web service but a desktop app that runs on your own computer, so you can process
sensitive data with privacy.
google
tool
opensource
data
linking it to data registries like Freebase,
augmenting it with more data from other data sources, transforming it into the
required format for other tools to consume, and contributing it back to some data
sources like Freebase. Google Refine is not
a web service but a desktop app that runs on your own computer, so you can process
sensitive data with privacy.
october 2010 by adragomir
Txr
november 2009 by adragomir
Txr is an interpreter for a query language (the ``txr language''). A txr query matches text and extracts pieces by binding them to variables that are embedded in the query. Txr can output the raw bindings gathered from the data, or substitute them into a template-driven report.
text
parsing
parser
data
november 2009 by adragomir
Python Package Index : FilterPype 0.2.7dev
july 2009 by adragomir
FilterPype is being used for multi-level data analysis, but could be applied to many other areas where it is difficult to split up a system into small independent parts
python
library
processing
data
july 2009 by adragomir
Orange
april 2009 by adragomir
Orange is a component-based data mining software. It includes a range of preprocessing, modelling and data exploration techniques. It is based on C++ components, that are accessed either directly (not very common), through Python scripts (easier and better), or through GUI objects called Orange Widgets.
datamining
machinelearning
software
programming
statistics
opensource
data
app
april 2009 by adragomir
related tags
ad ⊕ adapter ⊕ aggregator ⊕ ajax ⊕ algorithm ⊕ analysis ⊕ analytics ⊕ api ⊕ app ⊕ application ⊕ architecture ⊕ article ⊕ base ⊕ binary ⊕ btree ⊕ business ⊕ c ⊕ c++ ⊕ chart ⊕ charts ⊕ cli ⊕ climate ⊕ clojure ⊕ cloud ⊕ cms ⊕ code ⊕ commandline ⊕ community ⊕ concurrency ⊕ cool ⊕ cors ⊕ couchdb ⊕ cpu ⊕ crawler ⊕ csvkit ⊕ data ⊖ database ⊕ datacenter ⊕ datamining ⊕ dataparallel ⊕ dataportability ⊕ dataset ⊕ datasets ⊕ date ⊕ deduplication ⊕ design ⊕ development ⊕ distributed ⊕ document ⊕ driven ⊕ embed ⊕ erlang ⊕ essay ⊕ facebook ⊕ file ⊕ format ⊕ framework ⊕ gina ⊕ github ⊕ go ⊕ google ⊕ graph ⊕ hadoop ⊕ health ⊕ hosting ⊕ howto ⊕ identity ⊕ install ⊕ internet ⊕ java ⊕ javascript ⊕ journalism ⊕ json ⊕ language ⊕ library ⊕ linux ⊕ machinelearning ⊕ management ⊕ manipulation ⊕ mapreduce ⊕ mashup ⊕ massaging ⊕ microsoft ⊕ mine ⊕ mobile ⊕ modeling ⊕ network ⊕ nodejs ⊕ nosql ⊕ opensource ⊕ osx ⊕ package ⊕ paper ⊕ parallel ⊕ parser ⊕ parsing ⊕ pdf ⊕ personal ⊕ petewarden ⊕ politics ⊕ population ⊕ powerofdata ⊕ priovacy ⊕ processing ⊕ programming ⊕ project ⊕ protocl ⊕ push ⊕ python ⊕ r ⊕ realtime ⊕ redis ⊕ research ⊕ ruby ⊕ scraping ⊕ server ⊕ shell ⊕ social ⊕ software ⊕ speed ⊕ startup ⊕ statistics ⊕ struct ⊕ structure ⊕ svn_download ⊕ swiftriver ⊕ text ⊕ tool ⊕ tools ⊕ toread ⊕ unix ⊕ url ⊕ utility ⊕ video ⊕ visualization ⊕ vizualization ⊕ vlerq ⊕ web ⊕ web2.0 ⊕ webdev ⊕ webhooks ⊕ WebService ⊕ xmpp ⊕Copy this bookmark: