adragomir + data   52

x86-64/frozen - GitHub
Frozen is data processing daemon driven by configuration files. It contain several modules for input, output and data processing. It can be used to construct your own database, simple web server, data collector, logger and so on.

Architecture of frozen based on Unix philosophy of simple parts and clean interfaces. This allows usage of existing modules in various places. If some functionality not present, you can write it and connect it to frozen. There is no limit on language to use - if it can compile to shared library - it is very likely to work properly.
data  analytics  c  server 
november 2011 by adragomir
google-refine - Project Hosting on Google Code
Google Refine is a power tool for cleaning up raw data, making it consistent,
linking it to data registries like Freebase,
augmenting it with more data from other data sources, transforming it into the
required format for other tools to consume, and contributing it back to some data
sources like Freebase. Google Refine is not
a web service but a desktop app that runs on your own computer, so you can process
sensitive data with privacy.
google  tool  opensource  data 
october 2010 by adragomir
Txr
Txr is an interpreter for a query language (the ``txr language''). A txr query matches text and extracts pieces by binding them to variables that are embedded in the query. Txr can output the raw bindings gathered from the data, or substitute them into a template-driven report.
text  parsing  parser  data 
november 2009 by adragomir
Python Package Index : FilterPype 0.2.7dev
FilterPype is being used for multi-level data analysis, but could be applied to many other areas where it is difficult to split up a system into small independent parts
python  library  processing  data 
july 2009 by adragomir
Orange
Orange is a component-based data mining software. It includes a range of preprocessing, modelling and data exploration techniques. It is based on C++ components, that are accessed either directly (not very common), through Python scripts (easier and better), or through GUI objects called Orange Widgets.
datamining  machinelearning  software  programming  statistics  opensource  data  app 
april 2009 by adragomir

related tags

ad  adapter  aggregator  ajax  algorithm  analysis  analytics  api  app  application  architecture  article  base  binary  btree  business  c  c++  chart  charts  cli  climate  clojure  cloud  cms  code  commandline  community  concurrency  cool  cors  couchdb  cpu  crawler  csvkit  data  database  datacenter  datamining  dataparallel  dataportability  dataset  datasets  date  deduplication  design  development  distributed  document  driven  embed  erlang  essay  facebook  file  format  framework  gina  github  go  google  graph  hadoop  health  hosting  howto  identity  install  internet  java  javascript  journalism  json  language  library  linux  machinelearning  management  manipulation  mapreduce  mashup  massaging  microsoft  mine  mobile  modeling  network  nodejs  nosql  opensource  osx  package  paper  parallel  parser  parsing  pdf  personal  petewarden  politics  population  powerofdata  priovacy  processing  programming  project  protocl  push  python  r  realtime  redis  research  ruby  scraping  server  shell  social  software  speed  startup  statistics  struct  structure  svn_download  swiftriver  text  tool  tools  toread  unix  url  utility  video  visualization  vizualization  vlerq  web  web2.0  webdev  webhooks  WebService  xmpp 

Copy this bookmark:



description:


tags: