howthebodyworks + python   1103

Orange – Data Mining Fruitful & Fun
Design your data analysis process through visual programming. Orange remembers your choices, suggests most frequently used combinations, and intelligently chooses which communication channels between widgets to use.

Orange is packed with different visualizations, from scatterplots, bar charts, trees, to dendrograms, networks and heatmaps.

Actions seamlessly propagate through data analysis schema. Selection of data subset in one widget can automatically trigger change of display in the other one. By combining various widgets you can design data analytics framework of choice.

Over 100 widgets and growing. Coverage of most of standard data analysis tasks. Also specialized add-ons are available, like Bioorange for bioinformatics.

With scripting interface in Python, programming new algorithms and developing complex data analysis procedures is pure joy, using and reusing all power found in v
opensource  via:Strangefeatures  statistics  visualization  python  from delicious
9 days ago by howthebodyworks
Nicholas Piël » ZeroMQ an introduction
The bestest introduction to the easiest parallelisation doohickey I know
concurrency  python  programming  zeromq  from delicious
10 weeks ago by howthebodyworks
Writing a Package in Python
the mysterious cargo-cult bits of setup-tools explained for humans
setuptools  packaging  python  from delicious
11 weeks ago by howthebodyworks
Prabhu Ramachandran: Python vs. Cython vs. D (PyD) vs. C++ (SWIG)
Various performance options for you inner loop, including showing how to do D calls from python.
cython  numpy  c++  d  performance  python  from delicious
12 weeks ago by howthebodyworks
Overview — Sarge 0.1 documentation
better subprocess dispatch for python. because explicit is better than implicit, and simplicity is great. but between the subprocess module and perl lies system scripting that's actually both usable and maintainable. Sarge looks like a contender for this role (see also: envoy)
cli  python  from delicious
february 2012 by howthebodyworks
Pattern | CLiPS
combo data-mining/NLP/web-scraping toolkit for instant natural experiments online
api  statistics  nlp  datamining  python  from delicious
january 2012 by howthebodyworks
Refine, reuse and request data | ScraperWiki
Scrape web data sources for fun and profit, collaboratively. Want a provocative data set to scrape?
wiki  ruby.  scraping  data  python  from delicious
january 2012 by howthebodyworks
Home — Tinkerer
perhaps I should port my blog to run on this tidy looking static-site generator.
restructuredtext  blog  python  sphinx  from delicious
january 2012 by howthebodyworks
Fast, native-C Protocol Buffers from Python | @yaaang's blog
how to link python against c++ protobufs that don't rely on slow introspection but Just Go
python  c++  protobuf  from delicious
november 2011 by howthebodyworks
http://users.rcn.com/python/download/matfunc.py
don't have CPython binary extensions? Linear algebra lite with Raymond Hettinger's surprisingly complete pure python matrix library - including solutions, eigenvalues etc
python  linear_algebra  matrix  from delicious
november 2011 by howthebodyworks
Zed A. Shaw
the incorrigible zed shaw, with his patent blend of diatribe and good points. (I still prefer BSD though)
opensource  money  ip  gpl  python  from delicious
october 2011 by howthebodyworks
shogun | A Large Scale Machine Learning Toolbox
includes lots of tasty goodies radiating out from the SVM core, such as generalised markovian model learning, HDF5, string kernels, multi-language bindings and an emphasis on scalability.
svm  learning  ai  markov  r  python  matlab  from delicious
october 2011 by howthebodyworks
LuxRender - home
throws povray to the floor, if you read the specs sheet. include full spectrum modelling (as opp RGBA) full materials simulation (iridescence from first principles). trendy gpu support, animation optimisations, adaptive smoothing, python api, HDR... Uses modern statistical techniques (metropolis sampling of light paths converging in the mean to a correct scene it minimal bias)
physics  opensource  Python  opencl  3d  QT  gpu  c++  monte_carlo  statisics 
september 2011 by howthebodyworks
A Roadmap for Rich Scientific Data Structures in Python | Quant Pythonista
"So, this post is a bit of a brain dump on rich data structures in Python and what needs to happen in the very near future. I care about them for statistical computing (I want to build a statistical computing environment that trounces R) and financial data analysis (all evidence leads me to believe that Python is the best all-around tool for the finance space). Other people in the scientific Python community want them for numerous other applications: geophysics, neuroscience, etc. It’s really hard to make everyone happy with a single solution. But the current state of affairs has me rather anxious. And I’d like to explain why..."
statistics  Python  R  visualisation  db  nosql  has:for 
september 2011 by howthebodyworks
mrjob — mrjob v0.2.7 documentation
mapreduce python wrapper that can deploy to a hadoop cluster, OR can deploy locally, preserving interactive debuggers &c
mapreduce  Python  debug 
september 2011 by howthebodyworks
Sublime Text: The text editor you'll fall in love with
sublimetext has a fair whack of textmate compatibility, native python API, runs on osx, linus and windows, and, most importantly, the author lives in australia so we can find him if he goes awol for a few years like the macromates guy.
editor  TextMate  ide  Python  OSX  windows  linux 
august 2011 by howthebodyworks
Programming Guide — Kivy v1.0.8-dev documentation
From the creators of PyMT, a framework for creating apps, in python, that can be deployed in iOs, windows, android, linux and Os X. Also, interesting snark about pyobjc.
multitouch  Python  portable  ios  windows  android  ui  opengl 
august 2011 by howthebodyworks
pandas: a python data analysis library — pandas v0.4.0dev documentation
"pandas is a python package providing convenient data structures for time series, cross-sectional, or any other form of “labeled” data, with tools for building statistical and econometric models."

handle data in python intuitively. pass to R for fiddly bits.
Python  R  metadata  statistics 
august 2011 by howthebodyworks
GraphLab: A New Parallel Framework for Machine Learning
Library claiming orders of magnitude speed improvement over hadoop and more flexibility than mapreduce. Already has amazon ec2 images ready to fire up. c++ is the native language; POJO claim to be also a first class citizen, and thus other JVM langauges. Hm.
networks  ec2  amazon  c++  mapreduce  scalability  Java  jython  Python 
august 2011 by howthebodyworks
Bulbflow: a New Python Framework for Graph Databases
python graph traversal wrapper (OGM to the ORM world)
ORM  Python  networks 
august 2011 by howthebodyworks
j2labs/dictshield - GitHub
the mapping laer of an ORM made generic. per default, enforce schema on python objects and cast to/from JSON. Sound like it might go with textfile/SCM, MongoDB or CouchDB natively, and supposedly supports RDBMSs.
db  json  Python  ORM  mongodb  couchdb  schema 
july 2011 by howthebodyworks
Artemis « Software « Dmitriy Morozov
nice hack - track bugs as mail messages versioned in your mercurial repo, then browse them in mail client of choice. (practically, mutt)
Python  mercurial  hg  mail  bugtracker 
july 2011 by howthebodyworks
Bugs Everywhere
the other major command-line bugtracker contender based on plaintext, in-repo data. This one is written in python.
scm  dvcs  bugtracker  CLI  Python 
july 2011 by howthebodyworks
FrontPage - pytagsfs
pytagsfs is a FUSE filesystem that arranges media files in a virtual directory structure based on the file tags. For instance, a set of audio files could be mapped to a new directory structure organizing them hierarchically by album, genre, release date, etc. File tags can be changed by moving and renaming virtual files and directories. The virtual files can also be modified directly, and, of course, can be opened and played just like regular files.
Python  metadata  audio  media 
july 2011 by howthebodyworks
mutagen - Python multimedia tagging library - Google Project Hosting
Mutagen is a Python module to handle audio metadata. It supports ASF, FLAC, M4A, Monkey's Audio, MP3, Musepack, Ogg FLAC, Ogg Speex, Ogg Theora, Ogg Vorbis, True Audio, WavPack and OptimFROG audio files. All versions of ID3v2 are supported, and all standard ID3v2.4 frames are parsed. It can read Xing headers to accurately calculate the bitrate and length of MP3s. ID3 and APEv2 tags can be edited regardless of audio format. It can also manipulate Ogg streams on an individual packet/page level
metadata  mp3  audio  Python  CLI 
july 2011 by howthebodyworks
beets: the music geek's media organizer
Beets is the media library management system for obsessive-compulsive music geeks.

The purpose of beets is to get your music collection right once and for all. It catalogs your collection, automatically improving its metadata as it goes using the MusicBrainz database. (It also downloads cover art for albums it imports.) ...If beets doesn't do what you want yet, writing your own plugin is shockingly simple if you know a little Python.
music  Python  metadata  tagging  mp3  audio 
july 2011 by howthebodyworks
Christopher Denter: Python on iPhone & iPad
python as an ios development language, including opengl acceleration
ios  portable  python  objc 
july 2011 by howthebodyworks
ianozsvald/EuroPython2011_HighPerformanceComputing - GitHub
the ultra detailed walkthrough fo the classic python incremental optimisation mandelbrot thingy
Python  performance  cython  numpy  pypy  c++  gpu 
june 2011 by howthebodyworks
EC2 AMI for scientific computing in Python and R « Zero Intelligence Agents
Like many people who crunch numbers frequently, I have increasingly been integrating Amazon’s cloud computing services into my daily workflow. In particular, I have been using their elastic cloud computing (EC2) on a regular basis. The service is an excellent way to offload computationally intensive work from your laptop for literally pennies on the dollar.

One drawback that I have found, however, is there are not any obvious pre-configured images, called AMIs, designed for scientific computing in the languages I use most: Python and R. ... Thus began the odyssey of modifying the StarCluster AMI to more fully support scientific computing in Python in R. I have now uploaded and made public the resulting image, which includes several hundred Python and R packages for scientific computing, statistics, machine learning, data mining and visualization.
Python  numpy  R  ec2  amazon  HowTo 
june 2011 by howthebodyworks
yostudios/Spritemapper - GitHub
Spritemapper is an application that merges multiple images into one and generates CSS positioning for the corresponding slices.
CSS  Python  iage  browser  performance 
june 2011 by howthebodyworks
SimString - A fast and simple algorithm for approximate string matching/retrieval
SimString has the following features:

Fast algorithm for approximate string retrieval. For example, SimString can find strings in Google Web1T unigrams (13,588,391 strings) that have cosine similarity ≧0.7 in 1.10 [ms] per query (on Intel Xeon 5140 2.33 GHz CPU).
100% exact retrieval. Although some algorithms allow misses (false positives) for faster query response, SimString is guaranteed to achieve 100% correct retrieval with fast query response.
Unicode (wchar_t) support. For languages using multi-byte characters, developers can use Unicode characters (wchar_t) instead of single-byte characters (char) as a character representation.
Implementation in C++ header files. Developers can add the funtionality of approximate string retrieval into C++ programs just by including a header file.
Python and Ruby bindings via SWIG. Developers can easily perform approximate string retrieval in scripting languages.
nlp  search  python  ruby  c++  performance 
june 2011 by howthebodyworks
Overview — NetworkX v1.4 documentation
native python graph handling with ultralight api built aroudn hashes.
python  networks  statistics 
may 2011 by howthebodyworks
graph-tool
c++ graph lib for python, optimised for performance.
python  networks  c++  boost  statistics 
may 2011 by howthebodyworks
kennethreitz/requests - GitHub
yet anoher pyhthon HTTP lib that promises to be less annoying than urllib2
python  http 
may 2011 by howthebodyworks
FarMcKon/gitmarks_2 - GitHub
store bookmarks, socially, on github. fork your friends. very genius.
python  del.icio.us  bookmarklet  tagging  git 
may 2011 by howthebodyworks
Welcome — Theano v0.3.1 documentation
"It is good to think of theano.function as the interface to a compiler which builds a callable object from a purely symbolic graph. One of theano’s most important features is that theano.function can optimize a graph and even compile some or all of it into native machine instructions."
python  numpy  performance 
may 2011 by howthebodyworks
Haskell hacking
Looks like Haskell has autoparallelising decorators
python  ruby  concurrency  haskell  functional 
may 2011 by howthebodyworks
mlpy Documentation — mlpy v2.2.2 documentation
mlpy is a high-performance Python package for predictive modeling. It makes extensive use of NumPy (http://scipy.org) to provide fast N-dimensional array manipulation and easy integration of C code. mlpy provides high level procedures that support, with few lines of code, the design of rich Data Analysis Protocols (DAPs) for preprocessing, clustering, predictive classification, regression and feature selection. Methods are available for feature weighting and ranking, data resampling, error evaluation and experiment landscaping. The package includes tools to measure stability in sets of ranked feature lists.
python  learning  ai  classification  numpy 
april 2011 by howthebodyworks
JulienPalard/Pipe - GitHub
"Module enablig a sh like infix syntax (using pipes).

= Introduction =
As an example, here is the solution for the 2nd Euler Project exercise :

"Find the sum of all the even-valued terms in Fibonacci
which do not exceed four million."

Given fib a generator of fibonacci numbers :

euler2 = fib() | where(lambda x: x % 2 == 0)
| take_while(lambda x: x < 4000000)
| add"
python 
april 2011 by howthebodyworks
Software Carpentry » Interlude: Bein
eperimental management, much like sumatra, but with intermediate file-handling.
python  academic  research 
april 2011 by howthebodyworks
Welcome to Elefant — Elefant
machine learning in python, courtesy NICTA again
python  ui  ai  learning 
march 2011 by howthebodyworks
Data Science Toolkit
cloneable EC2 VM for parsing text snippets into names, electorates, coordinates
opensource  datamining  python  virtualization  nlp 
march 2011 by howthebodyworks
Think Stats: Probability and Statistics for Programmers
hawt python statistics. If you don't actually like R THAT much.
python  statistics 
march 2011 by howthebodyworks
Will McGugan: Creating a Virtual Filesystem with Python (and why you need one)
create ad hoc virtual filesystems from within python scripts - set yourself up virtual chroots and in-memory disks for, e.g. test scripts, cloud apps and so on
fs  python  testing 
march 2011 by howthebodyworks
Meliae python memory analysis in Launchpad
profile/debug python memory consumption ("64 megs in imports alone?")
python  performance  profiling  debug 
march 2011 by howthebodyworks
MADlib
statiscial analysis on your database contents: "MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data". Seems to be largely in PL/C, with some bonus python on the front.
python  statistics  postgresql  sql  db  greenplum  mapreduce  ai  classification 
march 2011 by howthebodyworks
Pimp my Interactive Interpreter - Not Invented Here
sweet little hacks to make the default shell behave somewhat like ipython using the native readline power, without requiring ipython bloat and threading explosions.
ipython  cli  python  readline 
march 2011 by howthebodyworks
TileMill | Home
a wholy exciting browser-based map tile designer
brows  mapping  python  javascript  mapnik  node.js 
march 2011 by howthebodyworks
Features — execnet v1.0.9 documentation
multiprocessing workalike for distributing shared-nothing jobs.
python  concurrent  performance 
march 2011 by howthebodyworks
pyOSC - V2_Lab Projects - Trac
justin maynard's recommended OSC implementation for python.
python 
february 2011 by howthebodyworks
Computational Economics
demand-curve-y approach to price finding in python.
python  price  numerical_methods  economics 
february 2011 by howthebodyworks
« earlier      

related tags

2d  3d  @frequent  @todo  ableton  academic  admin  agents  aggregator  agile  agnsw  ai  ajax  akismet  algorithm  amazon  amqp  analytics  android  animation  apache  api  applescript  archives  array  asp  atom  audio  australia  automation  automator  avcontent  backup  bash  bayes  bazaar  bibliography  bibtex  bikes  bioinformatics  blender  blog  bluetooth  bookmark  bookmarklet  boost  bot  brows  browser  bubble_economy  bugtracker  bzr  c  c#  c++  cache  calendar  captcha  chart  chat  cheatsheet  cherrypy  china  citation  classification  cli  client  climate  cloud  clustering  cms  cocoa  codec  coding  collaborative  colour  comet  comments  commercial  community  comparison  compiler  complexity  composition  compsci  computer  computer_vision  concert  concurrency  concurrent  conference  controller  convert  coroutines  couchdb  cpd  cpod  creativecommons  crm  csound  css  csv  ctypes  cuda  cython  d  daemon  darcs  data  datamining  date  db  dear_me  debian  debug  decorators  del.icio.us  deployment  design  development  diagram  dictionary  diff  distribute  distributed  distutils  diy  django  dns  docutils  dojo  dom  dotnet  dps  dry  dsl  dsp  dtrace  dublincore  dvcs  ebook  ec2  eclipse  ECommerce  economics  econophysics  editor  education  email  english  engrish  erlang  essential  event  evented  everyblock  evolution  excel  fabric  facebook  fastcgi  feed  ffmpeg  fibra  filetype:pdf  firebug  firefox  flash  flask  flickr  flowbasedprogramming  font  form  forms  forum  framework  free  freebase  French  friendfeed  fs  functional  ga  gae  game  games  game_theory  gdal  gdata  geek  generator  genetic  geo  geocoding  geography  geometry  gephi  gestuer  gevent  gis  git  github  gnome  google  gpl  gpu  grammar  grammarface  grammarthing  graph  graphics  graphstreaming  greenlet  greenplum  groovy  growl  gtd  gtk  gui  gwt  hack  hadoop  hardware  has:for  haskell  headphones  hg  hid  hierarchy  history  hosting  hotwo  howto  hps  html  html5  http  i18n  iage  ic  ica  ical  id2  id3  ide  IE  image  imagemagick  information_theory  interactive  interface  io  ios  ip  iphone  ipython  irc  ironpython  isp  italy  itunes  jabber  japan  java  javascript  jinja  journal  journalism  jquery  jruby  js  json  jython  kde  kml  korea  lamp  language  last.fm  latex  ldap  learning  leopard  lib  library  lighttpd  linear_algebra  linux  lisp  literate_programming  livingthing  logging  loom  lowbandwidth  lua  lucene  lxml  mac  machine  maema  maemo  mail  mailman  mapnik  mapping  mapreduce  markdown  markets  markov  markup  mashup  mathematica  mathematics  mathml  matlab  matlap  matplotlib  matrix  maxmsp  media  media:document  mediawiki  memcache  mercurial  metadata  methodology  microsoft  midi  migration  mime  mindmap  mock  modeling  modelling  money  mongodb  monte_carlo  mootools  mp3  multitouch  museum  music  mvc  mysql  nativeclient  netcdf  netcultures  network  networks  neuron  newforms-admin  newmedia  news  nginx  ngo  nlp  nltk  node.js  nokia  nose  nosql  numerical_methods  numpy  oai  oauth  objc  ocr  ocsse  office  ogg  ontology  oop  openaccess  opencalais  opencl  opengl  openid  opensource  openstreetmap  optimization  orm  osc  osx  Outliner  p2p  packag  package  packaging  parallel  parking  parkingsun  parser  parsimony  paster  path  patterns  paver  pca  pde  pdf  perforamnce  performance  perl  phd  philosophy  phm  phone  photography  php  physics  pingback  pip  pipes  plist  plone  plugin  podcast  portable  portble  possumpalace  postgresql  postscript  powerlaw  powerpoint  presentation  price  print  privacy  productivity  profiling  programming  projection  protobuf  proxy  pubsub  puredata  pyevolve  pygame  pyglet  pylons  pyobjc  pypy  python  qt  queue  quicksilver  quicktime  r  rails  rdbms  rdf  readline  realtime  reddit  redis  reference  regex  reinforcement_learning  remix  research  rest  restructuredtext  reverseengineering  reverse_engineering  rss  ruby  ruby.  russia  russian  sage  scala  scalability  scaling  scheduling  schema  schmooze  science  scipy  scm  scraper  scraping  scripting  search  security  selenium  semantic  semanticweb  seo  sequencer  serialization  server  setuptools  shell  similarity  simulation  smtp  snippet  soap  social  software  solr  som  soulseek  soundcloud  spam  sparql  sphinx  spotlight  spreadsheet  sql  sqlalchemy  sqlite  squid  ssh  stackless  standards  statisics  statistics  streaming  subversion  sun  supercollider  sustainability  svg  svm  svn  swig  sync  syndication  syntax  synth  synthesis  tagging  Tcl  teachmyself  technology  template  terry  testing  tex  text  textmate  thread  tidy  Tk  torrent  touch  trac  tracker  traits  turbogears  twisted  twitter  typography  ubuntu  ui  uml  unicode  unix  url  usb  validation  verifiability  via:aramgutang  via:cogat  via:complexitydaemon  via:crucialfelix  via:datakid  via:dpwolf  via:ideoforms  via:ixc  via:karpuscul  via:mtchl  via:phmdms  via:simonw  via:strangefeatures  video  vim  virtualenv  virtualization  vision  visualisation  visualization  vj  vnc  vpn  weave  web  web.py  web2.0  webdav  webdev  webfaction  webhook  webkit  webob  webservices  websocket  wifi  wii  wiki  windows  wordpress  wrapper  writing  wsgi  wxpython  wxwindows  WYSIWYG  xhtml  xml  xmlrpc  xpath  xthml  xul  yahoo  yaml  yui  z39.50  zc.buildout  zeromq  zodb  zope 

Copy this bookmark:



description:


tags: