howthebodyworks + r   35

Create maps with maptools R package | Statisfaction
How to do stats on the surface of the earth with beautiful visualizations
mapping  gis  r  statistics  france  from delicious
december 2011 by howthebodyworks
shogun | A Large Scale Machine Learning Toolbox
includes lots of tasty goodies radiating out from the SVM core, such as generalised markovian model learning, HDF5, string kernels, multi-language bindings and an emphasis on scalability.
svm  learning  ai  markov  r  python  matlab  from delicious
october 2011 by howthebodyworks
Deducer: A GUI for R - Deducer Manual
interactive exploratory data munging console for the more intuitive flavour of R user.
r  gui  java  from delicious
october 2011 by howthebodyworks
A Roadmap for Rich Scientific Data Structures in Python | Quant Pythonista
"So, this post is a bit of a brain dump on rich data structures in Python and what needs to happen in the very near future. I care about them for statistical computing (I want to build a statistical computing environment that trounces R) and financial data analysis (all evidence leads me to believe that Python is the best all-around tool for the finance space). Other people in the scientific Python community want them for numerous other applications: geophysics, neuroscience, etc. It’s really hard to make everyone happy with a single solution. But the current state of affairs has me rather anxious. And I’d like to explain why..."
statistics  Python  R  visualisation  db  nosql  has:for 
september 2011 by howthebodyworks
pandas: a python data analysis library — pandas v0.4.0dev documentation
"pandas is a python package providing convenient data structures for time series, cross-sectional, or any other form of “labeled” data, with tools for building statistical and econometric models."

handle data in python intuitively. pass to R for fiddly bits.
Python  R  metadata  statistics 
august 2011 by howthebodyworks
Automating R Scripts on Amazon EC2 | Travis Nelson's Blog
Overview:

How to set up R on an EC2 instance of Ubuntu 11.04 (Natty Narwhal)
How to set up the Apache Tomcat 6.0 web server and configure it with basic authentication so that we can view our output from R on a password-protected webpage
How to automate your R scripts to run as a daily cron job.
ec2  R  amazon  HowTo 
june 2011 by howthebodyworks
EC2 AMI for scientific computing in Python and R « Zero Intelligence Agents
Like many people who crunch numbers frequently, I have increasingly been integrating Amazon’s cloud computing services into my daily workflow. In particular, I have been using their elastic cloud computing (EC2) on a regular basis. The service is an excellent way to offload computationally intensive work from your laptop for literally pennies on the dollar.

One drawback that I have found, however, is there are not any obvious pre-configured images, called AMIs, designed for scientific computing in the languages I use most: Python and R. ... Thus began the odyssey of modifying the StarCluster AMI to more fully support scientific computing in Python and R. I have now uploaded and made public the resulting image, which includes several hundred Python and R packages for scientific computing, statistics, machine learning, data mining and visualization.
Python  numpy  R  ec2  amazon  HowTo 
june 2011 by howthebodyworks
Cerebral Mastication » Blog Archive » Shell scripting EC2 for fun and profit
Lately I’ve been doing some work with creating ad-hoc clusters of EC2 machines. My ultimate goal is to create a simple way to spin up a cluster of EC2 machines for use with Bryan Lewis’s very cool doRedis backend for the R foreach package. But that’s a whole other post. What I was scratching my head about today was that I’d really just like to, with a single command, spin up an EC2 instance, wait for it to come up, and then ssh into it. I do this iteration about 20 times a day when I’m testing things, so it seemed to make sense to shell script it.
To do this, one needs the EC2 command line tools installed on your workstation. In Ubuntu that’s as easy as `sudo apt-get install ec2-api-tools`
ec2  mapreduce  R  HowTo 
june 2011 by howthebodyworks
ExploringDataBlog: Interestingness Measures
categorical data has other Shannon-information-like estimators of "interestingness"
statistics  r  information_theory  bubble_economy 
may 2011 by howthebodyworks
One R Tip A Day: R Upgrade on Mac Os X 10.5.5 (Leopard)
general instructions for upgrading r and keeping your packages magically installed despite its zany clunky library handling.
r 
april 2011 by howthebodyworks
ConnectMV | Process Improvement using Data
interesting bunch of courses on statistical data fiddling
r  howto  statistics 
march 2011 by howthebodyworks
NetCDF (network Common Data Form)
storage, search and interchange for large array data
MATLAB  ObjC  Perl  Python  R  Ruby  Tcl  Tk  c  c++  data  datamining  java 
january 2011 by howthebodyworks
Using Python (and R) to calculate Linear Regressions
best, simplest example of R<->python interaction i've seen
r  python 
december 2010 by howthebodyworks
“simply start over and build something better” « Xi'an's Og
the case for replacing the language of R with something else that can access the same statistical power
r  statistics  compsci  coding 
november 2010 by howthebodyworks
Power-law Distributions
how to find, in a statistically valid fashion, that your data fits a power law.
statistics  scaling  powerlaw  matlab  python  r 
november 2010 by howthebodyworks
http://rpy.sourceforge.net/rpy2.html
the python<->R interface is being rebooted
r  python  api  statistics 
october 2010 by howthebodyworks
Programmers Need To Learn Statistics Or I Will Kill Them All
This guy is reliably entertaining. In this case, it's a useful, arrogant rant about what you need to know about stats to push beyond the "mean request time" or whatever. Highlights:

>The next day we had IBM fixing the problem (turned out to be a single update index command) and we all kept our jobs. That’s what a proper analysis method can do for you.

>still I see software developers begging for gazillions of dollars to buy some crap tool that doesn’t even mention “standard deviation”, but throws “user” around like it’s Dr. Phil treating Robert Downey Jr. for heroin addiction.
r  reference  statistics  performance  compsci  dear_me 
october 2010 by howthebodyworks
What is data science? - O'Reilly Radar
O'Reilly gives an overview of large dataset tools, including some nice machine learning ones
data  mapreduce  opensource  methodology  hps  hadoop  java  r  python  ai  via:datakid  schmooze  academic  phd 
june 2010 by howthebodyworks
R Videos
learn r monkey-see-monkey-do style
howto  r  statistics 
march 2010 by howthebodyworks
ananelson.com
interesting looking economics simulation person
schmooze  economics  simulation  visualization  r  ruby  amazon  ec2  ireland 
february 2010 by howthebodyworks
