howthebodyworks + r 35
Create maps with maptools R package | Statisfaction
december 2011 by howthebodyworks
How to do stats on the surface of he earth with beautiful visualizations
mapping
gis
r
statistics
france
from delicious
december 2011 by howthebodyworks
shogun | A Large Scale Machine Learning Toolbox
october 2011 by howthebodyworks
includes lots of tasty goodies radiating out from the SVM core, such as generalised markovian model learning, HDF5, string kernels, multi-language bindings and an emphasis on scalability.
svm
learning
ai
markov
r
python
matlab
from delicious
october 2011 by howthebodyworks
Deducer: A GUI for R - Deducer Manual
october 2011 by howthebodyworks
interactive exploratory data munging console for the more intuitive flavour of R user.
r
gui
java
from delicious
october 2011 by howthebodyworks
Welcome to a Little Book of R for Time Series! — Time Series v0.1 documentation
september 2011 by howthebodyworks
A gentle climb up a staircase, atop which is ARIMA.
R
statistics
september 2011 by howthebodyworks
A Roadmap for Rich Scientific Data Structures in Python | Quant Pythonista
september 2011 by howthebodyworks
"So, this post is a bit of a brain dump on rich data structures in Python and what needs to happen in the very near future. I care about them for statistical computing (I want to build a statistical computing environment that trounces R) and financial data analysis (all evidence leads me to believe that Python is the best all-around tool for the finance space). Other people in the scientific Python community want them for numerous other applications: geophysics, neuroscience, etc. It’s really hard to make everyone happy with a single solution. But the current state of affairs has me rather anxious. And I’d like to explain why..."
statistics
Python
R
visualisation
db
nosql
has:for
september 2011 by howthebodyworks
pandas: a python data analysis library — pandas v0.4.0dev documentation
august 2011 by howthebodyworks
"pandas is a python package providing convenient data structures for time series, cross-sectional, or any other form of “labeled” data, with tools for building statistical and econometric models."
handle data in python intuitively. pass to R for fiddly bits.
Python
R
metadata
statistics
handle data in python intuitively. pass to R for fiddly bits.
august 2011 by howthebodyworks
Automating R Scripts on Amazon EC2 | Travis Nelson's Blog
june 2011 by howthebodyworks
Overview:
How to setup R on an EC2 instance of Ubuntu 11.04 (Natty Narwhal)
How to setup Apache Tomcat 6.0 web server and configuring it with basic authentication so that we can view our output from R on a password protected webpage
How to automate your R scripts to run as a daily cron job.
ec2
R
amazon
HowTo
How to setup R on an EC2 instance of Ubuntu 11.04 (Natty Narwhal)
How to setup Apache Tomcat 6.0 web server and configuring it with basic authentication so that we can view our output from R on a password protected webpage
How to automate your R scripts to run as a daily cron job.
june 2011 by howthebodyworks
EC2 AMI for scientific computing in Python and R « Zero Intelligence Agents
june 2011 by howthebodyworks
Like many people who crunch numbers frequently, I have increasingly been integrating Amazon’s cloud computing services into my daily workflow. In particular, I have been using their elastic cloud computing (EC2) on a regular basis. The service is an excellent way to offload computationally intensive work from your laptop for literally pennies on the dollar.
One drawback that I have found, however, is there are not any obvious pre-configured images, called AMIs, designed for scientific computing in the languages I use most: Python and R. ... Thus began the odyssey of modifying the StarCluster AMI to more fully support scientific computing in Python in R. I have now uploaded and made public the resulting image, which includes several hundred Python and R packages for scientific computing, statistics, machine learning, data mining and visualization.
Python
numpy
R
ec2
amazon
HowTo
One drawback that I have found, however, is there are not any obvious pre-configured images, called AMIs, designed for scientific computing in the languages I use most: Python and R. ... Thus began the odyssey of modifying the StarCluster AMI to more fully support scientific computing in Python in R. I have now uploaded and made public the resulting image, which includes several hundred Python and R packages for scientific computing, statistics, machine learning, data mining and visualization.
june 2011 by howthebodyworks
Cerebral Mastication » Blog Archive » Shell scripting EC2 for fun and profit
june 2011 by howthebodyworks
Lately I’ve been doing some work with creating ad-hoc clusters of EC2 machines. My ultimate goal is to create a simple way to spin up a cluster of EC2 machines for use with Bryan Lewis’s very cool doRedis backend for the R foreach package. But that’s a whole other post. What I was scratching my head about today was that I’d really just like to, with a single command, spin up an EC2 instance, wait for it to come up, and then ssh into it. I do this iteration about 20 times a day when I’m testing things, so it seemed to make sense to shell script it.
To do this, one needs the EC2 command line tools installed on your workstation. In Ubuntu that’s as easy as `sudo apt-get ec2-api-tools`
ec2
mapreduce
R
HowTo
To do this, one needs the EC2 command line tools installed on your workstation. In Ubuntu that’s as easy as `sudo apt-get ec2-api-tools`
june 2011 by howthebodyworks
ExploringDataBlog: Interestingness Measures
may 2011 by howthebodyworks
categorical data has other Shannon-information-like estiamtors of "interestingness"
statistics
r
information_theory
bubble_economy
may 2011 by howthebodyworks
The igraph library for complex network research
may 2011 by howthebodyworks
c graph library with r and python support
networks
r
python
statistics
c
may 2011 by howthebodyworks
One R Tip A Day: R Upgrade on Mac Os X 10.5.5 (Leopard)
april 2011 by howthebodyworks
general instructions for upgrading r and keeping your packages magically installed despite its zany clunky library handling.
r
april 2011 by howthebodyworks
ConnectMV | Process Improvement using Data
march 2011 by howthebodyworks
interesting bunch of courses on statistical data fiddling
r
howto
statistics
march 2011 by howthebodyworks
NetCDF (network Common Data Form)
january 2011 by howthebodyworks
storage search and interchange for large array data
MATLAB
ObjC
Perl
Python
R
Ruby
Tcl
Tk
c
c++
data
datamining
java
january 2011 by howthebodyworks
Using Python (and R) to calculate Linear Regressions
december 2010 by howthebodyworks
best, simplest example of R<->python interaction i've seen
r
python
december 2010 by howthebodyworks
“simply start over and build something better” « Xi'an's Og
november 2010 by howthebodyworks
the case for replacing the language of R with somethig else that can access the same statistical power
r
statistics
compsci
coding
november 2010 by howthebodyworks
Power-law Distributions
november 2010 by howthebodyworks
how to find, in a statistically valid fashion, that your data fits a power law.
statistics
scaling
powerlaw
matlab
python
r
november 2010 by howthebodyworks
R: Entropy and Mutual Information Estimation
october 2010 by howthebodyworks
MI and H estimates
information_theory
r
statistics
october 2010 by howthebodyworks
http://rpy.sourceforge.net/rpy2.html
october 2010 by howthebodyworks
the python<->R interface is being rebooted
r
python
api
statistics
october 2010 by howthebodyworks
ggplot. had.co.nz
october 2010 by howthebodyworks
Tufte-compliant graphing tricks for R
r
visualization
statistics
academic
parsimony
october 2010 by howthebodyworks
Programmers Need To Learn Statistics Or I Will Kill Them All
october 2010 by howthebodyworks
This guy is reliably entertaining. in this case, it's a userful, arrogant rant about what you need to know about stats to push beyond the "mean request time or whatever. highlights:
>The next day we had IBM fixing the problem (turned out to be a single update index command) and we all kept our jobs. That’s what a proper analysis method can do for you.
>still I see software developers begging for gazillions of dollars to buy some crap tool that doesn’t even mention “standard deviation”, but throws “user” around like it’s Dr. Phil treating Robert Downey Jr. for heroin addiction.
r
reference
statistics
performance
compsci
dear_me
>The next day we had IBM fixing the problem (turned out to be a single update index command) and we all kept our jobs. That’s what a proper analysis method can do for you.
>still I see software developers begging for gazillions of dollars to buy some crap tool that doesn’t even mention “standard deviation”, but throws “user” around like it’s Dr. Phil treating Robert Downey Jr. for heroin addiction.
october 2010 by howthebodyworks
Statnet
september 2010 by howthebodyworks
network stats for R
simss
phd
academic
networks
r
datamining
statistics
opensource
september 2010 by howthebodyworks
What is data science? - O'Reilly Radar
june 2010 by howthebodyworks
O'reilly gives an overview of large dataset tools, including some nice machine learning ones
data
mapreduce
opensource
methodology
hps
hadoop
java
r
python
ai
via:datakid
schmooze
academic
phd
june 2010 by howthebodyworks
NetLogo-R-Extension
march 2010 by howthebodyworks
export from netlogo to R
r
netlogo
statistics
visualization
agents
march 2010 by howthebodyworks
ananelson.com
february 2010 by howthebodyworks
interesting looking economics simulation person
schmooze
economics
simulation
visualization
r
ruby
amazon
ec2
ireland
february 2010 by howthebodyworks
Complexity and Social Networks Blog: Goudreau-Hunter tutorial on Advanced Social Network Analysis Using R and statnet
january 2010 by howthebodyworks
how to use r plus some graph libraryies to do statistically useful social network analysis.
r
datamining
networks
statistics
january 2010 by howthebodyworks
Network Analysis with igraph
january 2010 by howthebodyworks
a noice-looking graph analysis library
graph
networks
python
r
statistics
visualization
datamining
howto
january 2010 by howthebodyworks
Guide to Getting Started in Machine Learning | A Beautiful WWW
october 2009 by howthebodyworks
Instant guid to getting started with machine learning
learning
ai
howto
phd
geek
nlp
statistics
r
october 2009 by howthebodyworks
CRAN: Contributed Documentation
september 2009 by howthebodyworks
more free R docs.
r
reference
howto
statistics
september 2009 by howthebodyworks
related tags
academic ⊕ agents ⊕ ai ⊕ amazon ⊕ api ⊕ bubble_economy ⊕ c ⊕ c++ ⊕ coding ⊕ compsci ⊕ data ⊕ datamining ⊕ db ⊕ dear_me ⊕ ec2 ⊕ economics ⊕ france ⊕ geek ⊕ gis ⊕ graph ⊕ gui ⊕ hadoop ⊕ has:for ⊕ howto ⊕ hps ⊕ information_theory ⊕ ireland ⊕ java ⊕ learning ⊕ mapping ⊕ mapreduce ⊕ markov ⊕ matlab ⊕ metadata ⊕ methodology ⊕ netlogo ⊕ networks ⊕ nlp ⊕ nosql ⊕ numpy ⊕ ObjC ⊕ opensource ⊕ parsimony ⊕ performance ⊕ Perl ⊕ phd ⊕ powerlaw ⊕ python ⊕ r ⊖ reference ⊕ ruby ⊕ scaling ⊕ schmooze ⊕ simss ⊕ simulation ⊕ statistics ⊕ svm ⊕ Tcl ⊕ Tk ⊕ vc++ ⊕ via:datakid ⊕ visualisation ⊕ visualization ⊕ wiki ⊕Copy this bookmark: