infovore + data + statistics   7

The Seven Secrets of Successful Data Scientists : Dataspora Blog
"...don’t confuse this kind of data exploration, where the goal is to size up the data, with building proper data plumbing, where you want robustness and maintainability. Perl and bash scripts are nice for the former, but can be a nightmare for building data pipelines." Lots of good stuff in this article; this was a highlight.
bigdata  data  datamining  statistics  machinelearning 
september 2010 by infovore
The Three Sexy Skills of Data Geeks : Dataspora Blog
"Statisticians’ sex appeal has little to do with their lascivious leanings ... and more with the scarcity of their skills. I believe that the folks to whom Hal Varian is referring are not statisticians in the narrow sense, but rather people who possess skills in three key, yet independent areas: statistics, data munging, and data visualization. (In parentheses next to each, I’ve put the salient character trait needed to acquire it)."
data  analytics  visualization  statistics  datamining  maths  analysis  trends 
june 2009 by infovore
Fitting curves to data using Ruby and the GNU Scientific Library
"If you need to perform data analysis, provide graphics for your users in your webapp, or produce high quality plots I encourage you to investigate the combination of ruby, GSL and GNUPlot." Looks good. I should probably give this a poke some time; could come in handy.
gsl  graphing  plotting  data  analysis  statistics  ruby  visualisation 
december 2008 by infovore
You Know What I Did Last Summer? (Frumination)
"I spent 10 weeks last Summer as an intern on the strategy team of Transport for London's (TfL) London Rail division.... My general task was to help London Rail start to make use of the oceans of data spewing out of the Oyster smartcard ticketing system, but I spent the bulk of my time working on a project that came to be titled Oyster-Based Performance Metrics for the London Overground. I've posted my final report and slides and outline for the presentation I gave to TfL executive management." Some interesting data and information here.
travel  tfl  statistics  oyster  overground  data  graphs 
november 2008 by infovore
Chris' Survival Horror Quest
"Some people believe that there's no correlation between quality and sales, and thus think that the way to make money is to make things that are easily marketable (read: licenses). Game developers themselves usually argue that sales above a certain level require a game to be sufficient quality. I decided to see which of these perspectives was correct for the Playstation 2 era." Datanalysismachinego!
data  visualization  statistics  sales  games  quality  analysis 
september 2008 by infovore
Winner of the Personal Visualization Project is… | FlowingData
"The winner is Tim Graham who took manual personal data collection to another level. From email spam, to beverage consumption, to aches and pains, Tim embraced the spirit of self-surveillance. He even made his personal data available in the forums." Dataviz overload!
information  infoviz  dataviz  statistics  reporting  data  analysis  personal 
september 2008 by infovore
Data Feeds at BookMooch
Wow. Bookmooch has some seriously comprehensive data available from it, if you fancing munging their entire dataset (security-sanitised, obviously).
bookmooch  data  api  xml  statistics 
february 2008 by infovore

Copy this bookmark:



description:


tags: