jtth + statistics   72

R Guide -- Analysis of Variance
Using analysis of variance (ANOVA) in psychological data analysis with R.
data  learning  r  statistics 
february 2012 by jtth
Using Python (and R) to draw a Heatmap from Microarray Data
[c]
This document follows on from this page which uses R to analyse an Acute lymphocytic leukemia (ALL) microarray dataset, producing a heatmap (with dendrograms) of genes differentially expressed between two types of leukemia. On this page I deal with how to do this using Python and RPy.
r  python  statistics  visualization 
february 2010 by jtth
2009/2010 OMOP Cup: Methods Competition
We are providing a large dataset that resembles observational data that can be extracted from insurance claims or electronic medical records. Your method will identify relationships in the data between drugs and medical outcomes (adverse events). The goal is to develop methods that correctly identify true drug-event associations while minimizing false positive findings. Methods will be evaluated by how closely they predict the known relationships that exist in the data.
statistics  datamining 
january 2010 by jtth
Programmers Need To Learn Statistics Or I Will Kill Them All
I have a major pet peeve that I need to confess. I go insane when I hear programmers talking about statistics like they know shit when it’s clearly obvious they do not. I’ve been studying it for years and years and still don’t think I know anything. This article is my call for all programmers to finally learn enough about statistics to at least know they don’t know shit. I have no idea why, but their confidence in their lacking knowledge is only surpassed by their lack of confidence in their personal appearance.
programming  math  statistics 
january 2010 by jtth
The R programming language for programmers coming from other programming languages
I have written software professionally in perhaps a dozen programming languages, and the hardest language for me to learn has been R. The language is actually fairly simple, but it is unconventional. These notes are intended to make the language easier to learn for someone used to more commonly used languages such as C++, Java, Perl, etc.
manual  business  database  reference  howto  language  coding  intro  r  documentation  statistics  programming  tutorial  geek  stats  information  tutorials  notes  languages  math 
august 2009 by jtth
Orange - Data Mining Fruitful & Fun
Open source data visualization and analysis for novice and experts. Make your own data analysis schemata by visual programming or Python scripting. Extensions for bioinformatics and text mining. Comprehensive, flexible and fast.
software  visualization  programming  tools  opensource  python  ai  code  research  development  learning  statistics  c++  api  database  algorithms  data  analysis  library  datamining  framework  clustering  machine-learning  machine_learning  classification  machinelearning  mining  data_mining 
july 2009 by jtth
You should follow me on Twitter | Dustin Curtis
I actually tried many more permutations than I show here. I only discuss the most interesting ones below and describe my thought process along the way.
blog  web  design  psychology  articles  writing  webdesign  ui  usability  blogging  data  inspiration  language  statistics  communication  optimization  marketing  conversion  testing  twitter  persuasion  copywriting  socialmedia  action  ux  clickthrough  abtesting  measurement  calltoaction  wording 
july 2009 by jtth
Median Split
The applet on this page illustrates in a regression context the negative effects of using median splits in data analysis.
statistics 
may 2009 by jtth
Mechanical Turk: The Demographics ~~ (A Computer Scientist in a Business School)
One of the common misbeliefs about Mechanical Turk is that it is a virtual sweatshop, essentially taking advantage of poor people in third world countries that are doing tedious tasks for pennies. Therefore, many people are afraid of outsourcing research tasks on Mechanical Turk, being afraid that the results will be either of very poor quality, or they will not be representative of the actual U.S. population.
mechanicalturk  statistics  amazon  research  crowdsourcing  ux  webapp  stats  mechanical  turk  mturk  demographics 
april 2009 by jtth
Voodoo Correlations: Have the Results of Some Brain Scanning Experiments Been Overstated?: Scientific American
VUL: We use that term as a humorous way to describe mysteriously high correlations produced by complicated statistical methods (which usually were never clearly described in the scientific papers we examined)—and which turn out unfortunately to yield some very misleading results. The specific issue we focus on, which is responsible for a great many mysterious correlations, is something we call “non-independent” testing and measurement of correlations. Basically, this involves inadvertently cherry-picking data and it results in inflated estimates of correlations.
sciam  article  neuroscience  cogsci  science  method  fmri  mri  paper  voodoo  statistics 
february 2009 by jtth
UCI Machine Learning Repository
We currently maintain 176 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. Our old web site is still available, for those who prefer the old format. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians. We have also set up a mirror site for the Repository.
reference  research  ai  learning  cs  datamining  statistics  library  data  machine-learning  artificial  machine  database  resource  mining  repository  clustering  directory  dataset  ga  datasets  data_mining  ml  machine_learning 
december 2008 by jtth
WSJ.com
Your parents might have worried when you chose Philosophy or International Relations as a major. But a year-long survey of 1.2 million people with only a bachelor's degree by PayScale Inc. shows that graduates in these subjects earned 103.5% and 97.8% more, respectively, about 10 years post-commencement. Majors that didn't show as much salary growth include Nursing and Information Technology.
education  work  money  statistics  School  career  college  usa  jobs  salary  salaries 
december 2008 by jtth
The Computer Language Benchmarks Game
Benchmarks of most languages, along with varying compilers, JITs, addons, etc. Very useful. I think I'm going to be learning Haskell after seeing it.
speed  statistics  software  reference  programming  shootout  benchmark 
november 2008 by jtth
Presidential Election 2008 FAQ
This is an FAQ (Frequently Asked Questions list) for the 2008 United States Presidential Election. I need to disclose up front that I am an Obama supporter. However, with the exception of the very last question, this FAQ is designed as a collection of factual information (such as the latest poll results) and of analysis that is as objective as possible.
usa  statistics  reference  president  politics  obama  norvig  interesting 
october 2008 by jtth
Open source Clustering software
Cluster 3.0 provides a Graphical User Interface to access to the clustering routines. It is available for Windows, Mac OS X, and Linux/Unix. Python users can access the clustering routines by using Pycluster, which is an extension module to Python.
algorithm  algorithms  analysis  cluster  code  comparison  complexity  clustering  software  opensource  statistics  python  datamining  programming  tools 
july 2008 by jtth
R Graph Gallery :: Home
The R Graph Gallery aims to present several different graphics fully created with the programming environment R [http://www.r-project.org]. Graphs are gathered in a MySQL database and browsable thanks to PHP. (linked via http://addictedtor.free.fr/graphiq
r  programming  statistics  resource  reference  gallery 
april 2008 by jtth
What Every Computer Scientist Should Know About Floating-Point Arithmetic
This paper presents a tutorial on those aspects of floating-point that have a direct impact on designers of computer systems. It begins with background on floating-point representation and rounding error, continues with a discussion of the IEEE floating-p
programming  floating-point  reference  floating  point  cs  math  computing  computerscience  academic  development  documentation  document  engineering  optimization  numerical  numbers  number  model  mathematics  manual  science  sun  statistics  technology  theory  useful 
april 2008 by jtth
Memories of John W. Tukey
This web site is dedicated to gathering recollections and reflections on John Tukey from those who knew him and from those who would like to comment on his life or work.
tukey  belllabs  bell  analysis  math  statistics  people  person  tribute 
february 2008 by jtth
Documentation -
This is an official center for all documentation to NumPy and SciPy.
api  code  docs  library  mathematics  python  reference  research  statistics  science  tutorial  documentation  numpy  scipy 
october 2007 by jtth
SciPy -
SciPy (pronounced "Sigh Pie") is open-source software for mathematics, science, and engineering. It is also the name of a very popular conference on scientific programming with Python.
academia  application  code  coding  compsci  computation  computer  computers  computing  cs  lab  graphing  graphic  graph  freeware  free  framework  extension  engineering  developer  language  libraries  library  math  mathematics  maths  matrix  modeling  number  numbers  open  open-source  opensource  optimization  oss  package  plot  programming  python  research  resources  science  scientific  visualization  work  scripting  software  source  statistic  statistics  stats  tutorials  tools 
september 2007 by jtth
R: Statistical Software for Psychology Research
Doing psychology research in R. Insane amount of material here. Insane.
r  howto  guide  tutorial  statistics  stats  math  maths  mathematics  introduction  intro  graph  graphing  graphics  information  reference  research 
september 2007 by jtth
R Tutorial
Union College Dept. Of Math tutorial of R.
R  statistics  tutorial  programming  tutorials  math  reference  software  development  guides 
september 2007 by jtth
The R Project for Statistical Computing
R is a free software environment for statistical computing and graphics developed by AT&T research labs and available as an open source programming system.
r  programming  statistics  analysis  science  data 
march 2007 by jtth
GraphPad InStat. Instant biostatistics
Most statistics programs are designed by statisticians, for statisticians. These programs are feature-packed and powerful, but can overwhelm scientists with thick manuals, obscure statistical jargon and high prices. GraphPad InStat is different. InStat is
statistic  application  graphing  graph  statistics  science 
march 2007 by jtth

related tags

abtesting  academia  academic  action  addons  adult  ai  algebra  algorithm  algorithms  amazon  america  analysis  ancient  answers  apa  api  apple  applets  application  arstechnica  article  articles  artificial  bayesian-networks  bell  belllabs  benchmark  bible  blog  blogging  book  books  bridge  business  c++  calculus  calltoaction  career  census  cheatsheet  classification  clickthrough  cluster  clustering  code  coding  cogsci  collaborative  college  commands  communication  communities  community  comparison  complex  complexity  compsci  computation  computer  computers  computerscience  computing  conversion  copywriting  crowdsourcing  cs  culture  data  database  datamining  dataset  datasets  data_mining  debian  decisiontrees  democracy  demographics  depression  design  developer  development  directory  distribution  docs  document  documentation  earth  ebook  ecology  economics  education  election  election2004  elections  encyclopedia  energy  engineering  error  extension  floating  floating-point  fmri  food  forum  framework  free  freeware  future  ga  gallery  game  games  gaming  geek  geography  geometry  government  graph  graphic  graphics  graphing  graphs  great  gui  guide  guides  hacking  hci  history  howto  image  imported  information  inspiration  instruction  interactive  interesting  interface  intro  introduction  java  jobs  lab  language  languages  latex  layout  learning  lecture  libraries  library  life  linguistics  links  linux  list  mac  machine  machine-learning  machinelearning  machine_learning  manual  map  Maps  marketing  marketshare  math  mathematics  mathoverflow  maths  matrix  measurement  mechanical  mechanicalturk  method  methodology  methods  microsoft  mining  ml  mmo  mmorpg  model  modeling  money  mri  mturk  network  networking  networks  neuroscience  nlp  norvig  notes  number  numbers  numerical  numpy  obama  occupation  open  open-source  opensource  optimization  oss  overflow  package  paper  patterns  people  person  persuasion  plot  point  politics  power  president  price  prices  probability  programming  psychology  python  questions  r  reference  regression  repository  research  resource  resources  roman  rome  salaries  salary  school  sciam  science  scientific  scipy  scripting  search  sem  semanticweb  sets  shootout  simulations  social  socialmedia  socialnetworking  society  software  source  speed  stackoverflow  startup  statistic  statistics  stats  Story  study  sun  teaching  tech  technology  testing  tex  textbooks  theory  tool  tools  trace  tree  tribute  tukey  turk  tutorial  tutorials  twitter  ubuntu  ui  university  US  usa  usability  usage  useful  user  ux  virtualworlds  visual  visualization  voodoo  voting  warcraft  web  web2.0  webapp  webdesign  webdev  weka  wiki  wikibook  wikipedia  wikiversity  windows  wording  work  worldofwarcraft  wow  writing  zipf  zipfslaw 

Copy this bookmark:



description:


tags: