cshalizi + bad_data_analysis   89

Misuse of hierarchical linear models overstates the significance of a reported association between OXTR and prosociality
Going from a p-value of 10^-16 to 0.027 is --- painful. IFrom the lack of a response, I tend to infer that there's no arguing back...
Prediction: the original association will continue to be cited without correction.
bad_data_analysis  hierarchical_models  human_genetics  evisceration 
27 days ago by cshalizi
The Role of Copulas in the Housing Crisis - Review of Economics and Statistics - Abstract
"Due to its simplicity and familiarity, the Gaussian copula is popular in calculating risk in collaterized debt obligations, but it imposes asymptotic independence such that extreme events appear to be unrelated. This restriction might be innocuous in normal times, but during extreme events, such as the housing crisis, the Gaussian copula might be inappropriate. This paper explores various copula specifications and finds that the degree to which housing prices are related based on the Gaussian copula is too small compared with real housing price data."
to:NB  mortgage_crisis  financial_crisis_of_2007--  finance  copulas  bad_data_analysis  mea_copula  mea_maxima_copula 
4 weeks ago by cshalizi
[citation needed]» Blog Archive » Sixteen is not magic: Comment on Friston (2012)
OH KARL FRISTON NO.

ETA: From Tal's follow-up post, Friston is quite gracious about this.
bad_data_analysis  fmri  to:blog 
4 weeks ago by cshalizi
Stock Market Behavior Predicted by Rat Neurons
"We here report for the first time, to the best of our knowledge, rat motor cortex neurons predicting the behavior of the American stock market. We implanted the motor cortex of the brains of rats with silicon electrodes. Using the correlation technique, we monitored the activity of neurons in our rats while simultaneously tracking the activity of stocks in the U.S. stock market."
have_read  to:NB  neuroscience  finance  statistics  prediction  multiple_testing  bad_data_analysis  funny:geeky  funny:malicious  via:mejn  to:blog  to_teach:undergrad-ADA 
8 weeks ago by cshalizi
Analyzing Released NYC Value-Added Data Part 3 | Gary Rubinstein's Blog
This actually looks more like a job for nonparametric regression, or even relative distribution comparisons, but still...
bad_data_analysis  education  evisceration  to_teach:undergrad-ADA  via:mathbabe 
11 weeks ago by cshalizi
Analyzing Released NYC Value-Added Data Part 2 | Gary Rubinstein's Blog
It's the comparison of the same teacher in the same year on the same subject but in different grades which clinches the model being an EPIC FAIL.
bad_data_analysis  education  evisceration  to_teach:undergrad-ADA  via:mathbabe 
11 weeks ago by cshalizi
Analyzing Released NYC Value-Added Data Part 1 | Gary Rubinstein's Blog
To be clear, the bad data analysis is on the part of whatever hacks came p with the value added model being used here. These results are insane.
bad_data_analysis  evisceration  education  via:mathbabe  to_teach:undergrad-ADA 
11 weeks ago by cshalizi
How Big Pharma Cooks Data: The Case of Vioxx and Heart Disease « mathbabe
"Just as the financial system has to be changed to serve the needs of the people before the needs of the bankers, the drug trial system has to be changed to lower the incentives for cheating (and massive death tolls) just for a quick buck. As I mentioned before, it’s still not clear that they would have made less money, even including the penalties, if they had come clean in 2000. They made a bet that the fines they’d need to eventually pay would be smaller than the profits they’d make in the meantime. That sounds familiar to anyone who has been following the fallout from the credit crisis.
"One thing that should be changed immediately: the clinical trials for drugs should not be run or reported on by the drug companies themselves. There has to be a third party which is in charge of testing the drugs and has the power to take the drugs off the market immediately if adverse effects (like CVT events) are found. Hopefully they will be given more power than risk firms are currently given in finance (which is none)- in other words, it needs to be more than reporting, it needs to be an active regulatory power, with smart people who understand statistics and do their own state-of-the-art analyses – although as we’ve seen above even just Stats 101 would sometimes do the trick."
bad_data_analysis  moral_depravity  medicine  big_pharma  our_decrepit_institutions 
february 2012 by cshalizi
Language Log » Keith Chen, Whorfian economist
"I also worry that it is too easy to find correlations of this kind, and we don't have any idea just how easy until a concerted effort has been made to show that the spurious ones are not supportable. For example, if we took "has (vs. does not have) pharyngeal consonants", or "uses (vs. does not use) close front rounded vowels", would we find correlations there too? I have some colleagues here at the University of Edinburgh, within Simon Kirby's research group, who have run some informal experiments on the data Chen uses to see if dredging up spurious correlations of this kind is easy or hard, and so far they have found it jaw-droppingly easy. (I won't say any more, because I am in the weird position of producing unrefereed telegraphing of unrefereed and informal objections to an unrefereed and unpublished working paper, and it's all getting a bit too weird for me.)"

How many languages are there in Europe? Order of 10^2. How many variables can an economist get cross-country data on? Again, order of 10^2. How many discriminable syntactic features do languages have? Easily order of 10^3 if not come. Conclusion: this is not what I mean when I say that economists should do more data-mining.
economics  bad_data_analysis  linguistics  pullum.geoff 
february 2012 by cshalizi
Stephen Budiansky's Liberal Curmudgeon Blog: U.S. News, the root of all evil
"There's a special place in hell for the perpetrators of this, where I hope the gods of mathematics and reason are devising some exquisite tortures for them—perhaps in the form of endlessly reading Introduction to Statistics and doing the same problem sets over and over through eternity . . ."
academia  why_oh_why_cant_we_have_a_better_press_corps  bad_data_analysis  us_news_and_world_report  budiansky.stephen  funny:malicious 
february 2012 by cshalizi
PLoS ONE: Low Pitched Voices Are Perceived as Masculine and Attractive but Do They Predict Semen Quality in Men?
How does anyone _not_ read this paper and think that they were correlating everything they could until they got a "significant" effect?
--- I am very tempted right now to make this a problem set in ADA, but that's just asking for trouble, yes?
practices_relating_to_the_transmission_of_genetic_information  regression  statistics  bad_data_analysis  via:unfogged  have_read  principal_components  to:blog 
december 2011 by cshalizi
Functional MRI in Health Psychology and beyond: A call for caution
To be clear, "bad_data_analysis" applies to what Yarkoni is talking about, not to Yarkoni.
in_NB  neuroscience  cognitive_science  bad_data_analysis  fmri  yarkoni.tal 
december 2011 by cshalizi
[1108.0833] Temporal statistical analysis on human article creation patterns
Sadly, in this case fitting crappy power laws to the works of Gene Stanley and Laszlo Barabasi is not an_intentional_ joke.
bad_data_analysis  heavy_tails  barabasi.albert-laszlo  stanley.h._eugene  newman.mark  su.shi  have_read  blogged 
august 2011 by cshalizi
Mathematical intimidation; driven by data
I am a bit surprised that he doesn't point out that this year's teacher predicts last year's "value added".
education  mental_testing  bad_data_analysis  bad_management  to:NB  to:blog  via:mathbabe 
july 2011 by cshalizi
Role of test motivation in intelligence testing
Shorter: many people taking pointless tests are not actually motivated to try very hard.  Those who are motivated to try hard on pointless tests do better, and are different people in many ways.  In other breaking news, snow is cold and water is wet.  (To be clear, my "bad data analysis" tag here refers to the IQ-mongers, and not to this paper.)
mental_testing  iq  experimental_psychology  confounding  bad_data_analysis  re:g_paper  to:blog 
april 2011 by cshalizi
Reason Foundation - No Booze? You May Lose
Exercise for the student: Devise at least two reasons why the causality might run from high income to frequent social drinking, rather than vice versa.  (This is I think too elementary to make a good problem for ADA.)
bad_data_analysis  booze  via:tony_lin  causal_inference  to_teach:undergrad-ADA 
april 2011 by cshalizi
Language Log » Lyrical Narcissism?
I'm tempted to make this into a problem set, but it's probably not challenging enough.
bad_data_analysis  music  poetry  cultural_criticism  liberman.mark  debunking  to_teach:undergrad-ADA 
april 2011 by cshalizi
Unions and Growth | Mother Jones
Drum nails this one; thinking that this tells us anything about the effects of the over-all level of union strength (or pro-union laws) is a simple fallacy of composition.  Which economists  are shockingly ready to commit whenever they see a "natural experiment".
bad_data_analysis  economics  unions 
march 2011 by cshalizi
Twin Studies in Behavioral Research (Kamin and Goldberger, 2001)
Now that is how you give these idiots the business...  The last paragraph is a lovely encapsulation of just how foolish the whole enterprise really is.
heritability  human_genetics  behavioral_genetics  evisceration  bad_data_analysis  re:g_paper  kamin.leon  goldberger.arthur 
february 2011 by cshalizi
The Monkey Cage: Lifemanship (Academic edition)
"The bit that is most striking is the 'papers with 1,328 different co-authors.' My inner Diego Gambetta ... suggests that this may in part be a signalling phenomenon. If you are doing research in a field where someone has convincingly demonstrated that the emperor has no clothes, you want to signal as convincingly as possible that you (in contrast to those losers down the hall/in the department across the river/the country/wherever) have results that really stand up. One obvious way to do this is to get the little boy who pointed out the emperor's sartorial shortcomings to co-author a paper with you. Hence, I suspect, lots and lots of people who are eager to co-author something with Ioannidis (and plausibly, many, many more than he might be comfortable co-authoring with).... (NB that I am not suggesting that Iannidis was motivated by such an opportunity; I would be startled and amazed if he had been)."
modest_proposals  academia  why_oh_why_cant_we_have_a_better_academic_publishing_system  bad_data_analysis  funny:malicious  funny:geeky  farrell.henry 
november 2010 by cshalizi
IMF, Economist and Roosevelt Institute on Alesina and Ardagna. « Rortybomb
"To be frank, when I dug into the Alesina and Ardagna paper and finally understood the work their 1.5% primary deficit reduction was doing I wandered around stunned for a day or two. I called a bunch of people I trusted on macroeconomics and tried to see if I was missing something; was our elite discourse, the Sensible People Stuff, really being driven by this?"
bad_data_analysis  economics  macroeconomics  economic_policy 
october 2010 by cshalizi
Problems with the Use of Student Test Scores to Evaluate Teachers
"consequences of students ... not being randomly assigned to teachers within a school. It uses a [Value-Added Measure] to assign effects to teachers after controlling for other factors, but applies the model backwards.... [S]tudents’ fifth grade teachers appear to be good predictors of students’ fourth grade test scores. Inasmuch as a student’s later fifth grade teacher cannot possibly have influenced that student’s fourth grade performance, this curious result can only mean that students are systematically grouped into fifth grade classrooms based on their fourth grade performance. ... The usefulness of value-added modeling requires the assumption that teachers whose performance is being compared have classrooms with students of similar ability (or that the analyst has been able to control statistically for all the relevant characteristics of students that differ across classrooms). ..."
education  debunking  management  bad_data_analysis  via:orzelc 
september 2010 by cshalizi
The Power Law Shop
"I went to a physics conference, and all I got was a lousy power law"
funny:geeky  funny:malicious  heavy_tails  statistics  bad_data_analysis  porter.mason  via:aaron_clauset 
september 2010 by cshalizi
Employers and Credit Scores: An Update | Mother Jones
"So I stand corrected. Credit reporting agencies don't pass along your credit scores to prospective employers. They do pass along your entire credit history and specifically promote it as a way of weeding out problem candidates, but there's no credit score. Just your entire credit history."
moral_depravity  corporations  bad_data_analysis  whats_gone_wrong_with_america  drum.kevin 
august 2010 by cshalizi
Details About The Statistical Analysis Behind The Quiz - Pew Research Center ("How Millenial Are You?" Quiz)
They're not actually doing anything _flagrantly stupid_, except that they persist in talking about discrete generations when all their distributions massively overlap. Model specification FAIL.
bad_data_analysis  generations  to_teach:data-mining  to_teach:undergrad-research  logistic_regression  via:klk 
february 2010 by cshalizi
Luen, Stark: Testing earthquake predictions
Back-up for the Hough review. Also: might make a good mini-project for the data-mining class, though I'd have to teach about spatio-temporal methods (which I should anyway [but where would the time come?]).
earthquakes  hypothesis_testing  bad_data_analysis  stark.philip  statistics  prediction  have_read  to_teach:data-mining  to_teach:undergrad-ADA 
december 2009 by cshalizi
Elster: Excessive Ambitions
"The current financial crisis has brought out a fatal flaw in the foundations of the economic theories that guided economic agents and regulators: the unwarranted claim to precision and robustness. In this article I try to diagnose this flaw and discuss possible remedies. I argue that actual agents are intrinsically less sophisticated than the models assume they are, and that the various proposals to sustain the models by appealing to "as-if rationality" all fail. I next consider behavioral economics as an alternative to the standard models, claiming that while they may allow for successful retrodiction, they do not hold out much promise for prediction. I also discuss the use of statistical models, arguing that they are subject to so many traps and pitfalls that only a handful of elite practitioners can be trusted to use them well. Finally, I offer some speculations to explain the persistence in the economic profession and elsewhere of these useless or harmful models."
economics  social_science_methodology  rational_choice  methodology  elster.jon  bad_data_analysis  to:blog  have_read 
october 2009 by cshalizi
Ezra Klein - The Shoddy Statistics of Super Freakonomics
Jesus fucking Christ. Levitt actually put his name to this?!? This is the kind of thing I cobble together as a "what's gone wrong here?" assignment for my classes; maybe I should give 'em this.

To be explicit: in addition to all the problems Klein notes, the reason we frown on drunk driving more than on drunk walking is that one puts an incompetent in control of several thousand pounds of high-speed machinery, and the other a few hundred pounds of shambling biped. It's the DANGER TO OTHER PEOPLE that matters.
economics  utter_stupidity  debunking  gives_economists_a_bad_name  klein.ezra  levitt.steven  bad_data_analysis  statistics  to:blog 
october 2009 by cshalizi
Zen and the Art of Modern Macroeconomics: The Quest for Perfect Nothingness
The key part is the observation that there is NO POWER to detect reasonably-sized deviations from random walks in only forty years' of data.
economics  macroeconomics  time_series  evisceration  frankel.jeffrey  via:krugman  bad_data_analysis  bad_science  re:your_favorite_dsge_sucks 
august 2009 by cshalizi
Why are doctors still measuring obesity with the body mass index? - By Jeremy Singer-Vine - Slate Magazine
Institutionalizing BMI, despite its ineffectiveness and the existence of superior alternatives. (Which, errr, make it even more obvious that I'm way over-weight, so this isn't rationalization on my part.) Lots of issues here for a data-mining class.
via:?  statistics  debunking  obesity  medicine  epidemiology  to_teach:data-mining  bad_data_analysis  institutions  social_life_of_the_mind 
july 2009 by cshalizi
Does Mr Galton still have a problem? [PDF]
Cultural traits are correlated across space and time, and ignoring this in regressions gives you nonsense. (Of course if you want to see really flagrant examples of this fallacy, look at macroeconomists doing cross-country regressions of growth rates.)
anthropology  regression  bad_data_analysis  via:fionajay  to_read 
march 2009 by cshalizi
naked capitalism: More on the Simply Dreadful Performance of CDOs
There is a serious selection bias issue here (disproportionately looking at the badly-performing loans), but wow does everyone come out of this looking like a dangerous idiot. (Use this as a negative example in the next incarnation of the data-mining class?)
finance  mortgage_crisis  securitization  risk_assessment  via:hilzoy  bad_data_analysis  to_teach:data-mining 
february 2009 by cshalizi
Why Twin Studies Are Problematic for the Study of Political Ideology: Rethinking "Are Political Orientations Genetically Transmitted?"
Good to see some push-back on this. (I was once able to reduce a statistical geneticist to hysterics by reading aloud from a table of heritabilities from the paper being critiqued here. I am not sure if they have quite forgiven me yet.)
political_science  behavioral_genetics  debunking  suhay.liz  ideology  bad_data_analysis  via:henry_farrell 
january 2009 by cshalizi
The Little Professor: Perhaps I am not sufficiently evolved
You guys are not making it any easier for me to maintain my favorable stance towards evolutionary psychology, y'know? (Among everything else, "altruistic" != "egalitarian".)
bad_data_analysis  factor_analysis  evolutionary_psychology  literary_criticism  burstein.miriam 
january 2009 by cshalizi
"Voodoo Correlations in Social Neuroscience" - Vul et al.
Wow, that is some really bad data analysis... A pure instance of the effect Kieran once described as wondering why all the closets you open are so full of your clothes.
fmri  neuroscience  bad_data_analysis  selection_bias  sampling_on_the_dependent_variable  via:gelman  social_neuroscience  vul.edward  harris.christine  winkielman.pitor  pashler.harold 
december 2008 by cshalizi
Op-Ed Contributor - The Test Passes, Colleges Fail - NYTimes.com
File under: people who do not grasp the meaning of the term "controlled experiment". See also: "ecological fallacy", "massive confounding".

Update: I gave in: http://bactra.org/weblog/592.html
bad_data_analysis  academia  standardized_testing  salins.peter  via:klk  blogged 
november 2008 by cshalizi
Structure+Strangeness: Power laws in the mist
Hallucinating power-laws in the interactome. Complete with a sound re-analysis of the data!
heavy_tails  bioinformatics  molecular_biology  interactome  clauset.aaron  barabasi.albert-laszlo  bad_data_analysis 
october 2008 by cshalizi
SSRN-The Promise and Perils of Credit Derivatives by David Skeel, Frank Partnoy
"[E]ither [collateralized debt obligations] are being used to arbitrage a substantial price discrepancy in the fixed income markets or CDOs are being used to convert existing fixed income instruments that are accurately priced into new fixed income instruments that are overvalued. The first possibility assumes the existence of a substantial market inefficiency, perhaps the most substantial inefficiency ever found in the finance literature. The second possibility seems more likely. In other words, either CDOs are evidence of a substantial and pervasive market imperfection, or they are being used to create one."
via:?  financial_speculation  financialization  bad_data_analysis  credit_derivatives  skeel.david  partnoy.frank  finance 
september 2008 by cshalizi
Trending Toward Inanity -- In These Times
"If this book were turned in as the final to an entry-level statistics class, Penn would not only be failed, but the professor might well retire in shame." Hmm, how to design a class like that?
book_reviews  evisceration  polling  us_politics  bad_data_analysis  penn.mark  klein.ezra  to_teach:data-mining 
august 2008 by cshalizi
Novembre and Stephens, "Interpreting principal component analyses of spatial population genetic variation" (Nature Genetics)
"We find that gradients and waves observed in ... maps resemble sinusoidal mathematical artifacts that arise generally when PCA is applied to spatial data, implying that the patterns do not necessarily reflect specific migration events."
genetics  human_genetics  statistics  principal_components  spatial_statistics  stepping_stone_model  cavalli-sforza  via:arthegall  bad_data_analysis  to_teach:data-mining  to:NB  to_teach:undergrad-ADA 
may 2008 by cshalizi
The Monkey Cage: Demography Is Not King, or Why David Brooks Is a Hedgehog, Not a Fox
"Brooks knows one big thing: that the world can be easily divided into groups (preferably two) and these groups are really, really different for each other. ... Brooks also desperately wants to infer political divisions from sociological divisions."
brooks.david  utter_stupidity  evisceration  us_politics  sides.john  bad_data_analysis  thinking_in_stereotypes  historical_materialism  class_struggles_in_america 
may 2008 by cshalizi
"A Note on the Cobb-Douglas Function": The Review of Economic Studies, Vol. 30, No. 2, (1963 ), pp. 93-94
Shorter Simon & Levy (1963): I am sickened by the weakness of your model's goodness-of-fit test. (Does make me reconsider the many papers I still see using Cobb-Douglas...)
econometrics  simon.herbert  levy.ferdinand  cobb_douglas_production_function  bad_data_analysis  linear_regression  to_teach  via:slaniel  to_teach:undergrad-ADA  have_read 
april 2008 by cshalizi
Matthew Yglesias: Swing Voters
For MY's "1.1 percent", read "2.1 percent". The point, however, stands.
bad_data_analysis  us_politics  yglesias.matthew 
february 2008 by cshalizi
« earlier      

related tags

academia  adolescence  anthropology  antifeminist_idiocy  astrology  bad_data_analysis  bad_management  bad_science  bad_science_journalism  barabasi.albert-laszlo  behavioral_genetics  bibliometry  big_pharma  bioinformatics  blogged  blogging  books:noted  book_reviews  booze  brooks.david  brumm.maria  budiansky.stephen  burke.timothy  burstein.miriam  cats  causal_inference  cavalli-sforza  chalko.tom  classifiers  class_struggles_in_america  clauset.aaron  climate_change  coates.ta-nehisi  cobb_douglas_production_function  cognitive_science  confounding  copulas  corporations  credit_derivatives  cultural_criticism  cultural_transmission  data_mining  debunking  diversity  DNA_testing  douthat.ross  drum.kevin  earthquakes  econometrics  economics  economic_policy  education  elster.jon  entropy  epidemiology  estimation  evisceration  evolutionary_psychology  experimental_psychology  factor_analysis  farrell.henry  fermi_problems  fienberg.steve  finance  financialization  financial_crisis_of_2007--  financial_speculation  fmri  forensics  frankel.jeffrey  freese.jeremy  functional_connectivity  funny:academic  funny:geeky  funny:malicious  funny:pointed  galtons_problem  gelman.andrew  generations  genetics  geoengineering  geology  gives_economists_a_bad_name  gives_physicists_a_bad_name  goldberger.arthur  happiness  harrapan_civilization  harris.christine  have_read  healy.kieran  heart_attack  heavy_tails  heritability  hierarchical_models  historical_materialism  humanities  human_genetics  hypothesis_testing  ideology  inequality  information_theory  injustice  institutions  interactome  in_NB  iq  kamin.leon  kanazawa.satoshi  klein.ezra  levitt.steven  levy.ferdinand  liberman.mark  libraries  linear_regression  linguistics  literary_criticism  logistic_regression  loud_and_prolonged_applause  macroeconomics  management  mea_copula  mea_maxima_copula  medical_statistics  medicine  mental_testing  methodological_advice  methodology  modest_proposals  molecular_biology  moral_depravity  mortgage_crisis  multiple_comparisons  multiple_testing  murray.charles  music  natural_history_of_truthiness  neuroscience  newman.mark  nielsen.michael  obesity  our_decrepit_institutions  over-fitting  p-values  parapsychology  partnoy.frank  pashler.harold  penn.mark  pictish  pierrehumbert.raymond  poetry  political_science  polling  porter.mason  practices_relating_to_the_transmission_of_genetic_information  prediction  principal_components  psychoceramics  pullum.geoff  racism  racist_idiocy  rational_choice  re:g_paper  re:homophily_and_confounding  re:neutral_model_of_inquiry  re:your_favorite_dsge_sucks  regression  renewable_energy  risk_assessment  risk_perception  routinization  running_dogs_of_reaction  salins.peter  salmon  sampling_on_the_dependent_variable  science  science_policy  securitization  selection_bias  self-organized_criticality  self-promotion  sex_differences  shot_after_a_fair_trial  sides.john  simon.herbert  skeel.david  social_life_of_the_mind  social_media  social_neuroscience  social_psychology  social_science_methodology  spatial_statistics  sports  standardized_testing  stanley.h._eugene  stark.philip  statistics  stepping_stone_model  stochastic_processes  su.shi  suhay.liz  text_mining  theory_of_mind  the_american_dilemma  thinking_in_stereotypes  time_series  to:blog  to:NB  to_read  to_teach  to_teach:data-mining  to_teach:undergrad-ADA  to_teach:undergrad-research  unions  us_news_and_world_report  us_politics  utter_stupidity  via:?  via:aaron_clauset  via:arthegall  via:coates.ta-nehisi  via:djm1107  via:email  via:fionajay  via:flint_riemen  via:gelman  via:henry_farrell  via:hilzoy  via:klk  via:krugman  via:mathbabe  via:mejn  via:orzelc  via:slaniel  via:tony_lin  via:unfogged  violence  visual_display_of_quantitative_information  voting  vul.edward  whats_gone_wrong_with_america  why_oh_why_cant_we_have_a_better_academic_publishing_system  why_oh_why_cant_we_have_a_better_press_corps  winkielman.pitor  yarkoni.tal  yglesias.matthew 

Copy this bookmark:



description:


tags: