arthegall + via:cshalizi   129

Summary of rules from "Elements of Programming Style," 1974 | Beyond The Beyond | Wired.com
"Avoid temporary variables"??? (A lot of the rest is reasonable, but could be subsumed into a reasonable version of 'lint' for whatever language you're working in. Is there an R lint?)
lint  programming  tips  R  via:cshalizi 
6 days ago by arthegall
Omniscient Gentlemen of The Atlantic | | Notebook | The Baffler
"The din of younger colleagues tapping keyboards is never soothing, but sitting in the press room of the Ideas Forum felt like a human rights violation. What could anyone write about something so tyrannically dull— other than an angry elegy for the massacre of meaning?" --- A little purple, but still pretty funny.
humor  journalism  the-atlantic  mo-tkacik  via:cshalizi  death-of-print 
5 weeks ago by arthegall
PeteSearch: Keep the web weird
so, two comments:
(1) "computable web" != "canonical names." Common mistake.
(2) the "ambiguity" of reference that [some of] the semantic web people are working to eliminate here isn't the ambiguity he's describing ("My ... apartment has been described as being in the Lower Haight, Duboce Triangle, or Upper Castro, depending on who you ask..."), where one "thing" can have multiple names -- but the exact opposite, where one name refers to many (different) things, in different contexts. Imagine if "Duboce Triangle" was the name of a neighborhood *and* a newspaper about that neighborhood, *and* also the collective name for all the people living within 2 miles of Warden's apartment. A person might even use the same noun (or noun phrase) to refer to all three things, within the space of a single unit of text. It'd get pretty confusing. Using "canonical" names is a (admittedly somewhat simplistic) attempt to get around *that* problem, rather than the one he's describing; and saying that you "embrace" the ambiguity that's latent here is equivalent to saying that you don't care if the web is unusable to certain groups of people (e.g. scientists, researchers) who *are* concerned with avoiding this sort of ambiguity. "People searching for movie times" is just a test-case for "people searching for data about a 'gene'."

Also, I love someone who writes critically about Wolfram (.data, Alpha, and all the rest) as much as the next guy -- but saying, "the web is written for humans to read" is pretty laughable when it's not coming out of the mouth of a guy named "Firefox." The web is written for your web browser, and no number of SXSW presentations will change that.

</rant>
via:cshalizi  web  internet  semanticweb  tagging  rant  folksonomy 
10 weeks ago by arthegall
[1203.0697] Learning High-Dimensional Mixtures of Graphical Models
"We now propose a method for learning the mixture components given n i.i.d. samples y_n
drawn from a graphical mixture model P(y). Our method proceeds in two stages. First, we estimate the graph G_∪ := U_{r}^{h=1} G_h, which is the union of the Markov graphs of the mixture. This is accomplished via a series of rank tests. Note that in the special case when G_h ≡ G_∪, this also gives the graph estimates of the component models. We then use the graph estimate hat{G}_∪ to obtain the pairwise marginals of the respective mixture components via a spectral decomposition method. Finally, we use the Chow-Liu algorithm to obtain tree approximations {T_h}_h of the individual mixture components." -- To do: review how this works in the context of gene expression experiments for transcription factor regulatory relationships, which are (presumably) mixtures of a couple different underlying models or modes.
gene-expression  bioinformatics  research-article  arxiv  via:cshalizi  graphical-models  mixture-models  machinelearning 
11 weeks ago by arthegall
Cthulhu Tract | By Fred Van Lente and Steve Ellis
"Stars are WRONG" -- (so it would be a bad idea, to print a few of these onto cards and start handing them out at train stations?)
via:cshalizi  cthulu  humor  awesome  comic  old-ones 
december 2010 by arthegall
John Baez, Mike Stay, "Algorithmic Thermodynamics"
"Charles Babbage described a computer powered by a steam engine; we de- scribe a heat engine powered by programs! We admit that the significance of this line of thinking remains a bit mysterious."
algorithms  computerscience  theory  john-baez  thermodynamics  probability  complexity  entropy  via:cshalizi 
october 2010 by arthegall
"Experiment in GP based on ImageMagick" (Notional Slurry)
"If you skip this step—even with a downloaded library—you’re a baaaaaad genetic programmer. Turn in your Jaws and go back to machine learning land."
humor  genetic-programming  machinelearning  joke  via:cshalizi  by:Vaguery  imagemagick  testing 
september 2010 by arthegall
Leibler, Kussell, "Individual histories and selection in heterogeneous populations" PNAS
"Using “individual histories”—temporal sequences of all reproduction events and phenotypic changes of individuals and their ancestors—we present an alternative approach to quantifying selection in diverse experimental settings..."
via:cshalizi  pnas  selection  history  population-effects  genomics  research-article  statistics 
july 2010 by arthegall
Harrington, Hero, "Spatio-Temporal Graphical Model Selection" (arXiv)
"We consider the problem of estimating the topology of spatial interactions in a discrete state, discrete time spatio-temporal graphical model where the interactions affect the temporal evolution of each agent in a network."
via:cshalizi  lasso  graphical-models  research-article  machinelearning  spatial-data  temporal-data  arxiv 
july 2010 by arthegall
Allman et al. "Parameter identifiability in a class of random graph mixture models" (arXiv)
"We prove identifiability of parameters for a broad class of random graph mixture models. These models are characterized by a partition of the set of graph nodes into latent (unobservable) groups. The connectivities between nodes are independent random variables when conditioned on the groups of the nodes being connected. In the binary random graph case, in which edges are either present or absent, these models are known as stochastic blockmodels and have been widely used in the social sciences and, more recently, in biology. " -- To read, in the context of the blockmodeling paper from a few weeks back.
blockmodeling  graph  statistics  arxiv  research-article  parameters  identifiability  via:cshalizi 
june 2010 by arthegall
5 Creepy Ways Video Games Are Trying to Get You Addicted | Cracked.com
Fascinating ... and, I admit, a little creepy. I say this as someone who accidentally played four hours of video games last night. (Suddenly I looked up, it was 1am, and I was all, "whoa...?") John, this goes back to the "washing the dishes" game we were talking about a long time ago? Also, I kinda wonder: sure, it's creepy when a *game* does this to hook you. But I kinda wonder if some of these psychology-based game-honed ideas couldn't be put to use in a classroom setting with students. How could that work? Tokens? Medals? "Coins? (a la Mario?) Lots of little questions? Probably effective teachers, especially at the grade school level, already use a combination of these techniques...?
teaching  games  via:cshalizi  psychology  bf-skinner  addiction 
april 2010 by arthegall
Jill North, "An Empirical Approach to Symmetry and Probability"
"I argue that a priori symmetries need never constrain our probability attributions, even for initial credences."
philosophy  probability  symmetry  belief  via:cshalizi 
april 2010 by arthegall
"Pictish writing?" (Language Log)
"I certainly don't mean to suggest that the ancient Picts generated their petroglyphs using throws of 7d6." -- Liberman reveals his background as a D&D-playing nerd (one of us, one of us....)
via:cshalizi  language  humor  picts  writing  entropy  information  history  dungeons-and-dragons  obscurely-referential 
april 2010 by arthegall
Unhappy Hipsters
"You can come out when you can properly explain the differences between Modernist architecture and postmodern ornamentation."
via:cshalizi  humor  architecture  design  goddamn-hipsters  blog 
february 2010 by arthegall
Wesley C. Salmon - Wikipedia, the free encyclopedia
Probably should have bought and read one of his books (Scientific Explanation and the Causal Structure of the World) a long time ago.
via:cshalizi  causality  science  philosophy  philosophy-of-science  book  recommendation 
january 2010 by arthegall
Malmgren, Hofman, Amaral, and Watts. "Characterizing Individual Communication Patterns" (arXiv)
"Here, we propose a model of individual e-mail communication that is sufficiently rich to capture meaningful variability across individuals, while remaining simple enough to be interpretable. We show that the model, a cascading non-homogeneous Poisson process, can be formulated as a double-chain hidden Markov model..."
poisson-process  arxiv  research-article  communication  project  duncan-watts  email  markov-models  network  via:cshalizi  rediscovering-what-chl-already-knew 
december 2009 by arthegall
Language Log » The business of newspapers is news
"In other words, the more prestigious the journal (as measured by its "impact factor"), the less likely the genetic association studies it publishes are to be replicated." -- a great line. But to be honest, I'm less concerned by this kind of thing today than I was (probably) a year ago. First of all, I expect it from Wade. But second, I don't have that high an opinion of the vetting process that goes into journal articles. Peer review is not the arbiter of truth, it's the beginning of the investigation -- plenty of carefully-worded refereed journal articles (*especially* in Nature & Science) are also over-ambitious, breathlessly phrased, and written to attract (more) funding. Whatever the business of newspapers is, it should be clear that the business of [some] journals is not science, but *surprising* science. That's often a different thing.
publishing  science  culture  newspapers  nyt  nicholas-wade  via:cshalizi 
december 2009 by arthegall
"The hunt for the Hat Gene" (Language Log)
I've said it once, I'll say it again -- *You Cannot Trust the NYT Science Writers.* They're not in the business of explaining science; they are in the business of sensationalizing science for the readers who happened to have wandered off of the NYT Fashion section and are wondering where they are.
via:cshalizi  genes  phenotype  science  journamalism 
november 2009 by arthegall
Quick-R: Home Page
"R for ... SPSS users." Need to send this to Rachel...
r  via:cshalizi  tutorial  statistics  software  reference  programming 
october 2009 by arthegall
Lee & Wasserman, "Spectral Connectivity Analysis"
A reasonable tutorial? I still think that some of the local-linear-embedding techniques would work well in the context of disk or storage layout in a database setting.
arxiv  local-linear-embedding  research-article  spectral-graph-theory  datamining  via:cshalizi 
october 2009 by arthegall
My Combinatorics Problem, Let Me Show You It
These are the Bell Numbers (http://mathworld.wolfram.com/BellNumber.html), no? Or are the letters ordered in some way (alphabetically, or something)?
bell-numbers  via:cshalizi  question  combinatorics 
september 2009 by arthegall
"Entertainment Values: Will Capitalism Go To Hollywood," The Unofficial Paul Krugman Web Page
What's the old adage about wrestling with a pig? At any rate, a reasonable takedown of Kevin Kelly: "If there is something new in the writings of Kelly and other cyberprophets, it is the fact that they don't just predict a future in which the curves slope the wrong way, they endorse it. That is, along with the gee-whiz pronouncements about how the economy supposedly works goes a pronounced libertarian bent, a belief that the new economy is too dynamic, organic, or whatever to be regulated from above."
kevin-kelly  paul-krugman  review  takedown  humor  economy  capitalism  technology  futurism  via:cshalizi 
september 2009 by arthegall
Vazquez & Farinelli, "Gauge Invariance, Geometry and Arbitrage" (arXiv)
"We show that our arbitrage measure is invariant under changes of num\'{e}raire and equivalent probability. Moreover, such measure has a geometrical interpretation as a gauge connection." -- Ahhh.
via:cshalizi  stochastic-processes  arxiv  research-article  geometry  arbitrage  finance  markets 
september 2009 by arthegall
"The Impact Factor’s Matthew Effect" (orgtheory.net)
I want to look at this a little more closely after the thesis defense, but my initial thoughts are that this doesn't seem to control very well for the exact *date* of publication, right? (Besides a quick note in the paper that most duplicate papers appear to be published in the same year, a quick scan reveals no other mention of publication time -- but maybe I missed something.) I wonder how often the publication in the "higher impact factor" journal is actually the *first* publication, chronologically.
publication  science  matthew-effect  via:cshalizi  sociology  citation 
august 2009 by arthegall
Probabilistic Graphical Models - The MIT Press
For some reason, I keep seeing Nir Friedman sitting outside of the Starbucks down the street from me, talking to people I know. It was E.F.'s former post-doc, last week.
nir-friedman  humor  we-used-to-pass-the-salt-in-cambridge  book  via:cshalizi  graphical-models 
august 2009 by arthegall
Robert Berk, "Limiting Behavior of Posterior Distributions when the Model is Incorrect"
Can I ask a question? Is it the case that Brad DeLong's "Rosencrantz and Guildenstern Flip a Coin" example (http://delong.typepad.com/sdj/2009/03/cosma-shalizi-takes-me-to-probability-school-or-is-it-philosophy-school.html) is a particular instance of the example given in the last paragraph of this paper? (the answer, I am subsequently told, is "yes.")
via:cshalizi  baysian-methods  inference  modeling  research-article  statistics 
august 2009 by arthegall
"Why Andrew Sullivan is right about Megan McArdle, but not in the way he thinks." (The Inverse Square Blog)
The rest of the criticism may be (probably is) mostly on-target, but his criticism of McArdle's potted one-graf description of academic research vs. pharmaceutical work misses the mark pretty widely. Far from being "laughable," I'd say it's actually a pretty reasonable 100,000 ft. analogy, and I'd be truly surprised if Sue Lindquist or anyone else in her lab disagreed with it. What are laughable are the little blurbs made by his (Levenson's) students at the link -- for instance, "Scientists can cure Parkinson’s Disease in yeast – can they extend this to humans?" To the extent that the Lindquist lab will "cure Parkinson's" in humans, it will be in a theoretical sense. They won't be producing any drugs in our lifetimes. Also, "researchers pounding molecules into receptors" is a pretty poor description of the WIBR, and yeah, I can throw a stone from my office and hit them too. Grrrrr.
idiocy  whitehead  biology  research  science  drug-discovery  markets  health-care  via:cshalizi 
july 2009 by arthegall
Wetware - Bray, Dennis - Yale University Press
To be bought, read, and stored next to the Uri Alon books on my bookshelf.
book  computation  molecular-biology  genetics  systems-biology  via:cshalizi 
may 2009 by arthegall
"Good is dead" (Language Log)
"When discussing complex systems, like brains and other societies, it is easy to oversimplify: I call this Occam's lobotomy." -- Awesome.
via:cshalizi  ij-good  researcher  statistics  estimation  history 
april 2009 by arthegall
Money Metric Welfare and "Economic Efficiency" ~ Angry Bear
"It is clear that the choice of the initial state A is critical to this calculation. It is just not true that if the EV of a change from A to B is negative, then the EV of a change from B to A is positive. The choice of the initial state A is critical and always made very casually when attempting to calculate EVs."
economics  utility  expected-value  money  efficiency  markets  via:cshalizi 
april 2009 by arthegall
Shalizi, Camperi, and Klinkner, "Discovering Functional Communities in Dynamical Networks" (arXiv)
"In this paper, we lay out the problem of discovering_functional communities_, and describe an approach to doing so. This method combines recent work on measuring information sharing across stochastic networks with an existing and successful community-discovery algorithm for weighted networks. We illustrate it with an application to a large biophysical model of the transition from beta to gamma rhythms in the hippocampus."
via:cshalizi  arxiv  research-article  social-networks  networks  communities  machinelearning 
march 2009 by arthegall
John Sutton, "Technology and Market Structure" (MIT Press)
"One [way of studying market structure] looks to "industry characteristics" to explain why different industries develop in different ways; the other looks to the pattern of firm growth within a "typical" industry to describe the evolution of the size distribution of firms. In his new book, John Sutton sets out a unified theory that encompasses both approaches, while generating a series of novel predictions as to how markets evolve."
via:cshalizi  markets  technology  book  economics  development  growth  mit-press 
march 2009 by arthegall
"The unfortunate uselessness of most ’state of the art’ academic monetary economics" (Willem Buiter)
"The conclusion, boys and girls, should be that trade - voluntary exchange - is the exception rather than the rule and that markets are inherently and hopelessly incomplete. Live with it and start from that fact. The benchmark is no trade - pre-Friday Robinson Crusoe autarky. For every good, service or financial instrument that plays a role in your ‘model of the world’, you should explain why a market for it exists - why it is traded at all. Perhaps we shall get somewhere this time." --- Buiter's criticism of (among others) Robert Lucas.
via:cshalizi  markets  modeling  economics  essay  academia  autarky 
march 2009 by arthegall
Steven N. Durlauf
"... empirics, as opposed to speculative theory and stylized facts, of economic growth..."
researcher  homepage  economics  development  growth  economist  via:cshalizi 
march 2009 by arthegall
"Ikea chairs" (the statistical mechanic)
"In a review article about agent-based models Dietrich Stauffer once wrote "Physicists not only know everything, they also know everything better."" -- They're more like a contagion (physicists, that is) than anything else. Having done physics as a physicist would, they then spread out to new, uncharted (to them) areas, to explain how science should be done. Economics? It's really physics. Biology? Start with theoretical models, as a physicist would. Bioinformatics? We've solved those problems already. Philosophy of Science? Sorry, I think you meant to say, "philosophy of Science as it would be if performed by a Physicist." Philosophy of Physics, really. I suppose most of this is a completely normal byproduct of the fact that we're at the tail end of a century when physics was, as a discipline, technology, and economic activity, remarkably successful.
humor  physics  economics  quote  via:cshalizi 
march 2009 by arthegall
"Lovecraftian School Board Member Wants Madness Added To Curriculum" (The Onion)
"West says the school inadequately prepares students for the black seas of infinity." --- I couldn't agree more.
humor  education  onion  cthulu  lovecraft  via:cshalizi 
march 2009 by arthegall
Omics! Omics!: Ah, them gold rush days!
"In some sense the genomics companies were just too early for their own good (though the late entrants such as DeCode haven't fared much better). There are no genomics companies -- yet genomics is everywhere. Basic biology fueled by the genome or the technologies pushed by genomics permeate the drug industry (based on the 2 large pharmas I interviewed at in the year MLNM laid me off & what I can read; constructive dissent on this point is welcomed). Probably no novel small molecule drug development history will be directly pinned back to a 1990's genomics effort -- but also virtually no drugs going forward will have their development unaffected by the knowledge of the genome. Everything is tangled up & confused & merged." --- That's funny, some of the people from Millennium sit in Stata these days (in the Science Commons consortium). They have a similar perspective, I think, but a different takeaway message (also: more optimism).
biology  genomics  history  via:cshalizi  pharmaceuticals  drug-design  millennium  irrational-exuberance 
february 2009 by arthegall
36-707: Regression Analysis, Fall 2007
Larry Wasserman's class notes from a course on regression analysis. Via cshalizi.
statistics  list  regression  course  notes  via:cshalizi 
february 2009 by arthegall
Shrader-Frechette on Sunstein on Risk
I knew I'd seen this review somewhere before -- did I really not save it at the time (I guess not). I should probably go ahead and buy the book. "Although Sunstein correctly calls for “sound science” in risk policy, he often gets his science wrong and almost always attempts to reduce ethical to purely scientific questions." -- although that's a little question-begging itself. Then --- "Because most hazardous materials are not tested, most risk probabilities are determined through mathematical models. As such, the models describe events falling into the category of Bayesian “uncertainty,” where no accurate probabilities are available, because there are no frequency data. If data were available, there would be no need for risk analysis and its attendant models. Given this Bayesian uncertainty, virtually all risk experts accept the fact that risk analyses typically err by 4 to 6 orders of magnitude." -- What?
via:cshalizi  review  risk  book  cass-sunstein  decision-theory  rationality  policy  cost-benefit-analysis  ethics 
january 2009 by arthegall
PHD Comics: Abstract Mad Libs
Good enough that I would actually consider using it as a template for a future abstract.
via:cshalizi  academia  humor  comic  writing 
january 2009 by arthegall
"Beyond Proportional Analogy" (Apperceptual)
"For some time now, I’ve been experimenting with algorithms for solving proportional analogies." Haven't had enough time to read the underlying paper yet (apparently, my family and girlfriend are not cool with me reading papers at the dinner table on Christmas Eve? who knew...), but this looks totally sweet. Time to add Apperceptual to the RSS reader (I'm not sure why I hadn't done that already).
via:cshalizi  learning  analogies  machinelearning  relational-data 
december 2008 by arthegall
« earlier      

related tags

academia  addiction  aic  alan-willsky  algorithms  analogies  anomalies  arbitrage  architecture  arxiv  atmosphere  audio  autarky  awesome  banking  bayesian-methods  bayesian-networks  bayesian-probability  baysian-methods  belief  bell-numbers  betting  bf-skinner  big-data  bioethics  bioinformatics  biology  blockmodeling  blog  blogging  book  books  bounds  bruce-sterling  by:cshalizi  by:Vaguery  calculus  call-and-response  capitalism  cartoon  cass-sunstein  cat  cat-waxing  cats  causality  celera  citation  clustering  cmu  coffee  combinatorics  comic  comics  comment  communication  communities  complexity  computation  computational-biology  computerscience  consciousness  consistency  consumption  convex-optimization  cost-benefit-analysis  course  craig-venter  credit-derivatives  criticism  cthulu  culture  cybernetics  data  datamining  death-of-print  decision-theory  delicious  density-estimation  design  development  diagrams  differential-geometry  disease  drama  drug-design  drug-discovery  duncan-watts  dungeons-and-dragons  dutch-book  dynamic-bayesian-networks  dynamical-systems  economics  economist  economy  education  efficiency  electronics  email  empirical-bayes  entropy  epistemology  eric-xing  essay  estimation  ethics  expected-value  exponential-distributions  exponential-families  feyerabend  finance  folksonomy  formal-methods  functionalprogramming  futurism  game-of-thrones  game-theory  games  gardening  gaussianprocesses  gavagai  gene-expression  generalized-linear-models  generator  genes  genetic-association-studies  genetic-programming  genetics  genomics  geometry  goddamn-hipsters  graduate-school  grammar  graph  graphical-models  growth  health-care  heavy-tailed-distributions  hilarious  history  homepage  humor  hypothesis-testing  idea  identifiability  idiocy  ij-good  image  imagemagick  index  inequality  inference  informatics  information  innovation  integration  intellectual-property  intelligent-design  internet  interview  irrational-exuberance  jmlr  john-baez  john-milton  joke  journal-article  journalism  journamalism  jstor  jun-liu  kass  kelly-criterion  kernel-methods  kevin-kelly  knowledge  knowledge-base  language  lasso  latent-variables  law  le-cam  learning  lecture  lectures  linear-regression  linguistics  links  lint  list  local-linear-embedding  logic  lolcats  lovecraft  machinelearning  markets  markov-logic  markov-models  martingales  mathematics  matthew-effect  mccain  measurement  medicine  memorization  memory  microarray-analysis  microarrays  millennium  mineshaft  mit-press  mixture-models  mo-tkacik  model-selection  modeling  molecular-biology  money  monopoly  mortality  multiple-testing  network  networks  newspapers  neyman  neyman-pearson  nicholas-wade  nir-friedman  nonparametrics  norbert-weiner  notes  number  numeracy  nyt  obscurely-referential  old-ones  onion  online-optimization  ontology  optimization  paradise-lost  parameters  paul-krugman  pdf  personal-finance  pharmaceuticals  phenotype  philosophy  philosophy-of-science  photos  physics  picts  pnas  poetry  poisson-process  policy  politics  population-effects  population-genetics  probabilistic-relational-models  probability  procrastination  programming  project  project-euclid  proof  psychology  psychometrics  publication  publishing  pubmed  query  question  quote  r  random  random-processes  randomization  rant  rationality  rdf  recommendation  rediscovering-what-chl-already-knew  reference  regression  regulatory-networks  relational-data  religion  research  research-article  research-articles  researcher  review  richard-thaler  risk  satire  saving  science  selection  semantics  semanticweb  sleep  social-networks  sociology  software  spatial-data  spectral-graph-theory  ssrn  statistics  statnet  steins-method  stochastic-processes  strange  symbolic-methods  symmetry  systems-biology  tagging  tags  takedown  teaching  technology  temporal-data  testing  textbook  the-atlantic  theory  thermodynamics  thesis  time-series  tips  to-read  to:blog  to:buy  transcriptional-regulation  trees  tutorial  tv  utility  via:cshalizi  video  visualization  voting  we-used-to-pass-the-salt-in-cambridge  web  web-service  whitehead  word-cloud  writing  yak-shaving  zellig-harris 

Copy this bookmark:



description:


tags: