cshalizi + heard_the_talk   37

Accurately estimating neuronal correlation requires a new spike-sorting paradigm
"Neurophysiology is increasingly focused on identifying coincident activity among neurons. Strong inferences about neural computation are made from the results of such studies, so it is important that these results be accurate. However, the preliminary step in the analysis of such data, the assignment of spike waveforms to individual neurons (“spike-sorting”), makes a critical assumption which undermines the analysis: that spikes, and hence neurons, are independent. We show that this assumption guarantees that coincident spiking estimates such as correlation coefficients are biased. We also show how to eliminate this bias. Our solution involves sorting spikes jointly, which contrasts with the current practice of sorting spikes independently of other spikes. This new “ensemble sorting” yields unbiased estimates of coincident spiking, and permits more data to be analyzed with confidence, improving the quality and quantity of neurophysiological inferences. These results should be of interest outside the context of neuronal correlations studies. Indeed, simultaneous recording of many neurons has become the rule rather than the exception in experiments, so it is essential to spike sort correctly if we are to make valid inferences about any properties of, and relationships between, neurons."
to:NB  heard_the_talk  neuroscience  neural_data_analysis  ventura.valerie  kith_and_kin  statistics  inference_to_latent_objects 
20 days ago by cshalizi
Exact Covariance Thresholding into Connected Components for Large-Scale Graphical Lasso
"We consider the sparse inverse covariance regularization problem or graphical lasso with regularization parameter λ. Suppose the sample covariance graph formed by thresholding the entries of the sample covariance matrix at λ is decomposed into connected components. We show that the vertex-partition induced by the connected components of the thresholded sample covariance graph (at λ) is exactly equal to that induced by the connected components of the estimated concentration graph, obtained by solving the graphical lasso problem for the same λ. This characterizes a very interesting property of a path of graphical lasso solutions. Furthermore, this simple rule, when used as a wrapper around existing algorithms for the graphical lasso, leads to enormous performance gains. For a range of values of λ, our proposal splits a large graphical lasso problem into smaller tractable problems, making it possible to solve an otherwise infeasible large-scale problem. We illustrate the graceful scalability of our proposal via synthetic and real-life microarray examples."
--- I wonder whether this hasn't some application to the PC algorithm?
to:NB  graphical_models  lasso  sparsity  statistics  heard_the_talk 
7 weeks ago by cshalizi
[0805.1179] Autoregressive Process Modeling via the Lasso Procedure
"The Lasso is a popular model selection and estimation procedure for linear models that enjoys nice theoretical properties. In this paper, we study the Lasso estimator for fitting autoregressive time series models. We adopt a double asymptotic framework where the maximal lag may increase with the sample size. We derive theoretical results establishing various types of consistency. In particular, we derive conditions under which the Lasso estimator for the autoregressive coefficients is model selection consistent, estimation consistent and prediction consistent. Simulation study results are reported."
to:NB  time_series  statistics  lasso  sparsity  variable_selection  kith_and_kin  heard_the_talk  rinaldo.alessandro  nardi.yuval 
12 weeks ago by cshalizi
[0806.3978] Information In The Non-Stationary Case
"Information estimates such as the ``direct method'' of Strong et al. (1998) sidestep the difficult problem of estimating the joint distribution of response and stimulus by instead estimating the difference between the marginal and conditional entropies of the response. While this is an effective estimation strategy, it tempts the practitioner to ignore the role of the stimulus and the meaning of mutual information. We show here that, as the number of trials increases indefinitely, the direct (or ``plug-in'') estimate of marginal entropy converges (with probability 1) to the entropy of the time-averaged conditional distribution of the response, and the direct estimate of the conditional entropy converges to the time-averaged entropy of the conditional distribution of the response. Under joint stationarity and ergodicity of the response and stimulus, the difference of these quantities converges to the mutual information. When the stimulus is deterministic or non-stationary the direct estimate of information no longer estimates mutual information, which is no longer meaningful, but it remains a measure of variability of the response distribution across time."
in_NB  statistics  neuroscience  entropy_estimation  kith_and_kin  heard_the_talk  yu.bin  vu.vincent  kass.rob  neural_data_analysis 
12 weeks ago by cshalizi
Phys. Rev. E 84, 051917 (2011): Nonequilibrium phase transitions in biomolecular signal transduction
"We study a mechanism for reliable switching in biomolecular signal-transduction cascades. Steady bistable states are created by system-size cooperative effects in populations of proteins, in spite of the fact that the phosphorylation-state transitions of any molecule, by means of which the switch is implemented, are highly stochastic. The emergence of switching is a nonequilibrium phase transition in an energetically driven, dissipative system described by a master equation. We use operator and functional integral methods from reaction-diffusion theory to solve for the phase structure, noise spectrum, and escape trajectories and first-passage times of a class of minimal models of switches, showing how all critical properties for switch behavior can be computed within a unified framework."
to:NB  heard_the_talk  kith_and_kin  signal_transduction  biochemical_networks  phase_transitions  statistical_mechanics  non-equilibrium  smith.eric  fontana.walter  krakauer.david 
november 2011 by cshalizi
[1111.4226] Joint Modeling of Multiple Related Time Series via the Beta Process
"We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our approach is based on the discovery of a set of latent, shared dynamical behaviors. Using a beta process prior, the size of the set and the sharing pattern are both inferred from data. We develop efficient Markov chain Monte Carlo methods based on the Indian buffet process representation of the predictive distribution of the beta process, without relying on a truncated model. In particular, our approach uses the sum-product algorithm to efficiently compute Metropolis-Hastings acceptance probabilities, and explores new dynamical behaviors via birth and death proposals. We examine the benefits of our proposed feature-based model on several synthetic datasets, and also demonstrate promising results on unsupervised segmentation of visual motion capture data."
to:NB  heard_the_talk  time_series  statistics  machine_learning  nonparametrics  fox.emily  jordan.michael_i. 
november 2011 by cshalizi
[1111.4470] Efficient Regression in Metric Spaces via Approximate Lipschitz Extension
"We present a framework for performing efficient regression in general metric spaces. Roughly speaking, our regressor predicts the value at a new point by computing a Lipschitz extension -- the smoothest function consistent with the observed data -- while performing an optimized structural risk minimization to avoid overfitting. The offline (learning) and online (inference) stages can be solved by convex programming, but this naive approach has runtime complexity O(n^3), which is prohibitive for large datasets. We design instead an algorithm that is fast when the the doubling dimension, which measures the "intrinsic" dimensionality of the metric space, is low.
We use the doubling dimension multiple times; first, on the statistical front, to bound fat-shattering dimension of the class of Lipschitz functions (and obtain risk bounds); and second, on the computational front, to quickly compute a hypothesis function and a prediction based on Lipschitz extension. Our resulting regressor is both asymptotically strongly consistent and comes with finite-sample risk bounds, while making minimal structural and noise assumptions."
in_NB  heard_the_talk  kith_and_kin  regression  learning_theory  statistics  kontorovich.aryeh 
november 2011 by cshalizi
SSRN-Non-Bayesian Social Learning, Second Version by Ali Jadbabaie, Alvaro Sandroni, Alireza Tahbaz-Salehi
Actually, I heard the talk about this and some even more impressive follow-up work, which Jadbabaie said will be in working paper form very soon.
social_life_of_the_mind  collective_cognition  heard_the_talk 
february 2011 by cshalizi
[1101.5108] Causal Dependence Tree Approximations of Joint Distributions for Multiple Random Processes
"We investigate approximating joint distributions of random processes with causal dependence tree distributions. Such distributions are particularly useful in providing parsimonious representation when there exists causal dynamics among processes. By extending the results by Chow and Liu on dependence tree approximations, we show that the best causal dependence tree approximation is the one which maximizes the sum of directed informations on its edges, where best is defined in terms of minimizing the KL-divergence between the original and the approximate distribution. Moreover, we describe a low-complexity algorithm to efficiently pick this approximate distribution."
stochastic_processes  information_theory  graphical_models  chow-liu_trees  in_NB  coleman.todd  heard_the_talk 
february 2011 by cshalizi
Propagation of innovations in networked groups.
"A novel paradigm was developed to study the behavior of groups of networked people searching a problem space. The authors examined how different network structures affect the propagation of information in laboratory-created groups. Participants made numerical guesses and received scores that were also made available to their neighbors in the network. The networks were compared on speed of discovery and convergence on the optimal solution. One experiment showed that individuals within a group tend to converge on similar solutions even when there is an equally valid alternative solution. Two additional studies demonstrated that the optimal network structure depends on the problem space being explored, with networks that incorporate spatially based cliques having an advantage for problems that benefit from broad exploration, and networks with greater long-range connectivity having an advantage for problems requiring less exploration."
social_networks  experimental_psychology  collective_cognition  social_life_of_the_mind  re:do-institutions-evolve  kith_and_kin  heard_the_talk  have_read  to_teach:complexity-and-inference  to:blog  mason.winter  re:democratic_cognition 
december 2010 by cshalizi
Approximate Bayesian Computation in Evolution and Ecology (Beaumont, 2010)
Heard the talk in Bristol. Naturally the idea makes a lot more sense when laid out by a proponent than by a hostile critic; it's actually very similar to indirect inference (only with a prior to add bias).
heard_the_talk  to_read  indirect_inference  approximate_bayesian_computation  statistics  estimation 
november 2010 by cshalizi
[1010.2286] Divergence-based characterization of fundamental limitations of adaptive dynamical systems
"general problem of adaptively controlling and/or identifying a stochastic dynamical system, where our {\em a priori} knowledge allows us to place the system in a subset of a metric space (the uncertainty set). We present an information-theoretic meta-theorem that captures the trade-off between the metric complexity (or richness) of the uncertainty set, the amount of information acquired online in the process of controlling and observing the system, and the residual uncertainty remaining after the observations have been collected. Following the approach of Zames, we quantify {\em a priori} information by the Kolmogorov (metric) entropy of the uncertainty set, while the information acquired online is expressed as a sum of information divergences. The general theory is used to derive new minimax lower bounds on the metric identification error, as well as to give a simple derivation of the minimum time needed to stabilize an uncertain stochastic linear system."
systems_identification  control_theory  estimation  learning_theory  minimax  heard_the_talk  raginsky.maxim  to:NB 
october 2010 by cshalizi
Snijders, Koskinen, Schweinberger: Maximum likelihood estimation for social network dynamics
"A model for network panel data is discussed, based on the assumption that the observed data are discrete observations of a continuous-time Markov process on the space of all directed graphs on a given node set, in which changes in tie variables are independent conditional on the current graph. The model for tie changes is parametric and designed for applications to social network analysis, where the network dynamics can be interpreted as being generated by choices made by the social actors represented by the nodes of the graph. An algorithm for calculating the Maximum Likelihood estimator is presented, based on data augmentation and stochastic approximation. An application to an evolving friendship network is given and a small simulation study is presented which suggests that for small data sets the Maximum Likelihood estimator is more efficient than the earlier proposed Method of Moments estimator."
network_data_analysis  social_networks  markov_models  estimation  statistics  heard_the_talk 
august 2010 by cshalizi
Inferring deterministic causal relations
Best Student Paper at UAI 2010. What would happen if you used this on sequences of values from the Arnold cat map? Could it learn the direction of time?
causal_inference  information_theory  to_read  heard_the_talk 
july 2010 by cshalizi
Social Influence and the Autism Epidemic
Social influence on diagnosis, not actually producing autism. Heard the talk at the MERSI conference in 2009; it sounded pretty convincing.
re:homophily_and_confounding  social_cognition  autism  sociology  heard_the_talk  via:orgtheory 
may 2010 by cshalizi
[1002.4802] Gaussian Process Structural Equation Models with Latent Variables
"In a variety of disciplines such as social sciences, psychology, medicine and economics, the recorded data are considered to be noisy measurements of latent variables connected by some causal structure. This corresponds to a family of graphical models known as the structural equation model with latent variables. While linear non-Gaussian variants have been well-studied, inference in nonparametric structural equation models is still underdeveloped. We introduce a sparse Gaussian process parameterization that defines a non-linear structure connecting latent variables, unlike common formulations of Gaussian process latent variable models. An efficient Markov chain Monte Carlo procedure is described. We evaluate the stability of the sampling procedure and the predictive ability of the model compared against the current practice."
statistics  graphical_models  latent_variables  nonparametrics  estimation  heard_the_talk 
february 2010 by cshalizi
[0708.3412] Observability and nonlinear filtering
"This paper develops a connection between the asymptotic stability of nonlinear filters and a notion of observability. We consider a general class of hidden Markov models in continuous time with compact signal state space, and call such a model observable if no two initial measures of the signal process give rise to the same law of the observation process. We demonstrate that observability implies stability of the filter, i.e., the filtered estimates become insensitive to the initial measure at large times. For the special case where the signal is a finite-state Markov process and the observations are of the white noise type, a complete (necessary and sufficient) characterization of filter stability is obtained in terms of a slightly weaker detectability condition. In addition to observability, the role of controllability in filter stability is explored. Finally, the results are partially extended to non-compact signal state spaces."
in_NB  markov_models  filtering  re:AoS_project  van_handel.ramon  heard_the_talk 
november 2007 by cshalizi

related tags

adaptation  ahmed.amr  albers.dave  amaral.luis  approximate_bayesian_computation  autism  automated_diagnosis  bergstrom.carl  biochemical_networks  brown.emery  causal_inference  change-point_problem  chow-liu_trees  citation_networks  clermont.gilles  clustering  coleman.todd  collective_cognition  community_discovery  computational_statistics  control_theory  data_mining  emergence  entropy_estimation  ergodic_theory  estimation  expectation-maximization  experimental_psychology  exponential_family_random_graphs  filtering  fontana.walter  fox.emily  graphical_models  haslinger.rob  have_read  heard_the_talk  hofman.jake  homophily  identifiability  indirect_inference  inference_to_latent_objects  influence  information_theory  internet  inverse_problems  in_NB  jordan.michael_i.  kass.rob  kith_and_kin  kontorovich.aryeh  krakauer.david  lasso  latent_variables  learning_theory  machine_learning  map-reduce  markov_models  mason.winter  minimax  model_selection  morris.martina  nardi.yuval  networked_life  networks  network_data_analysis  neural_data_analysis  neuroscience  non-equilibrium  nonparametrics  parallel_computing  phase_transitions  point_processes  raginsky.maxim  random_walks  re:AoS_project  re:critique_of_diffusion  re:democratic_cognition  re:do-institutions-evolve  re:homophily_and_confounding  re:phil-of-bayes_paper  re:stacs  regression  rinaldo.alessandro  rosvall.martin  rubin.jonathan  signal_transduction  smith.eric  smola.alex  social_cognition  social_life_of_the_mind  social_networks  sociology  sparsity  state_estimation  statistical_mechanics  statistics  stochastic_processes  systems_identification  text_mining  time_rescaling  time_series  to:blog  to:NB  to_read  to_teach:complexity-and-inference  to_teach:data-mining  to_teach:undergrad-ADA  tutorials  van_handel.ramon  variable_selection  ventura.valerie  via:ariddell  via:orgtheory  voter_model  vu.vincent  watts.duncan  web  yu.bin  zenker.sven 

Copy this bookmark:



description:


tags: