cshalizi + heard_the_talk 37
Accurately estimating neuronal correlation requires a new spike-sorting paradigm
20 days ago by cshalizi
"Neurophysiology is increasingly focused on identifying coincident activity among neurons. Strong inferences about neural computation are made from the results of such studies, so it is important that these results be accurate. However, the preliminary step in the analysis of such data, the assignment of spike waveforms to individual neurons (“spike-sorting”), makes a critical assumption which undermines the analysis: that spikes, and hence neurons, are independent. We show that this assumption guarantees that coincident spiking estimates such as correlation coefficients are biased. We also show how to eliminate this bias. Our solution involves sorting spikes jointly, which contrasts with the current practice of sorting spikes independently of other spikes. This new “ensemble sorting” yields unbiased estimates of coincident spiking, and permits more data to be analyzed with confidence, improving the quality and quantity of neurophysiological inferences. These results should be of interest outside the context of neuronal correlations studies. Indeed, simultaneous recording of many neurons has become the rule rather than the exception in experiments, so it is essential to spike sort correctly if we are to make valid inferences about any properties of, and relationships between, neurons."
to:NB
heard_the_talk
neuroscience
neural_data_analysis
ventura.valerie
kith_and_kin
statistics
inference_to_latent_objects
20 days ago by cshalizi
Exact Covariance Thresholding into Connected Components for Large-Scale Graphical Lasso
7 weeks ago by cshalizi
"We consider the sparse inverse covariance regularization problem or graphical lasso with regularization parameter λ. Suppose the sample covariance graph formed by thresholding the entries of the sample covariance matrix at λ is decomposed into connected components. We show that the vertex-partition induced by the connected components of the thresholded sample covariance graph (at λ) is exactly equal to that induced by the connected components of the estimated concentration graph, obtained by solving the graphical lasso problem for the same λ. This characterizes a very interesting property of a path of graphical lasso solutions. Furthermore, this simple rule, when used as a wrapper around existing algorithms for the graphical lasso, leads to enormous performance gains. For a range of values of λ, our proposal splits a large graphical lasso problem into smaller tractable problems, making it possible to solve an otherwise infeasible large-scale problem. We illustrate the graceful scalability of our proposal via synthetic and real-life microarray examples."
--- I wonder whether this hasn't some application to the PC algorithm?
to:NB
graphical_models
lasso
sparsity
statistics
heard_the_talk
--- I wonder whether this hasn't some application to the PC algorithm?
7 weeks ago by cshalizi
[0805.1179] Autoregressive Process Modeling via the Lasso Procedure
12 weeks ago by cshalizi
"The Lasso is a popular model selection and estimation procedure for linear models that enjoys nice theoretical properties. In this paper, we study the Lasso estimator for fitting autoregressive time series models. We adopt a double asymptotic framework where the maximal lag may increase with the sample size. We derive theoretical results establishing various types of consistency. In particular, we derive conditions under which the Lasso estimator for the autoregressive coefficients is model selection consistent, estimation consistent and prediction consistent. Simulation study results are reported."
to:NB
time_series
statistics
lasso
sparsity
variable_selection
kith_and_kin
heard_the_talk
rinaldo.alessandro
nardi.yuval
12 weeks ago by cshalizi
[0806.3978] Information In The Non-Stationary Case
12 weeks ago by cshalizi
"Information estimates such as the ``direct method'' of Strong et al. (1998) sidestep the difficult problem of estimating the joint distribution of response and stimulus by instead estimating the difference between the marginal and conditional entropies of the response. While this is an effective estimation strategy, it tempts the practitioner to ignore the role of the stimulus and the meaning of mutual information. We show here that, as the number of trials increases indefinitely, the direct (or ``plug-in'') estimate of marginal entropy converges (with probability 1) to the entropy of the time-averaged conditional distribution of the response, and the direct estimate of the conditional entropy converges to the time-averaged entropy of the conditional distribution of the response. Under joint stationarity and ergodicity of the response and stimulus, the difference of these quantities converges to the mutual information. When the stimulus is deterministic or non-stationary the direct estimate of information no longer estimates mutual information, which is no longer meaningful, but it remains a measure of variability of the response distribution across time."
in_NB
statistics
neuroscience
entropy_estimation
kith_and_kin
heard_the_talk
yu.bin
vu.vincent
kass.rob
neural_data_analysis
12 weeks ago by cshalizi
Phys. Rev. E 84, 051917 (2011): Nonequilibrium phase transitions in biomolecular signal transduction
november 2011 by cshalizi
"We study a mechanism for reliable switching in biomolecular signal-transduction cascades. Steady bistable states are created by system-size cooperative effects in populations of proteins, in spite of the fact that the phosphorylation-state transitions of any molecule, by means of which the switch is implemented, are highly stochastic. The emergence of switching is a nonequilibrium phase transition in an energetically driven, dissipative system described by a master equation. We use operator and functional integral methods from reaction-diffusion theory to solve for the phase structure, noise spectrum, and escape trajectories and first-passage times of a class of minimal models of switches, showing how all critical properties for switch behavior can be computed within a unified framework."
to:NB
heard_the_talk
kith_and_kin
signal_transduction
biochemical_networks
phase_transitions
statistical_mechanics
non-equilibrium
smith.eric
fontana.walter
krakauer.david
november 2011 by cshalizi
[1111.4226] Joint Modeling of Multiple Related Time Series via the Beta Process
november 2011 by cshalizi
"We propose a Bayesian nonparametric approach to the problem of jointly modeling multiple related time series. Our approach is based on the discovery of a set of latent, shared dynamical behaviors. Using a beta process prior, the size of the set and the sharing pattern are both inferred from data. We develop efficient Markov chain Monte Carlo methods based on the Indian buffet process representation of the predictive distribution of the beta process, without relying on a truncated model. In particular, our approach uses the sum-product algorithm to efficiently compute Metropolis-Hastings acceptance probabilities, and explores new dynamical behaviors via birth and death proposals. We examine the benefits of our proposed feature-based model on several synthetic datasets, and also demonstrate promising results on unsupervised segmentation of visual motion capture data."
to:NB
heard_the_talk
time_series
statistics
machine_learning
nonparametrics
fox.emily
jordan.michael_i.
november 2011 by cshalizi
[1111.4470] Efficient Regression in Metric Spaces via Approximate Lipschitz Extension
november 2011 by cshalizi
"We present a framework for performing efficient regression in general metric spaces. Roughly speaking, our regressor predicts the value at a new point by computing a Lipschitz extension -- the smoothest function consistent with the observed data -- while performing an optimized structural risk minimization to avoid overfitting. The offline (learning) and online (inference) stages can be solved by convex programming, but this naive approach has runtime complexity O(n^3), which is prohibitive for large datasets. We design instead an algorithm that is fast when the the doubling dimension, which measures the "intrinsic" dimensionality of the metric space, is low.
We use the doubling dimension multiple times; first, on the statistical front, to bound fat-shattering dimension of the class of Lipschitz functions (and obtain risk bounds); and second, on the computational front, to quickly compute a hypothesis function and a prediction based on Lipschitz extension. Our resulting regressor is both asymptotically strongly consistent and comes with finite-sample risk bounds, while making minimal structural and noise assumptions."
in_NB
heard_the_talk
kith_and_kin
regression
learning_theory
statistics
kontorovich.aryeh
We use the doubling dimension multiple times; first, on the statistical front, to bound fat-shattering dimension of the class of Lipschitz functions (and obtain risk bounds); and second, on the computational front, to quickly compute a hypothesis function and a prediction based on Lipschitz extension. Our resulting regressor is both asymptotically strongly consistent and comes with finite-sample risk bounds, while making minimal structural and noise assumptions."
november 2011 by cshalizi
Adventures in Data Land, Graphical Models for the Internet
march 2011 by cshalizi
Look at this later and re-consider the to_teach tags.
clustering
graphical_models
tutorials
expectation-maximization
internet
text_mining
to_teach:data-mining
to_teach:undergrad-ADA
smola.alex
ahmed.amr
heard_the_talk
march 2011 by cshalizi
[1103.4395] On Non-Bayesian Social Learning
march 2011 by cshalizi
Heard the talk at ISIT 2011; it was very good.
collective_cognition
heard_the_talk
networks
to:NB
to:blog
march 2011 by cshalizi
SSRN-Non-Bayesian Social Learning, Second Version by Ali Jadbabaie, Alvaro Sandroni, Alireza Tahbaz-Salehi
february 2011 by cshalizi
Actually, I heard the talk about this and some even more impressive follow-up work, which Jadbabaie said will be in working paper form very soon.
social_life_of_the_mind
collective_cognition
heard_the_talk
february 2011 by cshalizi
[1101.5108] Causal Dependence Tree Approximations of Joint Distributions for Multiple Random Processes
february 2011 by cshalizi
"We investigate approximating joint distributions of random processes with causal dependence tree distributions. Such distributions are particularly useful in providing parsimonious representation when there exists causal dynamics among processes. By extending the results by Chow and Liu on dependence tree approximations, we show that the best causal dependence tree approximation is the one which maximizes the sum of directed informations on its edges, where best is defined in terms of minimizing the KL-divergence between the original and the approximate distribution. Moreover, we describe a low-complexity algorithm to efficiently pick this approximate distribution."
stochastic_processes
information_theory
graphical_models
chow-liu_trees
in_NB
coleman.todd
heard_the_talk
february 2011 by cshalizi
Propagation of innovations in networked groups.
december 2010 by cshalizi
"A novel paradigm was developed to study the behavior of groups of networked people searching a problem space. The authors examined how different network structures affect the propagation of information in laboratory-created groups. Participants made numerical guesses and received scores that were also made available to their neighbors in the network. The networks were compared on speed of discovery and convergence on the optimal solution. One experiment showed that individuals within a group tend to converge on similar solutions even when there is an equally valid alternative solution. Two additional studies demonstrated that the optimal network structure depends on the problem space being explored, with networks that incorporate spatially based cliques having an advantage for problems that benefit from broad exploration, and networks with greater long-range connectivity having an advantage for problems requiring less exploration."
social_networks
experimental_psychology
collective_cognition
social_life_of_the_mind
re:do-institutions-evolve
kith_and_kin
heard_the_talk
have_read
to_teach:complexity-and-inference
to:blog
mason.winter
re:democratic_cognition
december 2010 by cshalizi
Approximate Bayesian Computation in Evolution and Ecology (Beaumont, 2010)
november 2010 by cshalizi
Heard the talk in Bristol. Naturally the idea makes a lot more sense when laid out by a proponent than by a hostile critic; it's actually very similar to indirect inference (only with a prior to add bias).
heard_the_talk
to_read
indirect_inference
approximate_bayesian_computation
statistics
estimation
november 2010 by cshalizi
[1010.2286] Divergence-based characterization of fundamental limitations of adaptive dynamical systems
october 2010 by cshalizi
"general problem of adaptively controlling and/or identifying a stochastic dynamical system, where our {\em a priori} knowledge allows us to place the system in a subset of a metric space (the uncertainty set). We present an information-theoretic meta-theorem that captures the trade-off between the metric complexity (or richness) of the uncertainty set, the amount of information acquired online in the process of controlling and observing the system, and the residual uncertainty remaining after the observations have been collected. Following the approach of Zames, we quantify {\em a priori} information by the Kolmogorov (metric) entropy of the uncertainty set, while the information acquired online is expressed as a sum of information divergences. The general theory is used to derive new minimax lower bounds on the metric identification error, as well as to give a simple derivation of the minimum time needed to stabilize an uncertain stochastic linear system."
systems_identification
control_theory
estimation
learning_theory
minimax
heard_the_talk
raginsky.maxim
to:NB
october 2010 by cshalizi
Discrete Time Rescaling Theorem: Determining Goodness of Fit for Discrete Time Statistical Models of Neural Spiking
september 2010 by cshalizi
And by "heard the talk" I mean "had it explained on napkin-back over beer".
neural_data_analysis
time_series
kith_and_kin
haslinger.rob
brown.emery
time_rescaling
statistics
heard_the_talk
september 2010 by cshalizi
Snijders, Koskinen, Schweinberger: Maximum likelihood estimation for social network dynamics
august 2010 by cshalizi
"A model for network panel data is discussed, based on the assumption that the observed data are discrete observations of a continuous-time Markov process on the space of all directed graphs on a given node set, in which changes in tie variables are independent conditional on the current graph. The model for tie changes is parametric and designed for applications to social network analysis, where the network dynamics can be interpreted as being generated by choices made by the social actors represented by the nodes of the graph. An algorithm for calculating the Maximum Likelihood estimator is presented, based on data augmentation and stochastic approximation. An application to an evolving friendship network is given and a small simulation study is presented which suggests that for small data sets the Maximum Likelihood estimator is more efficient than the earlier proposed Method of Moments estimator."
network_data_analysis
social_networks
markov_models
estimation
statistics
heard_the_talk
august 2010 by cshalizi
Inferring deterministic causal relations
july 2010 by cshalizi
Best Student Paper at UAI 2010. What would happen if you used this on sequences of values from the Arnold cat map? Could it learn the direction of time?
causal_inference
information_theory
to_read
heard_the_talk
july 2010 by cshalizi
Social Influence and the Autism Epidemic
may 2010 by cshalizi
Social influence on diagnosis, not actually producing autism. Heard the talk at the MERSI conference in 2009; it sounded pretty convincing.
re:homophily_and_confounding
social_cognition
autism
sociology
heard_the_talk
via:orgtheory
may 2010 by cshalizi
[1002.4802] Gaussian Process Structural Equation Models with Latent Variables
february 2010 by cshalizi
"In a variety of disciplines such as social sciences, psychology, medicine and economics, the recorded data are considered to be noisy measurements of latent variables connected by some causal structure. This corresponds to a family of graphical models known as the structural equation model with latent variables. While linear non-Gaussian variants have been well-studied, inference in nonparametric structural equation models is still underdeveloped. We introduce a sparse Gaussian process parameterization that defines a non-linear structure connecting latent variables, unlike common formulations of Gaussian process latent variable models. An efficient Markov chain Monte Carlo procedure is described. We evaluate the stability of the sampling procedure and the predictive ability of the model compared against the current practice."
statistics
graphical_models
latent_variables
nonparametrics
estimation
heard_the_talk
february 2010 by cshalizi
Project MUSE - Demography - Birds of a Feather, Or Friend of a Friend?: Using Exponential Random Graph Models to Investigate Adolescent Social Networks
network_data_analysis social_networks homophily re:homophily_and_confounding kith_and_kin morris.martina have_read heard_the_talk networks exponential_family_random_graphs to:blog
january 2010 by cshalizi
network_data_analysis social_networks homophily re:homophily_and_confounding kith_and_kin morris.martina have_read heard_the_talk networks exponential_family_random_graphs to:blog
january 2010 by cshalizi
[0708.3412] Observability and nonlinear filtering
november 2007 by cshalizi
"This paper develops a connection between the asymptotic stability of nonlinear filters and a notion of observability. We consider a general class of hidden Markov models in continuous time with compact signal state space, and call such a model observable if no two initial measures of the signal process give rise to the same law of the observation process. We demonstrate that observability implies stability of the filter, i.e., the filtered estimates become insensitive to the initial measure at large times. For the special case where the signal is a finite-state Markov process and the observations are of the white noise type, a complete (necessary and sufficient) characterization of filter stability is obtained in terms of a slightly weaker detectability condition. In addition to observability, the role of controllability in filter stability is explored. Finally, the results are partially extended to non-compact signal state spaces."
in_NB
markov_models
filtering
re:AoS_project
van_handel.ramon
heard_the_talk
november 2007 by cshalizi
PLoS Computational Biology - From Inverse Problems in Mathematical Physiology to Quantitative Differential Diagnoses
statistics inverse_problems automated_diagnosis state_estimation zenker.sven rubin.jonathan clermont.gilles have_read computational_statistics heard_the_talk identifiability in_NB
november 2007 by cshalizi
statistics inverse_problems automated_diagnosis state_estimation zenker.sven rubin.jonathan clermont.gilles have_read computational_statistics heard_the_talk identifiability in_NB
november 2007 by cshalizi
related tags
adaptation ⊕ ahmed.amr ⊕ albers.dave ⊕ amaral.luis ⊕ approximate_bayesian_computation ⊕ autism ⊕ automated_diagnosis ⊕ bergstrom.carl ⊕ biochemical_networks ⊕ brown.emery ⊕ causal_inference ⊕ change-point_problem ⊕ chow-liu_trees ⊕ citation_networks ⊕ clermont.gilles ⊕ clustering ⊕ coleman.todd ⊕ collective_cognition ⊕ community_discovery ⊕ computational_statistics ⊕ control_theory ⊕ data_mining ⊕ emergence ⊕ entropy_estimation ⊕ ergodic_theory ⊕ estimation ⊕ expectation-maximization ⊕ experimental_psychology ⊕ exponential_family_random_graphs ⊕ filtering ⊕ fontana.walter ⊕ fox.emily ⊕ graphical_models ⊕ haslinger.rob ⊕ have_read ⊕ heard_the_talk ⊖ hofman.jake ⊕ homophily ⊕ identifiability ⊕ indirect_inference ⊕ inference_to_latent_objects ⊕ influence ⊕ information_theory ⊕ internet ⊕ inverse_problems ⊕ in_NB ⊕ jordan.michael_i. ⊕ kass.rob ⊕ kith_and_kin ⊕ kontorovich.aryeh ⊕ krakauer.david ⊕ lasso ⊕ latent_variables ⊕ learning_theory ⊕ machine_learning ⊕ map-reduce ⊕ markov_models ⊕ mason.winter ⊕ minimax ⊕ model_selection ⊕ morris.martina ⊕ nardi.yuval ⊕ networked_life ⊕ networks ⊕ network_data_analysis ⊕ neural_data_analysis ⊕ neuroscience ⊕ non-equilibrium ⊕ nonparametrics ⊕ parallel_computing ⊕ phase_transitions ⊕ point_processes ⊕ raginsky.maxim ⊕ random_walks ⊕ re:AoS_project ⊕ re:critique_of_diffusion ⊕ re:democratic_cognition ⊕ re:do-institutions-evolve ⊕ re:homophily_and_confounding ⊕ re:phil-of-bayes_paper ⊕ re:stacs ⊕ regression ⊕ rinaldo.alessandro ⊕ rosvall.martin ⊕ rubin.jonathan ⊕ signal_transduction ⊕ smith.eric ⊕ smola.alex ⊕ social_cognition ⊕ social_life_of_the_mind ⊕ social_networks ⊕ sociology ⊕ sparsity ⊕ state_estimation ⊕ statistical_mechanics ⊕ statistics ⊕ stochastic_processes ⊕ systems_identification ⊕ text_mining ⊕ time_rescaling ⊕ time_series ⊕ to:blog ⊕ to:NB ⊕ to_read ⊕ to_teach:complexity-and-inference ⊕ to_teach:data-mining ⊕ to_teach:undergrad-ADA ⊕ tutorials ⊕ van_handel.ramon ⊕ variable_selection ⊕ ventura.valerie ⊕ via:ariddell ⊕ via:orgtheory ⊕ voter_model ⊕ vu.vincent ⊕ watts.duncan ⊕ web ⊕ yu.bin ⊕ zenker.sven ⊕Copy this bookmark: