cshalizi + exponential_families   29

Okabayashi , Geyer : Long range search for maximum likelihood in exponential families
"Exponential families are often used to model data sets with complex dependence. Maximum likelihood estimators (MLE) can be difficult to estimate when the likelihood is expensive to compute. Markov chain Monte Carlo (MCMC) methods based on the MCMC-MLE algorithm in [17] are guaranteed to converge in theory under certain conditions when starting from any value, but in practice such an algorithm may labor to converge when given a poor starting value. We present a simple line search algorithm to find the MLE of a regular exponential family when the MLE exists and is unique. The algorithm can be started from any initial value and avoids the trial and error experimentation associated with calibrating algorithms like stochastic approximation. Unlike many optimization algorithms, this approach utilizes first derivative information only, evaluating neither the likelihood function itself nor derivatives of higher order than first. We show convergence of the algorithm for the case where the gradient can be calculated exactly. When it cannot, it has a particularly convenient form that is easily estimable with MCMC, making the algorithm still useful to a practitioner."
to:NB  statistics  exponential_families  exponential_family_random_graphs  network_data_analysis  estimation  monte_carlo  optimization  geyer.charles 
february 2012 by cshalizi
[1111.3054] Consistency under Sampling of Exponential Random Graph Models
"The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-network. This assumes that the model is consistent under sampling, or, in terms of the theory of stochastic processes, that it defines a projective family. Focussing on the popular class of exponential random graph models (ERGMs), we show that this apparently trivial condition is in fact violated by many popular and scientifically appealing models, and that satisfying it drastically limits ERGM's expressive power. These results are actually special cases of more general ones about exponential families of dependent random variables, which we also prove. Using such results, we offer easily checked conditions for the consistency of maximum likelihood estimation in ERGMs, and discuss some possible constructive responses."
in_NB  self-promotion  exponential_family_random_graphs  exponential_families  statistical_inference_for_stochastic_processes  statistics  network_data_analysis  re:your_favorite_ergm_sucks  estimation  large_deviations 
november 2011 by cshalizi
[1111.0483] Optimally approximating exponential families
"This article studies exponential families $mathcal{E}$ on finite sets such that the information divergence $D(P|mathcal{E})$ of an arbitrary probability distribution from $mathcal{E}$ is bounded by some constant $D>0$. A particular class of low-dimensional exponential families that have low values of $D$ can be obtained from partitions of the state space. The main results concern optimality properties of these partition exponential families. Exponential families where $D=log(2)$ are studied in detail. This case is special, because if $D<log(2)$, then $mathcal{E}$ contains all probability measures with full support."
to:NB  exponential_families  probability  information_theory  approximation 
november 2011 by cshalizi
Generative Kernels for Exponential Families
"In this paper, we propose a family of kernels for the data distributions belonging to the exponential family. We call these kernels generative kernels because they take into account the generative process of the data. Our proposed method considers the geometry of the data distribution to build a set of efficient closed-form kernels best suited for that distribution. We compare our generative kernels on multinomial data and observe improved empirical performance across the board. Moreover, our generative kernels perform significantly better when training size is small, an important property of the generative models."
to:NB  kernel_methods  exponential_families  machine_learning 
november 2011 by cshalizi
[0812.0449] Locally adaptive estimation methods with application to univariate time series
"The paper offers a unified approach to the study of three locally adaptive estimation methods in the context of univariate time series from both theoretical and empirical points of view. A general procedure for the computation of critical values is given. The underlying model encompasses all distributions from the exponential family providing for great flexibility. The procedures are applied to simulated and real financial data distributed according to the Gaussian, volatility, Poisson, exponential and Bernoulli models. Numerical results exhibit a very reasonable performance of the methods."
time_series  statistics  estimation  exponential_families  non-stationarity  to:NB 
july 2011 by cshalizi
Models for Longitudinal Network Data
Includes "actor-oriented" models in the general exponential family random graph framework.
snijders.tom  network_data_analysis  exponential_families  markov_models  to_teach:complexity-and-inference 
april 2008 by cshalizi
The Role of Sufficiency and of Estimation in Thermodynamics (Mandelbrot, 1962)
Free reprint of paper earlier saved in JSTOR version. Zeroth law = conditional on a sufficient statistic, the parameter doesn't change the temperature, etc.
statistics  statistical_mechanics  exponential_families  sufficiency  gibbs_distributions  mandelbrot.benoit 
february 2008 by cshalizi

related tags

algebraic_statistics  anomaly_detection  approximation  arrow_of_time  barron.andrew  bender-de_moll.skye  books:recommended  brown.lawrence  butts.carter_t  causality  causal_inference  computational_statistics  density_estimation  estimation  expectation-maximization  exponential_families  exponential_family_random_graphs  fienberg.steve  functional_connectivity  geyer.charles  gibbs_distributions  goodreau.steven_m  graphical_models  handcock.mark_s  have_read  hunter.david_r  hypothesis_testing  information_theory  in_NB  janzing.dominik  jordan.michael_i.  kernel_methods  kith_and_kin  large_deviations  latent_variables  low-regret-learning  machine_learning  mandelbrot.benoit  markov_models  minimax  monte_carlo  moody.james  morris.martina  network_data_analysis  neural_coding_and_decoding  neural_data_analysis  non-stationarity  nonparametrics  occams_razor  online_learning  optimization  prediction  predictive_state_representations  probability  raginsky.maxim  random_fields  re:AoS_project  re:bayes_as_evol  re:naive-semi-supervised  re:XV_for_networks  re:your_favorite_ergm_sucks  regression  rinaldo.alessandro  self-promotion  semi-supervised_learning  sheu.chyong-hwa  snijders.tom  sparsity  statistical_inference_for_stochastic_processes  statistical_mechanics  statistics  sufficiency  thermodynamics  time_series  to:blog  to:NB  to_read  to_teach:complexity-and-inference  variational_methods  via:ded-maxim  wainwright.martin  willett.rebecca 

Copy this bookmark:



description:


tags: