cshalizi + exponential_families 29
Okabayashi , Geyer : Long range search for maximum likelihood in exponential families
february 2012 by cshalizi
"Exponential families are often used to model data sets with complex dependence. Maximum likelihood estimators (MLE) can be difficult to estimate when the likelihood is expensive to compute. Markov chain Monte Carlo (MCMC) methods based on the MCMC-MLE algorithm in [17] are guaranteed to converge in theory under certain conditions when starting from any value, but in practice such an algorithm may labor to converge when given a poor starting value. We present a simple line search algorithm to find the MLE of a regular exponential family when the MLE exists and is unique. The algorithm can be started from any initial value and avoids the trial and error experimentation associated with calibrating algorithms like stochastic approximation. Unlike many optimization algorithms, this approach utilizes first derivative information only, evaluating neither the likelihood function itself nor derivatives of higher order than first. We show convergence of the algorithm for the case where the gradient can be calculated exactly. When it cannot, it has a particularly convenient form that is easily estimable with MCMC, making the algorithm still useful to a practitioner."
to:NB
statistics
exponential_families
exponential_family_random_graphs
network_data_analysis
estimation
monte_carlo
optimization
geyer.charles
february 2012 by cshalizi
[1111.3054] Consistency under Sampling of Exponential Random Graph Models
november 2011 by cshalizi
"The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-network. This assumes that the model is consistent under sampling, or, in terms of the theory of stochastic processes, that it defines a projective family. Focussing on the popular class of exponential random graph models (ERGMs), we show that this apparently trivial condition is in fact violated by many popular and scientifically appealing models, and that satisfying it drastically limits ERGM's expressive power. These results are actually special cases of more general ones about exponential families of dependent random variables, which we also prove. Using such results, we offer easily checked conditions for the consistency of maximum likelihood estimation in ERGMs, and discuss some possible constructive responses."
in_NB
self-promotion
exponential_family_random_graphs
exponential_families
statistical_inference_for_stochastic_processes
statistics
network_data_analysis
re:your_favorite_ergm_sucks
estimation
large_deviations
november 2011 by cshalizi
[1111.0483] Optimally approximating exponential families
november 2011 by cshalizi
"This article studies exponential families $mathcal{E}$ on finite sets such that the information divergence $D(P|mathcal{E})$ of an arbitrary probability distribution from $mathcal{E}$ is bounded by some constant $D>0$. A particular class of low-dimensional exponential families that have low values of $D$ can be obtained from partitions of the state space. The main results concern optimality properties of these partition exponential families. Exponential families where $D=log(2)$ are studied in detail. This case is special, because if $D<log(2)$, then $mathcal{E}$ contains all probability measures with full support."
to:NB
exponential_families
probability
information_theory
approximation
november 2011 by cshalizi
Generative Kernels for Exponential Families
november 2011 by cshalizi
"In this paper, we propose a family of kernels for the data distributions belonging to the exponential family. We call these kernels generative kernels because they take into account the generative process of the data. Our proposed method considers the geometry of the data distribution to build a set of efficient closed-form kernels best suited for that distribution. We compare our generative kernels on multinomial data and observe improved empirical performance across the board. Moreover, our generative kernels perform significantly better when training size is small, an important property of the generative models."
to:NB
kernel_methods
exponential_families
machine_learning
november 2011 by cshalizi
[0812.0449] Locally adaptive estimation methods with application to univariate time series
july 2011 by cshalizi
"The paper offers a unified approach to the study of three locally adaptive estimation methods in the context of univariate time series from both theoretical and empirical points of view. A general procedure for the computation of critical values is given. The underlying model encompasses all distributions from the exponential family providing for great flexibility. The procedures are applied to simulated and real financial data distributed according to the Gaussian, volatility, Poisson, exponential and Bernoulli models. Numerical results exhibit a very reasonable performance of the methods."
time_series
statistics
estimation
exponential_families
non-stationarity
to:NB
july 2011 by cshalizi
[1003.3157] Entropy-based parametric estimation of spike train statistics
march 2010 by cshalizi
Not sure there's anything new here...
exponential_families
neural_data_analysis
march 2010 by cshalizi
Sequential Anomaly Detection in the Presence of Noise and Limited Feedback (Raginsky et al., submitted 2009)
november 2009 by cshalizi
FHTAGN!
statistics
time_series
statistical_inference_for_stochastic_processes
information_theory
exponential_families
anomaly_detection
raginsky.maxim
willett.rebecca
have_read
to:blog
november 2009 by cshalizi
Sequential Probability Assignment Via Online Convex Programming Using Exponential Families (Raginsky, Marcia, Silva and Willett)
october 2009 by cshalizi
Today's seminar. Very cool.
have_read
statistics
statistical_inference_for_stochastic_processes
exponential_families
information_theory
prediction
minimax
optimization
to:blog
raginsky.maxim
online_learning
willett.rebecca
low-regret-learning
in_NB
october 2009 by cshalizi
Lawrence D. Brown Fundamentals of statistical exponential families with applications in statistical decision theory
february 2009 by cshalizi
The classic monograph free online (scanned PDF, with the original UGLY pre-latex typography).
exponential_families
statistics
books:recommended
brown.lawrence
february 2009 by cshalizi
Journal of Statistical Software — Special Issue on Statnet
may 2008 by cshalizi
yay, free, statistically-sound software for modeling network structures!
network_data_analysis
computational_statistics
exponential_families
handcock.mark_s
hunter.david_r
butts.carter_t
goodreau.steven_m
morris.martina
bender-de_moll.skye
moody.james
may 2008 by cshalizi
Models for Longitudinal Network Data
april 2008 by cshalizi
Includes "actor-oriented" models in the general exponential family random graph framework.
snijders.tom
network_data_analysis
exponential_families
markov_models
to_teach:complexity-and-inference
april 2008 by cshalizi
The Role of Sufficiency and of Estimation in Thermodynamics (Mandelbrot, 1962)
february 2008 by cshalizi
Free reprint of paper earlier saved in JSTOR version. Zeroth law = conditional on a sufficient statistic, the parameter doesn't change the temperature, etc.
statistics
statistical_mechanics
exponential_families
sufficiency
gibbs_distributions
mandelbrot.benoit
february 2008 by cshalizi
The Role of Sufficiency and of Estimation in Thermodynamics (Mandelbrot, 1962)
february 2008 by cshalizi
_Very_ nice. Zeroth law = you only need to worry about sufficient statistics; etc.
statistics
statistical_mechanics
exponential_families
sufficiency
gibbs_distributions
mandelbrot.benoit
february 2008 by cshalizi
related tags
algebraic_statistics ⊕ anomaly_detection ⊕ approximation ⊕ arrow_of_time ⊕ barron.andrew ⊕ bender-de_moll.skye ⊕ books:recommended ⊕ brown.lawrence ⊕ butts.carter_t ⊕ causality ⊕ causal_inference ⊕ computational_statistics ⊕ density_estimation ⊕ estimation ⊕ expectation-maximization ⊕ exponential_families ⊖ exponential_family_random_graphs ⊕ fienberg.steve ⊕ functional_connectivity ⊕ geyer.charles ⊕ gibbs_distributions ⊕ goodreau.steven_m ⊕ graphical_models ⊕ handcock.mark_s ⊕ have_read ⊕ hunter.david_r ⊕ hypothesis_testing ⊕ information_theory ⊕ in_NB ⊕ janzing.dominik ⊕ jordan.michael_i. ⊕ kernel_methods ⊕ kith_and_kin ⊕ large_deviations ⊕ latent_variables ⊕ low-regret-learning ⊕ machine_learning ⊕ mandelbrot.benoit ⊕ markov_models ⊕ minimax ⊕ monte_carlo ⊕ moody.james ⊕ morris.martina ⊕ network_data_analysis ⊕ neural_coding_and_decoding ⊕ neural_data_analysis ⊕ non-stationarity ⊕ nonparametrics ⊕ occams_razor ⊕ online_learning ⊕ optimization ⊕ prediction ⊕ predictive_state_representations ⊕ probability ⊕ raginsky.maxim ⊕ random_fields ⊕ re:AoS_project ⊕ re:bayes_as_evol ⊕ re:naive-semi-supervised ⊕ re:XV_for_networks ⊕ re:your_favorite_ergm_sucks ⊕ regression ⊕ rinaldo.alessandro ⊕ self-promotion ⊕ semi-supervised_learning ⊕ sheu.chyong-hwa ⊕ snijders.tom ⊕ sparsity ⊕ statistical_inference_for_stochastic_processes ⊕ statistical_mechanics ⊕ statistics ⊕ sufficiency ⊕ thermodynamics ⊕ time_series ⊕ to:blog ⊕ to:NB ⊕ to_read ⊕ to_teach:complexity-and-inference ⊕ variational_methods ⊕ via:ded-maxim ⊕ wainwright.martin ⊕ willett.rebecca ⊕Copy this bookmark: