cshalizi + filtering   24

[0810.2123] Forgetting of the initial distribution for non-ergodic Hidden Markov Chains
"In this paper, the forgetting of the initial distribution for a non-ergodic Hidden Markov Models (HMM) is studied. A new set of conditions is proposed to establish the forgetting property of the filter, which significantly extends all the existing results. Both a pathwise-type convergence of the total variation distance of the filter started from two different initial distributions, and a convergence in expectation are considered. The results are illustrated using generic models of non-ergodic HMM and extend all the results known so far."
to:NB  filtering  markov_models  state_estimation  stochastic_processes 
12 weeks ago by cshalizi
Online Learning with Hidden Markov Models
"We present an online version of the expectation-maximization (EM) algorithm for hidden Markov models (HMMs). The sufficient statistics required for parameters estimation is computed recursively with time, that is, in an online way instead of using the batch forward-backward procedure. This computational scheme is generalized to the case where the model parameters can change with time by introducing a discount factor into the recurrence relations. The resulting algorithm is equivalent to the batch EM algorithm, for appropriate discount factor and scheduling of parameters update. On the other hand, the online algorithm is able to deal with dynamic environments, i.e., when the statistics of the observed data is changing with time. The implications of the online algorithm for probabilistic modeling in neuroscience are briefly discussed."
to:NB  markov_models  filtering  state_estimation  statistics  em_algorithm 
february 2012 by cshalizi
[1202.2945] Sequential Monte Carlo smoothing for general state space hidden Markov models
"Computing smoothing distributions, the distributions of one or more states conditional on past, present, and future observations is a recurring problem when operating on general hidden Markov models. The aim of this paper is to provide a foundation of particle-based approximation of such distributions and to analyze, in a common unifying framework, different schemes producing such approximations. In this setting, general convergence results, including exponential deviation inequalities and central limit theorems, are established. In particular, time uniform bounds on the marginal smoothing error are obtained under appropriate mixing conditions on the transition kernel of the latent chain. In addition, we propose an algorithm approximating the joint smoothing distribution at a cost that grows only linearly with the number of particles."
to:NB  filtering  statistics  state_estimation  particle_filters  state-space_models  stochastic_processes  ergodic_theory  moulines.eric  douc.randal 
february 2012 by cshalizi
IEEE Xplore - Online Learning of Noisy Data
"We study online learning of linear and kernel-based predictors, when individual examples are corrupted by random noise, and both examples and noise type can be chosen adversarially and change over time. We begin with the setting where some auxiliary information on the noise distribution is provided, and we wish to learn predictors with respect to the squared loss. Depending on the auxiliary information, we show how one can learn linear and kernel-based predictors, using just 1 or 2 noisy copies of each example. We then turn to discuss a general setting where virtually nothing is known about the noise distribution, and one wishes to learn with respect to general losses and using linear and kernel-based predictors. We show how this can be achieved using a random, essentially constant number of noisy copies of each example. Allowing multiple copies cannot be avoided: Indeed, we show that the setting becomes impossible when only one noisy copy of each instance can be accessed. To obtain our results we introduce several novel techniques, some of which might be of independent interest."
to:NB  online_learning  filtering  kernel_methods  machine_learning  cesa-bianchi.nicolo  low-regret_learning 
december 2011 by cshalizi
[1111.6801] The direct L2 geometric structure on a manifold of probability densities with applications to Filtering
"In this paper we introduce a projection method for the space of probability distributions based on the differential geometric approach to statistics. This method is based on a direct L2 metric as opposed to the usual Hellinger distance and the related Fisher Information metric. We explain how this apparatus can be used for the nonlinear filtering problem, in relationship also to earlier projection methods based on the Fisher metric. Past projection filters focused on the Fisher metric and the exponential families that made the filter correction step exact. In this work we introduce the mixture projection filter, namely the projection filter based on the direct $L^2$ metric and based on a manifold given by a mixture of pre-assigned densities. The resulting prediction step in the filtering problem is described by a linear differential equation, while the correction step can be made exact."
in_NB  filtering  state_estimation  information_geometry  time_series 
december 2011 by cshalizi
Ergodicity of Hidden Markov Model - Mathematics of Control, Signals, and Systems (MCSS), Volume 17, Number 4
"In this paper we study ergodic properties of hidden Markov models with a generalized observation structure. In particular sufficient conditions for the existence of a unique invariant measure for the pair filter-observation are given. Furthermore, necessary and sufficient conditions for the existence of a unique invariant measure of the triple state-observation-filter are provided in terms of asymptotic stability in probability of incorrectly initialized filters. We also study the asymptotic properties of the filter and of the state estimator based on the observations as well as on the knowledge of the initial state. Their connection with minimal and maximal invariant measures is also studied."
in_NB  stochastic_processes  ergodic_theory  markov_models  filtering  re:almost_none 
october 2011 by cshalizi
[1108.3968] Online Expectation Maximization based algorithms for inference in hidden Markov models
"The Expectation Maximization (EM) algorithm is a versatile tool for model parameter estimation in latent data models. When processing large data sets or data stream however, EM becomes intractable since it requires the whole data set to be available at each iteration of the algorithm. In this contribution, a new generic online EM algorithm for model parameter inference in general Hidden Markov Model is proposed. This new algorithm updates the parameter estimate after a block of observations is processed (online). The convergence of this new algorithm is established, and the rate of convergence is studied showing the impact of the block size. An averaging procedure is also proposed to improve the rate of convergence. Finally, practical illustrations are presented as well as extensions to some online stochastic EM when Sequential Monte Carlo methods have to be used in combination, in order to make the E-step tractable."
filtering  expectation-maximization  markov_models  statistics  statistical_inference_for_stochastic_processes  in_NB 
august 2011 by cshalizi
Carvalho, Johannes, Lopes, Polson: Particle Learning and Smoothing
"Particle learning (PL) provides state filtering, sequential parameter learning and smoothing in a general class of state space models. Our approach extends existing particle methods by incorporating the estimation of static parameters via a fully-adapted filter that utilizes conditional sufficient statistics for parameters and/or states as particles. State smoothing in the presence of parameter uncertainty is also solved as a by-product of PL. In a number of examples, we show that PL outperforms existing particle filtering alternatives and proves to be a competitor to MCMC."
particle_filters  filtering  state-space_models  state_estimation  estimation  time_series  statistics 
august 2010 by cshalizi
Failures of sequential Bayesian filters and the successes of shadowing filters in tracking of nonlinear deterministic and stochastic systems
"Sequential Bayesian filters, such as particle filters, are often presented as an ideal means of tracking the state of nonlinear systems. Here shadowing filters are demonstrated to perform better than sequential filters at tracking under specific circumstances. The success of shadowing filters is attributed to avoiding both well-known deficiencies of particle filters, and some newly identified problems." Huh.
particle_filters  filtering  state_estimation  state-space_models  time_series 
july 2009 by cshalizi
CRAN - Package sspir
State-space modeling with linear/Gaussian state evolution and generalized linear models for the observations. Looks reasonable, lacks a few improvements like diffuse initial conditions in the Kalman filter.
state-space_models  time_series  R  filtering  state_estimation  to_teach 
february 2009 by cshalizi
On error-free filtering of finite-state singular processes under dependent distortions - Prelov and van der Meulen
When can the state of one process be recovered without error from another? (Use of infinite time limit here is not quite relevant to my immediate needs so must see how to modify proof.)
prelov.v._v.  van_der_meulen.e._c.  filtering  state_estimation  information_theory  re:AoS_project 
february 2008 by cshalizi
[0708.3412] Observability and nonlinear filtering
"This paper develops a connection between the asymptotic stability of nonlinear filters and a notion of observability. We consider a general class of hidden Markov models in continuous time with compact signal state space, and call such a model observable if no two initial measures of the signal process give rise to the same law of the observation process. We demonstrate that observability implies stability of the filter, i.e., the filtered estimates become insensitive to the initial measure at large times. For the special case where the signal is a finite-state Markov process and the observations are of the white noise type, a complete (necessary and sufficient) characterization of filter stability is obtained in terms of a slightly weaker detectability condition. In addition to observability, the role of controllability in filter stability is explored. Finally, the results are partially extended to non-compact signal state spaces."
in_NB  markov_models  filtering  re:AoS_project  van_handel.ramon  heard_the_talk 
november 2007 by cshalizi

related tags

apollo_project  books:noted  books:recommended  branching_processes  cai.t._tony  cesa-bianchi.nicolo  change-point_problem  control_theory  douc.randal  dynamical_systems  econometrics  em_algorithm  epidemic_models  ergodic_theory  estimation  expectation-maximization  extended_kalman_filter  filtering  fraser.andrew  have_read  heard_the_talk  hilbert_space  history_of_technology  hypothesis_testing  information_geometry  information_theory  in_NB  kalman_filter  kernel_methods  kith_and_kin  laplace_approximation  low-regret_learning  machine_learning  macroeconomics  markov_models  martingales  monte_carlo  moulines.eric  nasa  online_learning  particle_filters  parzen.emanuel  prelov.v._v.  R  re:almost_none  re:AoS_project  re:social-networks-as-sensor-networks  re:your_favorite_dsge_sucks  scientific_computing  self-centered  simulation  spatial_statistics  splines  state-space_models  state_estimation  statistical_inference_for_stochastic_processes  statistics  stochastic_processes  time_series  to:blog  to:NB  to_teach  to_teach:complexity-and-inference  van_der_meulen.e._c.  van_handel.ramon  via:flaxman  via:guslacerda  wheels:reinvention_of 

Copy this bookmark:



description:


tags: