cshalizi + empirical_processes 38
[1205.3703] Generic chaining and the l1-penalty
11 days ago by cshalizi
"We address the choice of the tuning parameter $lambda$ in $ell_1$-penalized M-estimation. Our main concern is models which are highly nonlinear, such as the Gaussian mixture model. The number of parameters $p$ is moreover large, possibly larger than the number of observations $n$. The generic chaining technique of Talagrand[2005] is tailored for this problem. It leads to the choice $lambda asymp sqrt {log p / n}$, as in the standard Lasso procedure (which concerns the linear model and least squares loss)."
to:NB
to_read
statistics
empirical_processes
high-dimensional_statistics
van_de_geer.sara
11 days ago by cshalizi
[0805.1404] Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections
22 days ago by cshalizi
"Given an i.i.d. sample from a distribution $F$ on $mathbb{R}$ with uniformly continuous density $p_0$, purely data-driven estimators are constructed that efficiently estimate $F$ in sup-norm loss and simultaneously estimate $p_0$ at the best possible rate of convergence over H"older balls, also in sup-norm loss. The estimators are obtained by applying a model selection procedure close to Lepski's method with random thresholds to projections of the empirical measure onto spaces spanned by wavelets or $B$-splines. The random thresholds are based on suprema of Rademacher processes indexed by wavelet or spline projection kernels. This requires Bernstein-type analogs of the inequalities in Koltchinskii [Ann. Statist. 34 (2006) 2593-2656] for the deviation of suprema of empirical processes from their Rademacher symmetrizations."
to:NB
density_estimation
wavelets
splines
statistics
empirical_processes
22 days ago by cshalizi
[math/0612726] High Dimensional Probability
6 weeks ago by cshalizi
"About forty years ago it was realized by several researchers that the essential features of certain objects of Probability theory, notably Gaussian processes and limit theorems, may be better understood if they are considered in settings that do not impose structures extraneous to the problems at hand. For instance, in the case of sample continuity and boundedness of Gaussian processes, the essential feature is the metric or pseudometric structure induced on the index set by the covariance structure of the process, regardless of what the index set may be. This point of view ultimately led to the Fernique-Talagrand majorizing measure characterization of sample boundedness and continuity of Gaussian processes, thus solving an important problem posed by Kolmogorov. Similarly, separable Banach spaces provided a minimal setting for the law of large numbers, the central limit theorem and the law of the iterated logarithm, and this led to the elucidation of the minimal (necessary and/or sufficient) geometric properties of the space under which different forms of these theorems hold. However, in light of renewed interest in Empirical processes, a subject that has considerably influenced modern Statistics, one had to deal with a non-separable Banach space, namely $mathcal{L}_{infty}$. With separability discarded, the techniques developed for Gaussian processes and for limit theorems and inequalities in separable Banach spaces, together with combinatorial techniques, led to powerful inequalities and limit theorems for sums of independent bounded processes over general index sets, or, in other words, for general empirical processes."
to:NB
empirical_processes
probability
stochastic_processes
high-dimensional_probability
convergence_of_stochastic_processes
concentration_of_measure
6 weeks ago by cshalizi
[1203.1112] Continuous mapping approach to the asymptotics of U- and V-statistics
11 weeks ago by cshalizi
"We derive a new representation for U- and V-statistics. Using this representation the asymptotic distribution of U- and V-statistics can be derived by a direct application of the Continuous Mapping Theorem. That novel approach not only encompasses most of the results on the asymptotic distribution known in literature, but also allows for the first time a unifying treatment of non-degenerate and degenerate U- and V-statistics. Moreover, it yields a new and powerful tool to derive the asymptotic distribution of very general U- and V-statistics based on long-memory sequences. This will be exemplified by several astonishing examples. In particular, we shall present an example where weak convergence of the U or V-statistic occurs at the rate a_n^3 when a_n is the rate of weak convergence of the empirical process. We also introduce the notion of asymptotic (non-) degeneracy which often appears in the presence of long-memory sequences."
to:NB
statistics
statistical_inference_for_stochastic_processes
empirical_processes
11 weeks ago by cshalizi
[0810.5565] Limit Behaviour of Sequential Empirical Measure Processes
february 2012 by cshalizi
"In this paper, we obtain some uniform laws of large numbers and functional central limit theorems for sequential empirical measure processes indexed by classes of product functions satisfying appropriate Vapnik-Chervonenkis properties."
in_NB
empirical_processes
stochastic_processes
statistics
february 2012 by cshalizi
[1201.3569] Exponential Concentration Inequalities for Additive Functionals of Markov Chains
january 2012 by cshalizi
"Using the renewal approach we prove exponential inequalities for additive functionals and empirical processes of ergodic Markov chains, thus obtaining counterparts of inequalities for sums of independent random variables. The inequalities do not require functions of the chain to be bounded and moreover have all the constants accessible whenever the usual drift condition holds, which is crucial for practical applications e.g. in MCMC algorithms."
in_NB
stochastic_processes
empirical_processes
markov_models
concentration_of_measure
january 2012 by cshalizi
[1201.2256] Empirical Processes of Markov Chains and Dynamical Systems Indexed by Classes of Functions
january 2012 by cshalizi
"We study weak convergence of empirical processes of dependent data, indexed by classes of functions. We obtain results that are especially suitable for data arising from dynamical systems and Markov chains, where the Central Limit Theorem for partial sums is commonly derived via the spectral gap technique. Our results apply, e.g. to the empirical process of ergodic torus automorphisms."
in_NB
empirical_processes
stochastic_processes
markov_models
central_limit_theorem
dynamical_systems
january 2012 by cshalizi
[1111.3486] New Concentration Inequalities for Suprema of Empirical Processes
november 2011 by cshalizi
"While effective concentration inequalities for suprema of empirical processes exist under boundedness or strict tail assumptions, no comparable results have been available under considerably weaker assumptions. In this paper, we derive concentration inequalities assuming only low moments for an envelope of the empirical process. These concentration inequalities are beneficial even when the envelope is much larger than the single functions under consideration."
in_NB
concentration_of_measure
probability
empirical_processes
van_de_geer.sara
to_read
november 2011 by cshalizi
[1111.2450] The Bernstein-Orlicz norm and deviation inequalities
november 2011 by cshalizi
"We introduce two new concepts designed for the study of empirical processes. First, we introduce a new Orlicz norm which we call the Bernstein-Orlicz norm. This new norm interpolates sub-Gaussian and sub-exponential tail behavior. In particular, we show how this norm can be used to simplify the derivation of deviation inequalities for suprema of collections of random variables. Secondly, we introduce chaining and generic chaining along a tree. These simplify the well-known concepts of chaining and generic chaining. The supremum of the empirical process is then studied as a special case. We show that chaining along a tree can be done using entropy with bracketing. Finally, we establish a deviation inequality for the empirical process for the unbounded case."
in_NB
empirical_processes
deviation_bounds
probability
van_de_geer.sara
november 2011 by cshalizi
Generalization Bound for Infinitely Divisible Empirical Process
november 2011 by cshalizi
"In this paper, we study the generalization bound for an empirical process of samples independently drawn from an infinitely divisible (ID) distribution, which is termed as the ID empirical process. In particular, based on a martingale method, we develop deviation inequalities for the sequence of random variables of an ID distribution. By applying the obtained deviation inequalities, we then show the generalization bound for ID empirical process based on the annealed Vapnik- Chervonenkis (VC) entropy. Afterward, according to Sauer’s lemma, we get the generalization bound for ID empirical process based on the VC dimension. Finally, by using a resulted result bound, we analyze the asymptotic convergence of ID empirical process and show that the convergence rate of ID empirical process can reach O((frac{Lambda_mathcal{F}(2N)}{N})^{frac{1}{1.3}}) and it is faster than the results of the generic i.i.d. empirical process (Vapnik, 1999) "
in_NB
learning_theory
empirical_processes
stochastic_processes
levy_processes
martingales
re:almost_none
november 2011 by cshalizi
[1110.0963] An Empirical Process Central Limit Theorem for Multidimensional Dependent Data
october 2011 by cshalizi
"Let $(U_n(t))_{tinR^d}$ be the empirical process associated to an $R^d$-valued stationary process $(X_i)_{ige 0}$. We give general conditions, which only involve processes $(f(X_i))_{ige 0}$ for a restricted class of functions $f$, under which weak convergence of $(U_n(t))_{tinR^d}$ can be proved. This is particularly useful when dealing with data arising from dynamical systems or functional of Markov chains. This result improves those of [DDV09] and [DD11], where the technique was first introduced, and provides new applications."
empirical_processes
stochastic_processes
dynamical_systems
central_limit_theorem
in_NB
october 2011 by cshalizi
[1108.3886] On generic chaining and the smallest singular value of random matrices with heavy tails
august 2011 by cshalizi
"We present a very general chaining method which allows one to control the supremum of the empirical process $\sup_{h \in H} |N^{-1}\sum_{i=1}^N h^2(X_i)-\E h^2|$ in rather general situations. We use this method to establish two main results. First, a quantitative (non asymptotic) version of the classical Bai-Yin Theorem on the singular values of a random matrix with i.i.d entries that have heavy tails, and second, a sharp estimate on the quadratic empirical process when $H=\{\inr{t,\cdot} : t \in T\}$, $T \subset \R^n$ and $\mu$ is an isotropic, unconditional, log-concave measure."
empirical_processes
random_matrix_theory
probability
in_NB
august 2011 by cshalizi
[0908.3666] On the minimal penalty for Markov order estimation
august 2011 by cshalizi
I think this is the only time I have seen the LIL actually _used_ for anything.
van_handel.ramon
empirical_processes
model_selection
markov_models
statistics
stochastic_processes
law_of_the_iterated_logarithm
in_NB
august 2011 by cshalizi
[1107.3806] Asymptotics for minimisers of convex processes
july 2011 by cshalizi
From 1993: "By means of two simple convexity arguments we are able to develop a general method for proving consistency and asymptotic normality of estimators that are defined by minimisation of convex criterion functions. This method is then applied to a fair range of different statistical estimation problems, including Cox regression, logistic and Poisson regression, least absolute deviation regression outside model conditions, and pseudo-likelihood estimation for Markov chains. Our paper has two aims. The first is to exposit the method itself, which in many cases, under reasonable regularity conditions, leads to new proofs that are simpler than the traditional proofs. Our second aim is to exploit the method to its limits for logistic regression and Cox regression, where we seek asymptotic results under as weak regularity conditions as possible. For Cox regression in particular we are able to weaken previously published regularity conditions substantially."
statistics
estimation
pollard.david
hjort.nils_lid
empirical_processes
have_read
in_NB
july 2011 by cshalizi
Seijo , Sen : A continuous mapping theorem for the smallest argmax functional
may 2011 by cshalizi
"This paper introduces a version of the argmax continuous mapping theorem that applies to M-estimation problems in which the objective functions converge to a limiting process with multiple maximizers. The concept of the smallest maximizer of a function in the d-dimensional Skorohod space is introduced and its main properties are studied. The resulting continuous mapping theorem is applied to three problems arising in change-point regression analysis. Some of the results proved in connection to the d-dimensional Skorohod space are also of independent interest."
statistics
estimation
empirical_processes
may 2011 by cshalizi
Giné , Nickl : Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections
november 2010 by cshalizi
"Given an i.i.d. sample from a distribution F on ℝ with uniformly continuous density p0, purely data-driven estimators are constructed that efficiently estimate F in sup-norm loss and simultaneously estimate p0 at the best possible rate of convergence over Hölder balls, also in sup-norm loss. The estimators are obtained by applying a model selection procedure close to Lepski’s method with random thresholds to projections of the empirical measure onto spaces spanned by wavelets or B-splines. The random thresholds are based on suprema of Rademacher processes indexed by wavelet or spline projection kernels. This requires Bernstein-type analogs of the inequalities in Koltchinskii [Ann. Statist. 34 (2006) 2593–2656] for the deviation of suprema of empirical processes from their Rademacher symmetrizations."
learning_theory
density_estimation
empirical_processes
november 2010 by cshalizi
Bobkov , Götze : Concentration of empirical distribution functions with applications to non-i.i.d. models
november 2010 by cshalizi
"The concentration of empirical measures is studied for dependent data, whose joint distribution satisfies Poincaré-type or logarithmic Sobolev inequalities. The general concentration results are then applied to spectral empirical distribution functions associated with high-dimensional random matrices."
concentration_of_measure
empirical_processes
stochastic_processes
random_matrix_theory
november 2010 by cshalizi
[1010.0535] Asymptotic Normality of Support Vector Machines for Classification and Regression
october 2010 by cshalizi
"In nonparametric classification and regression problems, support vector machines (SVMs) attract much attention in theoretical and in applied statistics. In an abstract sense, SVMs can be seen as regularized M-estimators for a parameter in a (typically infinite dimensional) reproducing kernel Hilbert space. In the article, it is shown that the difference between the empirical SVM and the theoretical SVM is asymptotically normal with rate $\sqrt{n}$. That is, the standardized difference converges weakly to a Gaussian process in the reproducing kernel Hilbert space. This is done by an application of the functional delta-method and by showing that the SVM-functional is suitably Hadamard-differentiable."
empirical_processes
support_vector_machines
statistics
hilbert_space
gaussian_processes
october 2010 by cshalizi
[1009.0282] Empirical processes, typical sequences and coordinated actions in standard Borel spaces
september 2010 by cshalizi
"This paper proposes a new notion of typical sequences on a wide class of abstract alphabets (so-called standard Borel spaces), which is based on approximations of memoryless sources by empirical distributions uniformly over a class of measurable "test functions." In the finite-alphabet case, we can take all uniformly bounded functions and recover the usual notion of strong typicality (or typicality under the total variation distance). For a general alphabet, however, this function class turns out to be too large, and must be restricted. With this in mind, we define typicality with respect to any Glivenko-Cantelli function class (i.e., a function class that admits a Uniform Law of Large Numbers)..."
empirical_processes
large_deviations
measure_theory
method_of_types
raginsky.maxim
kith_and_kin
stochastic_processes
have_read
september 2010 by cshalizi
[1007.2964] The Gap Dimension and Uniform Laws of Large Numbers for Ergodic Processes
july 2010 by cshalizi
Sequel to their recent Annals of Probability paper, where they use the same trick to get convergence for function classes in terms of the gap (a.k.a. "fat shattering") dimension.
ergodic_theory
stochastic_processes
learning_theory
empirical_processes
statistics
statistical_inference_for_stochastic_processes
nobel.andrew
adams.terrence
have_read
july 2010 by cshalizi
Berti, Crimaldi, Pratelli, Rigo: Rate of convergence of predictive distributions for dependent data
january 2010 by cshalizi
Exchangeable sequences, rather than anything interesting, but worth looking at for ideas? ETA: Eh.
prediction
statistics
empirical_processes
stochastic_processes
have_read
learning_theory
statistical_inference_for_stochastic_processes
january 2010 by cshalizi
[0902.1448] Empirical spectral processes for locally stationary time series
january 2010 by cshalizi
"A time-varying empirical spectral process indexed by classes of functions is defined for locally stationary time series. We derive weak convergence in a function space, and prove a maximal exponential inequality and a Glivenko--Cantelli-type convergence result. The results use conditions based on the metric entropy of the index class. In contrast to related earlier work, no Gaussian assumption is made. As applications, quasi-likelihood estimation, goodness-of-fit testing and inference under model misspecification are discussed. In an extended application, uniform rates of convergence are derived for local Whittle estimates of the parameter curves of locally stationary time series models."
empirical_processes
fourier_analysis
time_series
statistical_inference_for_stochastic_processes
non-stationarity
january 2010 by cshalizi
The Likelihood Ratio Test Under Nonstandard Conditions
june 2009 by cshalizi
I very much like the approach of treating the likelihood ratio as an empirical process; why haven't I seen it before? (Also, the state-of-the-art in simulating Gaussian processes must be much better now than what Hansen was doing in '92, which would make this even more practical.)
empirical_processes
hypothesis_testing
statistics
likelihood_ratio_tests
econometrics
time_series
hansen.bruce
have_read
june 2009 by cshalizi
Statistics 210B: Theoretical Statistics
april 2009 by cshalizi
Michael Jordan's theoretical statistics class. Needless to say, I approve.
statistics
learning_theory
empirical_processes
machine_learning
asymptotics
estimation
via:mreid
jordan.michael_i.
april 2009 by cshalizi
related tags
adams.terrence ⊕ andrews.donald_w._k. ⊕ asymptotics ⊕ central_limit_theorem ⊕ concentration_of_measure ⊕ convergence_of_stochastic_processes ⊕ density_estimation ⊕ deviation_bounds ⊕ donskers_theorem ⊕ dynamical_systems ⊕ econometrics ⊕ empirical_processes ⊖ ergodic_theory ⊕ estimation ⊕ fourier_analysis ⊕ functional_central_limit_theorem ⊕ gaussian_processes ⊕ goodness-of-fit ⊕ hansen.bruce ⊕ have_read ⊕ high-dimensional_probability ⊕ high-dimensional_statistics ⊕ hilbert_space ⊕ hjort.nils_lid ⊕ hypothesis_testing ⊕ in_NB ⊕ jordan.michael_i. ⊕ kith_and_kin ⊕ large_deviations ⊕ law_of_the_iterated_logarithm ⊕ learning_theory ⊕ levy_processes ⊕ likelihood_ratio_tests ⊕ machine_learning ⊕ markov_models ⊕ martingales ⊕ measure_theory ⊕ method_of_types ⊕ mixing ⊕ model_selection ⊕ nobel.andrew ⊕ non-stationarity ⊕ pollard.david ⊕ prediction ⊕ probability ⊕ raginsky.maxim ⊕ random_matrix_theory ⊕ random_projections ⊕ re:almost_none ⊕ re:XV_for_mixing ⊕ re:your_favorite_dsge_sucks ⊕ splines ⊕ statistical_inference_for_stochastic_processes ⊕ statistics ⊕ stochastic_processes ⊕ support_vector_machines ⊕ time_series ⊕ to:NB ⊕ to_read ⊕ van_de_geer.sara ⊕ van_handel.ramon ⊕ vc-dimension ⊕ via:ded-maxim ⊕ via:mreid ⊕ via:shivak ⊕ wavelets ⊕Copy this bookmark: