[1204.3941] A Log-Linear Graphical Model for Inferring Genetic Networks from High-Throughput Sequencing Data
5 weeks ago by cshalizi
"We develop a novel method for estimating high-dimensional Poisson graphical models, the Log-Linear Graphical Model, allowing us to infer networks based on high-throughput sequencing data. Our model assumes that conditional on all other genes, each gene is Poisson, jointly defining a pair-wise Poisson Markov random field. We estimate our genetic networks via neighborhood selection by fitting `1-norm penalized log-linear models, an approach we call the Poisson Graphical Lasso. Additionally, we develop a fast parallel algorithm, permitting us to fit our graphical models to high-dimensional genomic data sets."
in_NB
graphical_models
gene_expression_data_analysis
lasso
network_data_analysis
statistics
regression
5 weeks ago by cshalizi
Exact Covariance Thresholding into Connected Components for Large-Scale Graphical Lasso
7 weeks ago by cshalizi
"We consider the sparse inverse covariance regularization problem or graphical lasso with regularization parameter λ. Suppose the sample covariance graph formed by thresholding the entries of the sample covariance matrix at λ is decomposed into connected components. We show that the vertex-partition induced by the connected components of the thresholded sample covariance graph (at λ) is exactly equal to that induced by the connected components of the estimated concentration graph, obtained by solving the graphical lasso problem for the same λ. This characterizes a very interesting property of a path of graphical lasso solutions. Furthermore, this simple rule, when used as a wrapper around existing algorithms for the graphical lasso, leads to enormous performance gains. For a range of values of λ, our proposal splits a large graphical lasso problem into smaller tractable problems, making it possible to solve an otherwise infeasible large-scale problem. We illustrate the graceful scalability of our proposal via synthetic and real-life microarray examples."
--- I wonder whether this hasn't some application to the PC algorithm?
to:NB
graphical_models
lasso
sparsity
statistics
heard_the_talk
--- I wonder whether this hasn't some application to the PC algorithm?
7 weeks ago by cshalizi
Structured Sparsity and Generalization
7 weeks ago by cshalizi
"We present a data dependent generalization bound for a large class of regularized algorithms which implement structured sparsity constraints. The bound can be applied to standard squared-norm regularization, the Lasso, the group Lasso, some versions of the group Lasso with overlapping groups, multiple kernel learning and other regularization schemes. In all these cases competitive results are obtained. A novel feature of our bound is that it can be applied in an infinite dimensional setting such as the Lasso in a separable Hilbert space or multiple kernel learning with a countable number of kernels."
to:NB
learning_theory
regression
sparsity
statistics
lasso
7 weeks ago by cshalizi
Optimization with Sparsity-Inducing Penalties
12 weeks ago by cshalizi
"Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel selection. It turns out that many of the related estimation problems can be cast as convex optimization problems by regularizing the empirical risk with appropriate nonsmooth norms. The goal of this monograph is to present from a general perspective optimization tools and techniques dedicated to such sparsity-inducing penalties. We cover proximal methods, block-coordinate descent, reweighted l2-penalized techniques, working-set and homotopy methods, as well as non-convex formulations and extensions, and provide an extensive set of experiments to compare various algorithms from a computational point of view."
to:NB
statistics
machine_learning
optimization
sparsity
lasso
12 weeks ago by cshalizi
[0805.1179] Autoregressive Process Modeling via the Lasso Procedure
12 weeks ago by cshalizi
"The Lasso is a popular model selection and estimation procedure for linear models that enjoys nice theoretical properties. In this paper, we study the Lasso estimator for fitting autoregressive time series models. We adopt a double asymptotic framework where the maximal lag may increase with the sample size. We derive theoretical results establishing various types of consistency. In particular, we derive conditions under which the Lasso estimator for the autoregressive coefficients is model selection consistent, estimation consistent and prediction consistent. Simulation study results are reported."
to:NB
time_series
statistics
lasso
sparsity
variable_selection
kith_and_kin
heard_the_talk
rinaldo.alessandro
nardi.yuval
12 weeks ago by cshalizi
[0810.3177] Inferring sparse Gaussian graphical models with latent structure
february 2012 by cshalizi
"Our concern is selecting the concentration matrix's nonzero coefficients for a sparse Gaussian graphical model in a high-dimensional setting. This corresponds to estimating the graph of conditional dependencies between the variables. We describe a novel framework taking into account a latent structure on the concentration matrix. This latent structure is used to drive a penalty matrix and thus to recover a graphical model with a constrained topology. Our method uses an $ell_1$ penalized likelihood criterion. Inference of the graph of conditional dependencies between the variates and of the hidden variables is performed simultaneously in an iterative textsc{em}-like algorithm. The performances of our method is illustrated on synthetic as well as real data, the latter concerning breast cancer."
to:NB
graphical_models
lasso
sparsity
statistics
inference_to_latent_objects
february 2012 by cshalizi
Sun , Wang , Fang : Regularized k-means clustering of high-dimensional data and its asymptotic consistency
february 2012 by cshalizi
"K-means clustering is a widely used tool for cluster analysis due to its conceptual simplicity and computational efficiency. However, its performance can be distorted when clustering high-dimensional data where the number of variables becomes relatively large and many of them may contain no information about the clustering structure. This article proposes a high-dimensional cluster analysis method via regularized k-means clustering, which can simultaneously cluster similar observations and eliminate redundant variables. The key idea is to formulate the k-means clustering in a form of regularization, with an adaptive group lasso penalty term on cluster centers. In order to optimally balance the trade-off between the clustering model fitting and sparsity, a selection criterion based on clustering stability is developed. The asymptotic estimation and selection consistency of the regularized k-means clustering with diverging dimension is established. The effectiveness of the regularized k-means clustering is also demonstrated through a variety of numerical experiments as well as applications to two gene microarray examples. The regularized clustering framework can also be extended to the general model-based clustering."
in_NB
clustering
statistics
lasso
data_mining
to_teach:data-mining
february 2012 by cshalizi
[1201.0224] Estimation of Treatment Effects with High-Dimensional Controls
january 2012 by cshalizi
"We propose methods for inference on the average effect of a treatment on a scalar outcome in the presence of very many controls. Our setting is a partially linear regression model containing the treatment/policy variable and a large number $p$ of controls or series terms, with $p$ that is possibly much larger than the sample size $n$, but where only $s < n$ unknown controls or series terms are needed to approximate the regression function accurately. The latter sparsity condition makes it possible to estimate the entire regression function as well as the average treatment effect by selecting an approximately the right set of controls using Lasso and related methods. We develop estimation and inference methods for the average treatment effect in this setting, proposing a novel "post double selection" method that provides attractive inferential and estimation properties. In our analysis, in order to cover realistic applications, we expressly allow for imperfect selection of the controls and account for the impact of selection errors on estimation and inference. In order to cover typical applications in economics, we employ the selection methods designed to deal with non-Gaussian and heteroscedastic disturbances. We illustrate the use of new methods with numerical simulations and an application to the effect of abortion on crime rates."
to:NB
to_teach:undergrad-ADA
regression
causal_inference
lasso
sparsity
econometrics
instrumental_variables
hansen.christian
january 2012 by cshalizi
[1201.0220] Inference for High-Dimensional Sparse Econometric Models
january 2012 by cshalizi
"This article is about estimation and inference methods for high dimensional sparse (HDS) regression models in econometrics. High dimensional sparse models arise in situations where many regressors (or series terms) are available and the regression function is well-approximated by a parsimonious, yet unknown set of regressors. The latter condition makes it possible to estimate the entire regression function effectively by searching for approximately the right set of regressors. We discuss methods for identifying this set of regressors and estimating their coefficients based on $ell_1$-penalization and describe key theoretical results. In order to capture realistic practical situations, we expressly allow for imperfect selection of regressors and study the impact of this imperfect selection on estimation and inference results. We focus the main part of the article on the use of HDS models and methods in the instrumental variables model and the partially linear model. We present a set of novel inference results for these models and illustrate their use with applications to returns to schooling and growth regression."
to:NB
regression
sparsity
instrumental_variables
econometrics
to_teach:undergrad-ADA
lasso
hansen.christian
january 2012 by cshalizi
[1111.4416] Sparse Group Selection Through Co-Adaptive Penalties
november 2011 by cshalizi
"Recent work has focused on the problem of conducting linear regression when the number of covariates is very large, potentially greater than the sample size. To facilitate this, one useful tool is to assume that the model can be well approximated by a fit involving only a small number of covariates -- a so called sparsity assumption, which leads to the Lasso and other methods. In many situations, however, the covariates can be considered to be structured, in that the selection of some variables favours the selection of others -- with variables organised into groups entering or leaving the model simultaneously as a special case. This structure creates a different form of sparsity. In this paper, we suggest the Co-adaptive Lasso to fit models accommodating this form of `group sparsity'. The Co-adaptive Lasso is fast and simple to calculate, and we show that it holds theoretical advantages over the Lasso, performs well under a broad set of conclusions, and is very competitive in empirical simulations in comparison with previously suggested algorithms like the Group Lasso and the Adaptive Lasso."
to:NB
regression
sparsity
lasso
statistics
november 2011 by cshalizi
[1111.0559] Model Selection in Undirected Graphical Models with the Elastic Net
november 2011 by cshalizi
"Structure learning in random fields has attracted considerable attention due to its difficulty and importance in areas such as remote sensing, computational biology, natural language processing, protein networks, and social network analysis. We consider the problem of estimating the probabilistic graph structure associated with a Gaussian Markov Random Field (GMRF), the Ising model and the Potts model, by extending previous work on $l_1$ regularized neighborhood estimation to include the elastic net $l_1+l_2$ penalty. Additionally, we show numerical evidence that the edge density plays a role in the graph recovery process. Finally, we introduce a novel method for augmenting neighborhood estimation by leveraging pair-wise neighborhood union estimates."
in_NB
graphical_models
model_selection
lasso
sparsity
november 2011 by cshalizi
[1108.0775] Optimization with Sparsity-Inducing Penalties
august 2011 by cshalizi
"Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel selection. It turns out that many of the related estimation problems can be cast as convex optimization problems by regularizing the empirical risk with appropriate non-smooth norms. The goal of this paper is to present from a general perspective optimization tools and techniques dedicated to such sparsity-inducing penalties. We cover proximal methods, block-coordinate descent, reweighted $\ell_2$-penalized techniques, working-set and homotopy methods, as well as non-convex formulations and extensions, and provide an extensive set of experiments to compare various algorithms from a computational point of view."
learning_theory
sparsity
optimization
lasso
to:NB
machine_learning
statistics
august 2011 by cshalizi
[1107.0189] The Lasso, correlated design, and improved oracle inequalities
july 2011 by cshalizi
"We study high-dimensional linear models and the $\ell_1$-penalized least squares estimator, also known as the Lasso estimator. In literature, oracle inequalities have been derived under restricted eigenvalue or compatibility conditions. In this paper, we complement this with entropy conditions which allow one to improve the dual norm bound, and demonstrate how this leads to new oracle inequalities. The new oracle inequalities show that a smaller choice for the tuning parameter and a trade-off between $\ell_1$-norms and small compatibility constants are possible. This implies, in particular for correlated design, improved bounds for the prediction error of the Lasso estimator as compared to the methods based on restricted eigenvalue or compatibility conditions only."
lasso
regression
model_selection
van_de_geer.sara
july 2011 by cshalizi
[1106.5242] High Dimensional Sparse Econometric Models: An Introduction
june 2011 by cshalizi
I love how they just flat-out identify "econometrics" with "linear regression with Gaussian noise"; but it looks like a clean exposition with proofs.
regression
lasso
variable_selection
econometrics
june 2011 by cshalizi
Nonparametric Regression on a Graph
march 2011 by cshalizi
Sounds like they're putting a lasso-type penalty on shifts between adjacent points on a graph. How does this compare to Belkin-Niyogi style graph regularization?
regression
smoothing
network_data_analysis
lasso
to:NB
march 2011 by cshalizi
[1009.2302] The Predictive Lasso
september 2010 by cshalizi
"We propose a shrinkage procedure for simultaneous variable selection and estimation in generalized linear models (GLMs) with an explicit predictive motivation. The procedure estimates the coefficients by minimizing the Kullback-Leibler divergence of a set of predictive distributions to the corresponding predictive distributions for the full model, subject to an $l_1$ constraint on the coefficient vector. This results in selection of a parsimonious model with similar predictive performance to the full model. Thanks to its similar form to the original lasso problem for GLMs, our procedure can benefit from available $l_1$-regularization path algorithms. Simulation studies and real-data examples confirm the efficiency of our method in terms of predictive performance on future observations."
regression
lasso
variable_selection
sparsity
information_theory
statistics
september 2010 by cshalizi
[1004.2287] An empirical comparative study of approximate methods for binary graphical models; application to the search of associations among causes of death in French death certificates
april 2010 by cshalizi
"Looking for associations among multiple variables is a topical issue in statistics due to the increasing amount of data encountered in biology, medicine and many other domains involving statistical applications. Graphical models have recently gained popularity for this purpose in the statistical literature. Following the ideas of the LASSO procedure designed for the linear regression framework, recent developments dealing with graphical model selection have been based on $\ell_1$-penalization. In the binary case, however, exact inference is generally very slow or even intractable because of [the] log-partition function. Various approximate methods have recently been proposed in the literature ... Through an extensive simulation study, we show that a simple modification of a method relying on a Gaussian approximation achieves good performance and is very fast. We present a real application in which we search for associations among causes of death recorded on French death certificates."
graphical_models
lasso
model_selection
april 2010 by cshalizi
[1004.2304] Spatio-Temporal Graphical Model Selection
april 2010 by cshalizi
"We consider the problem of estimating the topology of spatial interactions in a discrete state, discrete time spatio-temporal graphical model where the interactions affect the temporal evolution of each agent in a network. Among other models, the susceptible, infected, recovered ($SIR$) model for interaction events fall into this framework. We pose the problem as a structure learning problem and solve it using an $\ell_1$-penalized likelihood convex program. We evaluate the solution on a simulated spread of infectious over a complex network. Our topology estimates outperform those of a standard spatial Markov random field graphical model selection using $\ell_1$-regularized logistic regression."
graphical_models
random_fields
lasso
model_selection
april 2010 by cshalizi
[0712.0881] On the "degrees of freedom" of the lasso
august 2009 by cshalizi
Reading the abstract and introduction makes me feel al of a sudden like I don't, in fact, understand the concept of "degrees of freedom". (I mean, I understand it in mechanics!)
lasso
regression
sparsity
degrees_of_freedom
statistics
estimation
model_selection
to_read
august 2009 by cshalizi
[0906.3590] Forest Garrote
june 2009 by cshalizi
We have got to do something about the nams of techniques in this area. I don't mind the whimsy, it's just that combinations like this don't work, metaphorically.
ensemble_methods
classifiers
statistics
machine_learning
sparsity
variable_selection
lasso
june 2009 by cshalizi
"Statistical Theory and Methods for Complex, High-Dimensional Data"
june 2009 by cshalizi
Loads of talks.
statistics
machine_learning
model_selection
graphical_models
regression
latent_variables
principal_components
factor_analysis
dimension_reduction
lasso
bioinformatics
track_down_references
via:shivak
june 2009 by cshalizi
[0901.3202] Model-Consistent Sparse Estimation through the Bootstrap
january 2009 by cshalizi
"if we run the Lasso for several bootstrapped replications of a given sample, then intersecting the supports of the Lasso bootstrap estimates leads to consistent model selection"
lasso
linear_regression
model_selection
variable_selection
bootstrap
january 2009 by cshalizi
[0901.2234] Sparse Causal Discovery in Multivariate Time Series
january 2009 by cshalizi
Umm. I'd have been a lot happier with this if they'd called it "sparse vector autoregression". Still, interesting and worth a second read.
time_series
causal_inference
regression
sparsity
lasso
january 2009 by cshalizi
[0812.5087] Estimating Time-Varying Networks
january 2009 by cshalizi
Heard Eric talk about this at the Dec. '08 SFI workshop. Very neat.
markov_models
logistic_regression
lasso
kith_and_kin
in_NB
network_data_analysis
kolar.mladen
song.le
xing.eric
ahmed.amr
january 2009 by cshalizi
related tags
ahmed.amr ⊕ bioinformatics ⊕ books:recommended ⊕ bootstrap ⊕ buhlmann.peter ⊕ causal_inference ⊕ classifiers ⊕ clustering ⊕ data_mining ⊕ degrees_of_freedom ⊕ dimension_reduction ⊕ econometrics ⊕ ensemble_methods ⊕ estimation ⊕ estimation_of_dynamical_systems ⊕ factor_analysis ⊕ gene_expression_data_analysis ⊕ geometry ⊕ graphical_models ⊕ hansen.christian ⊕ heard_the_talk ⊕ inference_to_latent_objects ⊕ information_theory ⊕ instrumental_variables ⊕ in_NB ⊕ ising_model ⊕ kith_and_kin ⊕ kolar.mladen ⊕ lasso ⊖ latent_variables ⊕ learning_theory ⊕ linear_regression ⊕ logistic_regression ⊕ machine_learning ⊕ markov_models ⊕ method_of_sieves ⊕ model_selection ⊕ nardi.yuval ⊕ network_data_analysis ⊕ optimization ⊕ oracle_inequalities ⊕ phase_transitions ⊕ principal_components ⊕ random_fields ⊕ regression ⊕ ridge_regression ⊕ rinaldo.alessandro ⊕ smoothing ⊕ song.le ⊕ sparsity ⊕ statistical_inference_for_stochastic_processes ⊕ statistics ⊕ stochastic_differential_equations ⊕ tibshirani.robert ⊕ time_series ⊕ to:NB ⊕ to_read ⊕ to_teach:data-mining ⊕ to_teach:undergrad-ADA ⊕ track_down_references ⊕ van_de_geer.sara ⊕ variable_selection ⊕ via:shivak ⊕ xing.eric ⊕Copy this bookmark: