mraginsky + optimization 94
"Approximate dynamic programming via Bellman inequalities" (Wang and Boyd)
6 weeks ago by mraginsky
"In this paper we introduce new methods for finding functions that lower bound
the value function of a stochastic control problem, using an iterated form of the Bellman inequality. Our method is based on solving linear or semidefinite programs, and
produces both a bound on the optimal objective, as well as a suboptimal policy that
appears to works very well. These results extend and improve bounds obtained by
authors in a previous paper using a single Bellman inequality condition. We describe
the methods in a general setting, and show how they can be applied in specific cases
including the finite state case, constrained linear quadratic control, switched affine
control, and multi-period portfolio investment."
papers
to-read
control-theory
dynamic-programming
optimization
the value function of a stochastic control problem, using an iterated form of the Bellman inequality. Our method is based on solving linear or semidefinite programs, and
produces both a bound on the optimal objective, as well as a suboptimal policy that
appears to works very well. These results extend and improve bounds obtained by
authors in a previous paper using a single Bellman inequality condition. We describe
the methods in a general setting, and show how they can be applied in specific cases
including the finite state case, constrained linear quadratic control, switched affine
control, and multi-period portfolio investment."
6 weeks ago by mraginsky
[1202.6258] A Stochastic Gradient Method with an Exponential Convergence Rate for Strongly-Convex Optimization with Finite Training Sets
march 2012 by mraginsky
We propose a new stochastic gradient method for optimizing the sum of a finite set of smooth functions, where the sum is strongly convex. While standard stochastic gradient methods converge at sublinear rates for this problem, the proposed method incorporates a memory of previous gradient values in order to achieve a linear convergence rate. Numerical experiments indicate that the new algorithm can dramatically outperform standard algorithms.
papers
to-read
optimization
convex-programming
march 2012 by mraginsky
"Online sequential optimization with biased gradients"
february 2012 by mraginsky
We study a class of stochastic optimization problems where though the objective functions
may not be convex, they satisfy a generalization of convexity, called the sequentially convex
property. We focus on a setting where the distribution of the underlying uncertainty is unknown and the manager must make a decision in real-time based on historical data. Since sequentially convex functions are not necessarily convex, they pose difficulties in applying standard adaptive methods for convex optimization. We propose a nonparametric algorithm based on a gradient descent method, and show that the T-season average expected cost differs from the minimum cost by at most O(1/sqrt(T)). Our analysis is based on a careful quantication of the bias that is inherent in gradient estimation due to the adaptive nature of the problem. We demonstrate the usefulness of the concept of sequential convexity by applying it to three canonical problems in inventory control, capacity allocation, and the lifetime buy decision, under the assumption that the manager does not know the demand distributions and has access only to historical sales (censored demand) data.
papers
to-read
online-learning
optimization
convex-programming
may not be convex, they satisfy a generalization of convexity, called the sequentially convex
property. We focus on a setting where the distribution of the underlying uncertainty is unknown and the manager must make a decision in real-time based on historical data. Since sequentially convex functions are not necessarily convex, they pose difficulties in applying standard adaptive methods for convex optimization. We propose a nonparametric algorithm based on a gradient descent method, and show that the T-season average expected cost differs from the minimum cost by at most O(1/sqrt(T)). Our analysis is based on a careful quantication of the bias that is inherent in gradient estimation due to the adaptive nature of the problem. We demonstrate the usefulness of the concept of sequential convexity by applying it to three canonical problems in inventory control, capacity allocation, and the lifetime buy decision, under the assumption that the manager does not know the demand distributions and has access only to historical sales (censored demand) data.
february 2012 by mraginsky
[1105.2274] Data-Distributed Weighted Majority and Online Mirror Descent
february 2012 by mraginsky
In this paper, we focus on the question of the extent to which online learning can benefit from distributed computing. We focus on the setting in which $N$ agents online-learn cooperatively, where each agent only has access to its own data. We propose a generic data-distributed online learning meta-algorithm. We then introduce the Distributed Weighted Majority and Distributed Online Mirror Descent algorithms, as special cases. We show, using both theoretical analysis and experiments, that compared to a single agent: given the same computation time, these distributed algorithms achieve smaller generalization errors; and given the same generalization errors, they can be $N$ times faster.
papers
to-read
machine-learning
online-learning
optimization
distributed-systems
february 2012 by mraginsky
Sebastien Bubeck, "Introduction to Online Optimization"
december 2011 by mraginsky
lecture notes for ORF570: Special Topics in Statistics and Operations Research, Prediction Games (Princeton)
lecture-notes
online-learning
optimization
prediction
december 2011 by mraginsky
Huber: On the Non-Optimality of Optimal Procedures
july 2011 by mraginsky
"This paper discusses some subtle, and largely overlooked, differences between conceptual and mathematical optimization goals in statistics, and illustrates them by examples."
papers
to-read
statistics
robust_statistics
optimization
polemics
july 2011 by mraginsky
[1106.4739] Nonasymptotic bounds on the estimation error of MCMC algorithms
june 2011 by mraginsky
"We address the problem of upper bounding the mean square error of MCMC estimators. Our analysis is non-asymptotic. We first establish a general result valid for essentially all ergodic Markov chains encountered in Bayesian computation and a possibly unbounded target function $f.$ The bound is sharp in the sense that the leading term is exactly $\asvar/n$, where $\asvar$ is the CLT asymptotic variance. Next, we proceed to specific assumptions and give explicit computable bounds for geometrically and polynomially ergodic Markov chains. As a corollary we provide results on confidence estimation."
papers
to-read
optimization
algorithms
MCMC
machine-learning
june 2011 by mraginsky
[1103.4296] Randomized Smoothing for Stochastic Optimization
march 2011 by mraginsky
"We analyze convergence rates of stochastic optimization procedures for non-smooth convex optimization problems. By combining randomized smoothing techniques with accelerated gradient methods, we obtain optimal variance-based convergence rates of stochastic optimization procedures, both in expectation and with high probability, To the best of our knowledge, these are the first variance-based rates for non-smooth optimization. We give several applications of our results to statistical estimation problems, and provide experimental results that demonstrate the effectiveness of the proposed algorithms. We also describe how a combination of our algorithm with recent work on decentralized optimization yields a distributed stochastic optimization algorithm that is order-optimal."
papers
to-read
optimization
convex-programming
march 2011 by mraginsky
[1102.4442] Internal Regret with Partial Monitoring. Calibration-Based Optimal Algorithms
february 2011 by mraginsky
"We provide consistent random algorithms for sequential decision under partial monitoring, i.e. when the decision maker does not observe the outcomes but receives instead random feedback signals. Those algorithms have no internal regret in the sense that, on the set of stages where the decision maker chose his action according to a given law, the average payoff could not have been improved in average by using any other fixed law. They are based on a generalization of calibration, no longer defined in terms of a Voronoi diagram but instead of a Laguerre diagram (a more general concept). This allows us to bound, for the first time in this general framework, the expected average internal -- as well as the usual external -- regret at stage $n$ by $O(n^{-1/3})$, which is known to be optimal."
papers
to-read
learning-theory
online-learning
game-theory
optimization
february 2011 by mraginsky
Decision Making in Large Scale Systems (Spring 2004)
january 2011 by mraginsky
This course is an introduction to the theory and application of large-scale dynamic programming. Topics include Markov decision processes, dynamic programming algorithms, simulation-based algorithms, theory and algorithms for value function approximation, and policy search methods. The course examines games and applications in areas such as dynamic resource allocation, finance and queueing networks.
lecture-notes
control-theory
dynamic-programming
optimization
via:stochastix
january 2011 by mraginsky
[1010.4207] Convex Analysis and Optimization with Submodular Functions: a Tutorial (Francis Bach)
november 2010 by mraginsky
"Set-functions appear in many areas of computer science and applied mathematics, such as machine learning, computer vision, operations research or electrical networks. Among these set-functions, submodular functions play an important role, similar to convex functions on vector spaces. In this tutorial, the theory of submodular functions is presented, in a self-contained way, with all results shown from first principles. A good knowledge of convex analysis is assumed."
papers
to-read
lecture-notes
convex-programming
optimization
machine-learning
algorithms
submodular-functions
november 2010 by mraginsky
[1011.1936] Blackwell Approachability and Low-Regret Learning are Equivalent
november 2010 by mraginsky
"We consider the celebrated Blackwell Approachability Theorem for two-player games with vector payoffs. We show that Blackwell's result is equivalent, via efficient reductions, to the existence of "no-regret" algorithms for Online Linear Optimization. Indeed, we show that any algorithm for one such problem can be efficiently converted into an algorithm for the other. We provide a useful application of this reduction: the first efficient algorithm for calibrated forecasting."
to-read
papers
online-learning
decision-making
game-theory
optimization
november 2010 by mraginsky
Subexponential Lower Bound for Randomized Pivot Rules! | Combinatorics and more
november 2010 by mraginsky
"Oliver Friedman, Thomas Dueholm Hansen, and Uri Zwick have managed to prove subexponential lower bounds of the form for ... two basic randomized pivot rules for the simplex algorithm! This is the first result of its kind and deciding if this is possible was an open problem for several decades." And they do it using MDPs!
papers
to-read
computer-science
computation
optimization
linear-programming
dynamic-programming
Markov-decision-processes
lower-bounds
november 2010 by mraginsky
[1009.3958] Approximate Inference and Stochastic Optimal Control
september 2010 by mraginsky
"We propose a novel reformulation of the stochastic optimal control problem as an approximate inference problem, demonstrating, that such a interpretation leads to new practical methods for the original problem. In particular we characterise a novel class of iterative solutions to the stochastic optimal control problem based on a natural relaxation of the exact dual formulation. These theoretical insights are applied to the Reinforcement Learning problem where they lead to new model free, off policy methods for discrete and continuous problems."
papers
to-read
control-theory
reinforcement-learning
optimization
information-theory
graphical-models
september 2010 by mraginsky
[1009.0679] Optimal Uncertainty Quantification
september 2010 by mraginsky
Hmm: "We propose a rigorous framework for Uncertainty Quantification (UQ) in which the UQ objectives and the assumptions/information set are brought to the forefront. This framework, which we call \emph{Optimal Uncertainty Quantification} (OUQ), is based on the observation that, given a set of assumptions and information about the problem, there exist optimal bounds on uncertainties: these are obtained as extreme values of well-defined optimization problems corresponding to extremizing probabilities of failure, or of deviations, subject to the constraints imposed by the scenarios compatible with the assumptions and information. In particular, this framework does not implicitly impose inappropriate assumptions, nor does it repudiate relevant information. ..."
papers
to-read
information-theory
decision-making
optimization
september 2010 by mraginsky
Niemiro: Asymptotics for $M$-Estimators Defined by Convex Minimization
september 2010 by mraginsky
Found this reference while reading Pollard and Hjort's "Asymptotics for minimisers of convex processes"
papers
to-read
statistics
convex-programming
optimization
september 2010 by mraginsky
Probability Collectives (David H. Wolpert)
july 2010 by mraginsky
"We recently proved that game theory and statistical physics are identical when cast in terms of information theory.
We call the associated formalism Probability Collectives (PC). PC opens many new lines of research, and provides new approaches to problems in distributed control and distributed optimization."
research
papers
reference
control-theory
distributed-systems
decision-making
game-theory
statistical-physics
optimization
We call the associated formalism Probability Collectives (PC). PC opens many new lines of research, and provides new approaches to problems in distributed control and distributed optimization."
july 2010 by mraginsky
Belief Propagation for Min-cost Network Flow: Convergence and Correctness
april 2010 by mraginsky
"...we present a simple modification of the BP algorithm which gives a fully polynomial-time randomized approximation scheme (FPRAS)...This is the first instance where BP is proved to have fully-polynomial running time."
papers
to-read
optimization
graphical-models
algorithms
complexity
via:shivak
april 2010 by mraginsky
Reasoning in Reduced Information Spaces
april 2010 by mraginsky
"...a website for research and coordination on Decentralized Techniques for Reasoning in Reduced Information Spaces (ONR 2009 Multi-disciplinary University Research Project)"
decision-making
decentralized-control
machine-learning
distributed-systems
AI
algorithms
optimization
april 2010 by mraginsky
Computational Aspects of Evolution
january 2010 by mraginsky
A course taught by Christos Papadimitriou
evolution
complexity
computation
computer-science
optimization
lecture-notes
algorithms
january 2010 by mraginsky
Sequences Project
january 2010 by mraginsky
Michael Steele's page on individual sequences, learning in games, etc.
reference
universal-prediction
machine-learning
statistics
game-theory
statistical-learning
optimization
via:mreid
january 2010 by mraginsky
Statistical Science
december 2009 by mraginsky
Issue on Interface of Probability and Algorithms
papers
to-read
statistics
probability
optimization
algorithms
computer-science
complexity
december 2009 by mraginsky
[0911.3357] Fundamentals of Large Sensor Networks: Connectivity, Capacity, Clocks and Computation
november 2009 by mraginsky
Nikolaos M. Freris, Hemant Kowshik, P. R. Kumar
papers
to-read
sensor-networks
distributed-systems
optimization
information-theory
algorithms
november 2009 by mraginsky
[0910.5460] Gibbs Measures and Phase Transitions on Sparse Random Graphs
november 2009 by mraginsky
lecture notes by Amir Dembo, Andrea Montanari
to-read
lecture-notes
reference
graph-theory
graphical-models
sparsity
statistical-physics
algorithms
combinatorics
optimization
november 2009 by mraginsky
[0910.2065v1] Decentralized Multi-Armed Bandit with Multiple Distributed Players (Keqin Liu, Qing Zhao)
november 2009 by mraginsky
From the abstract: "We consider multi-armed bandit with distributed players, where each player independently samples one of N stochastic processes with unknown parameters and accrues reward in each slot without information exchange. Users choosing the same arm collide, and none or only one receives reward depending on the collision model. This problem can be formulated as a decentralized multi-armed bandit problem."
papers
to-read
distributed-systems
algorithms
optimization
decision-making
decentralized-control
control-theory
november 2009 by mraginsky
Convex Games in Banach Spaces (Karthik Sridharan, Ambuj Tewari)
november 2009 by mraginsky
Technical Report TTIC-TR-2009-6, September 2009
papers
to-read
optimization
convex-programming
learning-theory
game-theory
geometric-functional-analysis
banach-spaces
filetype:pdf
media:document
november 2009 by mraginsky
[0910.4627] Self-concordant analysis for logistic regression
october 2009 by mraginsky
Francis Bach (INRIA Rocquencourt)
papers
have-read
statistics
learning-theory
optimization
algorithms
convex-programming
october 2009 by mraginsky
[0901.0633] Optimal control as a graphical model inference problem
september 2009 by mraginsky
B. Kappen, V. Gomez, M. Opper
papers
have-read
graphical-models
control-theory
optimization
september 2009 by mraginsky
[0908.4073] Distributed Averaging via Lifted Markov Chains
august 2009 by mraginsky
Kyomin Jung, Devavrat Shah, Jinwoo Shin
papers
to-read
distributed-computing
algorithms
optimization
graph-theory
information-theory
august 2009 by mraginsky
[0908.3108] Nonparametric estimation by convex programming
august 2009 by mraginsky
Anatoli B. Juditsky, Arkadi S. Nemirovski
papers
have-read
statistics
optimization
august 2009 by mraginsky
Estimation of Sparse Binary Pairwise Markov Networks using Pseudo-likelihoods
may 2009 by mraginsky
Holger Höfling, Robert Tibshirani; JMLR 10(Apr):883--906, 2009 (via cshalizi)
papers
have-read
statistics
machine-learning
graphical-models
optimization
algorithms
may 2009 by mraginsky
[0903.1468] Taking Advantage of Sparsity in Multi-Task Learning
march 2009 by mraginsky
Karim Lounici, Massimiliano Pontil, Alexandre B. Tsybakov, Sara van de Geer
papers
to-read
statistics
machine-learning
learning-theory
sparsity
optimization
march 2009 by mraginsky
[0809.2085] Clustered Multi-Task Learning: A Convex Formulation
march 2009 by mraginsky
Laurent Jacob, Francis Bach (INRIA Rocquencourt), Jean-Philippe Vert
papers
have-read
machine-learning
algorithms
optimization
march 2009 by mraginsky
related tags
active-learning ⊕ AI ⊕ algorithms ⊕ banach-spaces ⊕ bayesian ⊕ bayesian-learning ⊕ blogs ⊕ books ⊕ combinatorics ⊕ complexity ⊕ compressed-sensing ⊕ computation ⊕ computer-science ⊕ computer-vision ⊕ conferences ⊕ consensus-algorithms ⊕ control-theory ⊕ convex-programming ⊕ data-mining ⊕ decentralized-control ⊕ decision-making ⊕ distributed-computing ⊕ distributed-systems ⊕ dynamic-programming ⊕ dynamical-systems ⊕ economics ⊕ energy ⊕ environment ⊕ evolution ⊕ experimental-design ⊕ filetype:pdf ⊕ filtering ⊕ game-theory ⊕ geometric-functional-analysis ⊕ graph-theory ⊕ graphical-models ⊕ have-read ⊕ homepages ⊕ information-theory ⊕ learning-theory ⊕ lecture-notes ⊕ linear-programming ⊕ lower-bounds ⊕ machine-learning ⊕ markets ⊕ Markov-decision-processes ⊕ mathematics ⊕ matlab ⊕ MCMC ⊕ media:document ⊕ message-passing ⊕ model-selection ⊕ online-learning ⊕ optimal-search ⊕ optimization ⊖ papers ⊕ papes ⊕ people ⊕ planning ⊕ polemics ⊕ policy ⊕ prediction ⊕ preprints ⊕ probability ⊕ re:active_feature_selection_project ⊕ re:distributed_decisions_project ⊕ reference ⊕ reinforcement-learning ⊕ research ⊕ robust_statistics ⊕ sensor-networks ⊕ signal-processing ⊕ socialism ⊕ software ⊕ sparsity ⊕ statistical-learning ⊕ statistical-physics ⊕ statistics ⊕ submodular-functions ⊕ sustainability ⊕ to-read ⊕ universal-prediction ⊕ via:mreid ⊕ via:shivak ⊕ via:stochastix ⊕ workshops ⊕Copy this bookmark: