dvse + via:cshalizi 4
[1205.2265] Efficient Constrained Regret Minimization
17 days ago by dvse
"Online learning constitutes a mathematical framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the learner receives a reward for the played action. The goal of the learner is to maximize his total reward. However, there are situations in which, in addition to maximizing the cumulative reward, there are some additional constraints/goals on the sequence of decisions that must be satisfied by the learner. For example, in textit{online marketing}, simultaneously maximizing the cumulative reward and the number of buyers to take advantage of word-of-mouth advertising for future marketing seems to be a more ambitious goal than only maximizing cumulative reward. As another example, learning from costly expert advice captures more realistic settings than the original setting in applications such as routing in networks with power constraint. In this paper we study an extension to the online learning where the learner aims to maximize the total reward given that some additional constraints need to be satisfied. We propose Lagrangian exponentially weighted average (textbf{LEWA}) algorithm, an efficient algorithm to solve constrained online learning, which is a primal dual variant of the well known exponentially weighted average algorithm and inspired by the theory of Lagrangian method in constrained optimization. We establish the regret and the violation of the constraint bounds in full information and bandit feedback models."
online_learning
convex_optimization
via:cshalizi
17 days ago by dvse
Discovery of the Kalman Filter as a Practical Tool for Aerospace and Industry
february 2012 by dvse
History of the adoption of the Kalman filter in aero/astro
control_theory
state_estimation
kalman_filter
via:cshalizi
february 2012 by dvse
A Method of Handling Curvilinear Correlation for Any Number of Variables (Ezekiel, 1924)
february 2012 by dvse
Additive regression models from 1924, together with an algorithm which looks even more labour intensive than Whittaker graduation!
regression
additive_models
statistics
via:cshalizi
february 2012 by dvse
On a New Method of Graduation
february 2012 by dvse
Whittaker introduces 1D smoothing in 1922, complete with the Bayesian derivation. There is an earlier German paper with a similar model.
actuarial
splines
smoothing
regression
statistics
via:cshalizi
february 2012 by dvse
Copy this bookmark: