mraginsky + decision-making   66

The Complexity of Exchange - Axtell - 2005 - The Economic Journal - Wiley Online Library
"The computational complexity of two classes of market mechanisms is compared. First the Walrasian interpretation in which prices are centrally computed by an auctioneer. Recent results on the computational complexity are reviewed. The non-polynomial complexity of these algorithms makes Walrasian general equilibrium an implausible conception. Second, a decentralised picture of market processes is described, involving concurrent exchange within transient coalitions of agents. These processes feature price dispersion, yield allocations that are not in the core, modify the distribution of wealth, are always stable, but path-dependent. Replacing the Walrasian framing of markets requires substantial revision of conventional wisdom concerning markets."
papers  to-read  economics  decision-making  via:cshalizi 
yesterday by mraginsky
Cognitive Democracy — Crooked Timber
"Over the last couple of years, Cosma Shalizi and I have been working together on various things, including, inter alia, the relationship between complex systems, democracy and the Internet. These are big unwieldy topics, and trying to think about them systematically is hard. Even so, we’ve gotten to the point where we at least feel ready to start throwing stuff at a wider audience, to get feedback on what works and what doesn’t. Here’s a paper we’re working on, which argues that we should (for some purposes at least), think of markets, hierarchy and democracy in terms of their capacity to solve complex collective problems, makes the case that democracy will on average do the job a lot better than the other two ways, and then looks at different forms of collective information processing on the Internet as experiments that democracies can learn from."
to-read  blogs  decision-making  institutions  markets  democracy  policy 
4 days ago by mraginsky
A Behavioral Defense of Rational Expectations
This paper studies decision making by agents who value optimism, but are unsure of their environment. As in Brunnermeier and Parker (2005), an agent’s optimism is assumed to be tempered by the decision costs it imposes. As in Hansen and Sargent (2008), an agent’s uncertainty about his environment leads him to formulate ‘robust’ decision rules. It is shown that when combined, these two considerations can lead agents to adhere to the Rational Expectations Hypothesis. Rather than being the outcome of the sophisticated statistical calculations of an impassive expected utility maximizer, Rational Expectations can instead be viewed as a useful approximation in environments where agents struggle to strike a balance between doubt and hope.
papers  to-read  rationality  economics  robust_control  decision-making  information-theory  decision-theory 
11 days ago by mraginsky
[1205.0858] Controlled Sensing for Multihypothesis Testing
"The problem of multiple hypothesis testing with observation control is considered in both fixed sample size and sequential settings. In the fixed sample size setting, for binary hypothesis testing, it is shown that the optimal exponent for the maximal error probability corresponds to the maximum Chernoff information over the choice of controls. It is also shown that a pure stationary open-loop control policy is asymptotically optimal within the larger class of all causal control policies. For multihypothesis testing in the fixed sample size setting, lower and upper bounds on the optimal error exponent are derived. It is also shown through an example with three hypotheses that the optimal causal control policy can be strictly better than the optimal open-loop control policy. In the sequential setting, a test based on earlier work by Chernoff for binary hypothesis testing, is shown to be first-order asymptotically optimal for multihypothesis testing for vanishing error probabilities. The role of past information and randomization in designing optimal control policies is discussed."
papers  to-read  information-theory  decision-making  feedback 
24 days ago by mraginsky
[1204.5721] Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that might give higher payoffs in the future. Although the study of bandit problems dates back to the Thirties, exploration-exploitation trade-offs arise in several modern applications, such as ad placement, website optimization, and packet routing. Mathematically, a multi-armed bandit is defined by the payoff process associated with each option. In this survey, we focus on two extreme cases in which the analysis of regret is particularly simple and elegant: i.i.d. payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, we also analyze some of the most important variants and extensions, such as the contextual bandit model.
papers  to-read  bandit-problems  online-learning  control-theory  dynamic-programming  decision-making 
4 weeks ago by mraginsky
[1204.2975] Multiple Objects: Error Exponents in Hypotheses Testing and Identification
"We servey [sic!] a series of investigations of optimal testing of multiple hypotheses conserning various multiobject models. These studies are a bright instance of application of methods and technics developed in Shannon information theory to solution of typical statistical problems."
papers  to-read  feedback-information-theory  statistics  active-learning  decision-making 
6 weeks ago by mraginsky
Entropy and the value of information for investors (Cabrales et al.)
Consider any investor who fears ruin facing any set of invest-
ments that satisfy no-arbitrage. Before investing, he can purchase informa-
tion about the state of nature in the form of an information structure. Given
his prior, information structure is more informative than information struc-
ture if whenever he rejects at some price, he also rejects at that price.
We show that this complete informativeness ordering is represented by the
decrease in entropy of his beliefs, regardless of his preferences, initial wealth
or investment problem. It is also shown that no prior-independent informa-
tiveness ordering based on similar premises exists.
papers  to-read  economics  decision-making  comparison-of-experiments  re:knightian-uncertainty 
8 weeks ago by mraginsky
Information Order in Decision and Agency Problems (Ian Jewitt)
This paper establishes information order in a number of special settings. We
discuss Lehmann’s condition, relate it to Blackwell’s and discuss its application in
statistical decision problems, certain Bayesian games and in principal agent problems. We distinguish between KR (Karlin Rubin) monotone preferences and single
crossing preferences and between Lehmann’s and Persico’s theorems. A generalisation of Lehmann’s condition is given that is applicable to situations where monotone
likelihood ratio does not apply. The principal agent problem is treated both via
the first-order approach and for the general finite action model. Points of contact
with Lehmann’s condition are identified in both cases.
papers  to-read  economics  decision-making  comparison-of-experiments  re:knightian-uncertainty 
8 weeks ago by mraginsky
"Signaling and uncertainty: a case study" (Castanon and Sandell)
"This paper studies the well-known counter example of Witsenhausen when the  initial  uncertainty is  small.  Using an asymptotic approach, it  is established that linear  strategies are  asymptotically optimal over a large class of nonlinear  strategies.  This  serves as a guideline for optimal solutions of non-classical problems with very noisy communication channels."
papers  to-read  decision-making  decentralized-control  communication 
december 2010 by mraginsky
[1011.1936] Blackwell Approachability and Low-Regret Learning are Equivalent
"We consider the celebrated Blackwell Approachability Theorem for two-player games with vector payoffs. We show that Blackwell's result is equivalent, via efficient reductions, to the existence of "no-regret" algorithms for Online Linear Optimization. Indeed, we show that any algorithm for one such problem can be efficiently converted into an algorithm for the other. We provide a useful application of this reduction: the first efficient algorithm for calibrated forecasting."
to-read  papers  online-learning  decision-making  game-theory  optimization 
november 2010 by mraginsky
Knightian Decision Theory I - Decisions in Economics and Finance, Volume 25, Number 2
"A theory of choice under uncertainty is proposed which removes the completeness assumption from the Anscombe–Aumann formulation of Savage's theory and introduces an inertia assumption. The inertia assumption is that there is such a thing as the status quo and an alternative is accepted only if it is preferred to the status quo. This theory is one way of giving rigorous expression to Frank Knight's distinction between risk and uncertainty."
to-read  papers  decision-making  economics 
september 2010 by mraginsky
[1009.0679] Optimal Uncertainty Quantification
Hmm: "We propose a rigorous framework for Uncertainty Quantification (UQ) in which the UQ objectives and the assumptions/information set are brought to the forefront. This framework, which we call \emph{Optimal Uncertainty Quantification} (OUQ), is based on the observation that, given a set of assumptions and information about the problem, there exist optimal bounds on uncertainties: these are obtained as extreme values of well-defined optimization problems corresponding to extremizing probabilities of failure, or of deviations, subject to the constraints imposed by the scenarios compatible with the assumptions and information. In particular, this framework does not implicitly impose inappropriate assumptions, nor does it repudiate relevant information. ..."
papers  to-read  information-theory  decision-making  optimization 
september 2010 by mraginsky
[1008.4654] Freezing and Sleeping: Tracking Experts that Learn by Evolving Past Posteriors
Hot damn! "... in this paper we re-examine Freund's problem in case the experts have internal structure that enables them to learn. In this case the problem has two possible interpretations: should the experts learn from all data or only from the subsequence on which they are being tracked? The MPP algorithm solves the first case. Our contribution is to generalise MPP to address the second option. The results we obtain apply to any expert structure that can be formalised using (expert) hidden Markov models. Curiously enough, for our interpretation there are \emph{two} natural reference schemes: freezing and sleeping. For each scheme, we provide an efficient prediction strategy and prove the relevant loss bound."
papers  to-read  online-learning  decision-making  machine-learning  algorithms 
august 2010 by mraginsky
Probability Collectives (David H. Wolpert)
"We recently proved that game theory and statistical physics are identical when cast in terms of information theory.

We call the associated formalism Probability Collectives (PC). PC opens many new lines of research, and provides new approaches to problems in distributed control and distributed optimization."
research  papers  reference  control-theory  distributed-systems  decision-making  game-theory  statistical-physics  optimization 
july 2010 by mraginsky
David Blackwell - Statistical Modeling, Causal Inference, and Social Science
A quote from Blackwell himself: "Basically, I'm not interested in doing research and I never have been, I'm interested in understanding, which is quite a different thing. And often to understand something you have to work it out yourself because no one else has done it."
people  statistics  research  probability  decision-making 
july 2010 by mraginsky
Generalizing Apprenticeship Learning across Hypothesis Classes .oO(ML Discuss)
"This paper develops a generalized apprenticeship learning protocol for reinforcement-learning agents with access to a teacher who provides policy traces (transition and reward observations). We characterize sufficient conditions of the underlying models for efficient apprenticeship learning and link this criteria to two established learnability classes (KWIK and Mistake Bound). We then construct efficient apprenticeship-learning algorithms in a number of domains, including two types of relational MDPs. We instantiate our approach in a software agent and a robot agent that learn effectively from a human teacher."
papers  to-read  machine-learning  reinforcement-learning  control-theory  decision-making 
july 2010 by mraginsky
[1006.4039] Cooperative Autonomous Online Learning
Abstract: "Online learning is becoming increasingly popular for training on large datasets. However, the sequential nature of online learning requires a centralized learner to store data and update parameters. In this paper, we consider a fully decentralized setting, cooperative autonomous online learning, with a distributed data source. The learners perform learning with local parameters while periodically communicating with a small subset of neighbors to exchange information. We define the regret in terms of an implicit aggregated parameter of the learners for such a setting and prove regret bounds similar to the classical sequential online learning." Damn, looks like I've been scooped!
papers  have-read  decision-making  online-learning  distributed-systems 
june 2010 by mraginsky
Bayesian Learning in Social Networks
Daron Acemoglu, Munther A. Dahleh, Ilan Lobel, Asuman Ozdaglar; NBER Working Paper No. 14040
papers  have-read  social-networks  bayesian-learning  game-theory  decision-making  distributed-systems 
april 2010 by mraginsky
Reasoning in Reduced Information Spaces
"...a website for research and coordination on Decentralized Techniques for Reasoning in Reduced Information Spaces (ONR 2009 Multi-disciplinary University Research Project)"
decision-making  decentralized-control  machine-learning  distributed-systems  AI  algorithms  optimization 
april 2010 by mraginsky
[0910.2065v1] Decentralized Multi-Armed Bandit with Multiple Distributed Players (Keqin Liu, Qing Zhao)
From the abstract: "We consider multi-armed bandit with distributed players, where each player independently samples one of N stochastic processes with unknown parameters and accrues reward in each slot without information exchange. Users choosing the same arm collide, and none or only one receives reward depending on the collision model. This problem can be formulated as a decentralized multi-armed bandit problem."
papers  to-read  distributed-systems  algorithms  optimization  decision-making  decentralized-control  control-theory 
november 2009 by mraginsky
Zero Intelligence Agents (Drew Conway's blog)
How can the social sciences, mathematics and computer science combine to affect national security policy?
blogs  decision-making  economics  forensics  politics  policy  security 
june 2008 by mraginsky

related tags

active-learning  adaptive-systems  AI  algorithms  bandit-problems  bayesian  bayesian-learning  biology  blogs  books  cells  collective-behavior  communication  communications  comparison-of-experiments  complex-systems  complexity  consensus-algorithms  control-theory  convex-programming  data-fusion  decentralized-control  decision-making  decision-theory  democracy  distributed-computing  distributed-systems  dynamic-programming  dynamical-systems  economics  evolution  experimental-design  feedback  feedback-information-theory  filetype:pdf  filtering  forensics  game-theory  graph-theory  have-read  homepages  information-theory  institutions  learning-theory  machine-learning  markets  media:document  multiagent-systems  neuroscience  online-learning  optimization  papers  people  planning  policy  politics  prediction  probability  psychology  quantization  rationality  re:active_feature_selection_project  re:causality_project  re:distributed_decisions_project  re:knightian-uncertainty  reference  reinforcement-learning  research  RIP  robust_control  security  sensor-networks  signal-processing  social-networks  socialism  statistical-learning  statistical-physics  statistics  submodular-functions  systems-biology  to-read  via:cshalizi  via:mreid 

Copy this bookmark:



description:


tags: