SF, big ideas, ideology: what is to be done? - Charlie's Diary
5 days ago
Too many good bits to quote just one.
science_fiction
literary_criticism
the_singularity_has_happened
stross.charlie
5 days ago
Gregory Jaczko to Resign as N.R.C. Chairman After Stormy Tenure - NYTimes.com
6 days ago
Not every day you read a story like this about a grad school classmate.
via:haslinger
6 days ago
The Literary Equivalent of Code That Performs Nothing | Beyond The Beyond | Wired.com
7 days ago
"I suspect that the literary equivalent of “code that performs nothing” is authorial “voice.” Literary null-objects are certain kinds of refined word-choices and grammatical stylings that convey the “message” of “I’m the author and you’re the reader.” One can skate along on this blather for quite a while without venturing to state much of anything; but “voice” keeps the channels open, and prevents the crash-state of the reader standing up and throwing the book against the wall."
literary_criticism
sterling.bruce
7 days ago
[1205.3845] Forecasting with Historical Data or Process Knowledge under Misspecification: A Comparison
8 days ago
"When faced with the task of forecasting a dynamic system, practitioners often have available historical data, knowledge of the system, or a combination of both. While intuition dictates that perfect knowledge of the system should in theory yield perfect forecasting, often knowledge of the system is only partially known, known up to parameters, or known incorrectly. In contrast, forecasting using previous data without any process knowledge might result in accurate prediction for simple systems, but will fail for highly nonlinear and chaotic systems. In this paper, the authors demonstrate how even in chaotic systems, forecasting with historical data is preferable to using process knowledge if this knowledge exhibits certain forms of misspecification. Through an extensive simulation study, a range of misspecification and forecasting scenarios are examined with the goal of gaining an improved understanding of the circumstances under which forecasting from historical data is to be preferred over using process knowledge."
to:NB
to_read
prediction
time_series
misspecification
re:growing_ensemble_project
8 days ago
Phys. Rev. Lett. 108, 200601 (2012): Number of Relevant Directions in Principal Component Analysis and Wishart Random Matrices
8 days ago
"We compute analytically, for large N, the probability P(N+,N) that a N×N Wishart random matrix has N+ eigenvalues exceeding a threshold Nζ, including its large deviation tails. This probability plays a benchmark role when performing the principal component analysis of a large empirical data set. We find that P(N+,N)≈exp[-βN2ψζ(N+/N)], where β is the Dyson index of the ensemble and ψζ(κ) is a rate function that we compute explicitly in the full range 0≤κ≤1 and for any ζ. The rate function ψζ(κ) displays a quadratic behavior modulated by a logarithmic singularity close to its minimum κ⋆(ζ). This is shown to be a consequence of a phase transition in an associated Coulomb gas problem. The variance Δ(N) of the number of relevant components is also shown to grow universally (independent of ζ) as Δ(N)∼(βπ2)-1lnN for large N."
to:NB
to_read
principal_components
large_deviations
random_matrices
stochastic_processes
high-dimensional_probability
re:g_paper
phase_transitions
8 days ago
Metro te Saluere lubert | Flickr - Photo Sharing!
9 days ago
"Wallsend, in North Tyneside, was the end of Hadrian's Wall, constructed by the Romans to defend the border of the Empire against the rebelling tribes beyond. To commemorate this historical connection (which is also reflected in the town's name), since 2003 most of the signs in Wallsend station on the Tyne and Wear Metro have been in both English and Latin. Large photographs taken in Wallsend which have all text changed to Latin are also displayed."
funny:geeky
photos
roman_empire
via:making_light
9 days ago
Lam , Yao : Factor modeling for high-dimensional time series: Inference for the number of factors
10 days ago
"This paper deals with the factor modeling for high-dimensional time series based on a dimension-reduction viewpoint. Under stationary settings, the inference is simple in the sense that both the number of factors and the factor loadings are estimated in terms of an eigenanalysis for a nonnegative definite matrix, and is therefore applicable when the dimension of time series is on the order of a few thousands. Asymptotic properties of the proposed method are investigated under two settings: (i) the sample size goes to infinity while the dimension of time series is fixed; and (ii) both the sample size and the dimension of time series go to infinity together. In particular, our estimators for zero-eigenvalues enjoy faster convergence (or slower divergence) rates, hence making the estimation for the number of factors easier. In particular, when the sample size and the dimension of time series go to infinity together, the estimators for the eigenvalues are no longer consistent. However, our estimator for the number of the factors, which is based on the ratios of the estimated eigenvalues, still works fine. Furthermore, this estimation shows the so-called “blessing of dimensionality” property in the sense that the performance of the estimation may improve when the dimension of time series increases. A two-step procedure is investigated when the factors are of different degrees of strength. Numerical illustration with both simulated and real data is also reported."
to:NB
dimension_reduction
factor_analysis
time_series
high-dimensional_statistics
inference_to_latent_objects
10 days ago
Wang , Phillips : A specification test for nonlinear nonstationary models
10 days ago
"We provide a limit theory for a general class of kernel smoothed U-statistics that may be used for specification testing in time series regression with nonstationary data. The test framework allows for linear and nonlinear models with endogenous regressors that have autoregressive unit roots or near unit roots. The limit theory for the specification test depends on the self-intersection local time of a Gaussian process. A new weak convergence result is developed for certain partial sums of functions involving nonstationary time series that converges to the intersection local time process. This result is of independent interest and is useful in other applications. Simulations examine the finite sample performance of the test."
to:NB
time_series
non-stationarity
model-checking
statistics
misspecification
10 days ago
Rigollet : Kullback–Leibler aggregation and misspecified generalized linear models
10 days ago
"In a regression setup with deterministic design, we study the pure aggregation problem and introduce a natural extension from the Gaussian distribution to distributions in the exponential family. While this extension bears strong connections with generalized linear models, it does not require identifiability of the parameter or even that the model on the systematic component is true. It is shown that this problem can be solved by constrained and/or penalized likelihood maximization and we derive sharp oracle inequalities that hold both in expectation and with high probability. Finally all the bounds are proved to be optimal in a minimax sense."
to:NB
regression
ensemble_methods
statistics
10 days ago
[1205.3703] Generic chaining and the l1-penalty
11 days ago
"We address the choice of the tuning parameter $lambda$ in $ell_1$-penalized M-estimation. Our main concern is models which are highly nonlinear, such as the Gaussian mixture model. The number of parameters $p$ is moreover large, possibly larger than the number of observations $n$. The generic chaining technique of Talagrand[2005] is tailored for this problem. It leads to the choice $lambda asymp sqrt {log p / n}$, as in the standard Lasso procedure (which concerns the linear model and least squares loss)."
to:NB
to_read
statistics
empirical_processes
high-dimensional_statistics
van_de_geer.sara
11 days ago
Phys. Rev. Lett. 108, 200403 (2012): Time Asymmetry of Probabilities Versus Relativistic Causal Structure: An Arrow of Time
12 days ago
"There is an incompatibility between the symmetries of causal structure in relativity theory and the signaling abilities of probabilistic devices with inputs and outputs: while time reversal in relativity will not introduce the ability to signal between spacelike separated regions, this is not the case for probabilistic devices with spacelike separated input-output pairs. We explicitly describe a nonsignaling device which becomes a perfect signaling device under time reversal, where time reversal can be conceptualized as playing backwards a videotape of an agent manipulating the device. This leads to an arrow of time that is identifiable when studying the correlations of events for spacelike separated regions. Somewhat surprisingly, although the time reversal of Popescu-Rohrlich boxes also allows agents to signal, it does not yield a perfect signaling device. Finally, we realize time reversal using postselection, which could to lead experimental implementation."
to:NB
causality
physics
relativity
arrow_of_time
to_read
12 days ago
[1205.3208] A New Family of Generalized 3D Cat Maps
12 days ago
"Since the 1990s chaotic cat maps are widely used in data encryption, for their very complicated dynamics within a simple model and desired characteristics related to requirements of cryptography. The number of cat map parameters and the map period length after discretization are two major concerns in many applications for security reasons. In this paper, we propose a new family of 36 distinctive 3D cat maps with different spatial configurations taking existing 3D cat maps [1]-[4] as special cases. Our analysis and comparisons show that this new 3D cat maps family has more independent map parameters and much longer averaged period lengths than existing 3D cat maps. The presented cat map family can be extended to higher dimensional cases."
(to_teach tags for clsses which use the cat map as an example)
to:NB
cat_map
dynamical_systems
cryptography
to_teach:complexity-and-inference
to_teach:statcomp
to_teach:undergrad-ADA
(to_teach tags for clsses which use the cat map as an example)
12 days ago
Earth and other unlikely worlds: News From Elsewhere
12 days ago
"But maybe somewhere on Titan there are dry salt pans, old, large, and very flat, like the Bonneville salt pans in Nevada, but composed of something like asphalt. Imagine the drag-racing possibilities..."
space_exploration
titan
12 days ago
Book Review: Direct Democracy Worldwide
12 days ago
"In his book Direct Democracy Worldwide, David Altman moves beyond the classic narratives of Greek city-states and New England town halls to demonstrate that this form of government is pertinent today despite its still relatively modest use at the national level. However, although some forms of direct democracy, particularly citizen initiatives, may enhance a larger representational context, others offer little opportunity for authentic popular voice. Direct democracy here is a tool, rather than a system, a tool that has the potential to be harnessed to refine the limitations of representation. Thus, Altman provides a rich evaluation of the possibilities for such input—a much needed addition to this literature—while initiating a longer term agenda for scholars of democracy.
"More historic understandings of direct democracy have offered a simplistic understanding of its use: Citizens gather in a common place, or through a ballot, and themselves determine the policy that will govern their polity. Yet Altman provokes the reader to consider a much more complex constellation of possibilities in his first chapter. Rather than consider direct democracy as a Weberian ideal type of political order, he effectively offers a vision of this process as a function within a larger representational system."
(etc., etc.)
book_reviews
track_down_references
democracy
political_science
re:democratic_cognition
"More historic understandings of direct democracy have offered a simplistic understanding of its use: Citizens gather in a common place, or through a ballot, and themselves determine the policy that will govern their polity. Yet Altman provokes the reader to consider a much more complex constellation of possibilities in his first chapter. Rather than consider direct democracy as a Weberian ideal type of political order, he effectively offers a vision of this process as a function within a larger representational system."
(etc., etc.)
12 days ago
Quantitative patterns of stylistic influence in the evolution of literature
13 days ago
"Literature is a form of expression whose temporal structure, both in content and style, provides a historical record of the evolution of culture. In this work we take on a quantitative analysis of literary style and conduct the first large-scale temporal stylometric study of literature by using the vast holdings in the Project Gutenberg Digital Library corpus. We find temporal stylistic localization among authors through the analysis of the similarity structure in feature vectors derived from content-free word usage, nonhomogeneous decay rates of stylistic influence, and an accelerating rate of decay of influence among modern authors. Within a given time period we also find evidence for stylistic coherence with a given literary topic, such that writers in different fields adopt different literary styles. This study gives quantitative support to the notion of a literary “style of a time” with a strong trend toward increasingly contemporaneous stylistic influence."
It'll be interesting to see how they handle the bias induced by selective retention.
to:NB
to_read
literary_history
text_mining
kith_and_kin
rockmore.dan
krakuer.david
It'll be interesting to see how they handle the bias induced by selective retention.
13 days ago
Archaeology as a social science
13 days ago
"Because of advances in methods and theory, archaeology now addresses issues central to debates in the social sciences in a far more sophisticated manner than ever before. Coupled with methodological innovations, multiscalar archaeological studies around the world have produced a wealth of new data that provide a unique perspective on long-term changes in human societies, as they document variation in human behavior and institutions before the modern era. We illustrate these points with three examples: changes in human settlements, the roles of markets and states in deep history, and changes in standards of living. Alternative pathways toward complexity suggest how common processes may operate under contrasting ecologies, populations, and economic integration."
to:NB
archaeology
social_science_methodology
13 days ago
Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation - Fearnhead - 2012 - Journal of the Royal Statistical Society: Series B (Statistical Methodology) - Wiley Online Library
13 days ago
"Many modern statistical applications involve inference for complex stochastic models, where it is easy to simulate from the models, but impossible to calculate likelihoods. Approximate Bayesian computation (ABC) is a method of inference for such models. It replaces calculation of the likelihood by a step which involves simulating artificial data for different parameter values, and comparing summary statistics of the simulated data with summary statistics of the observed data. Here we show how to construct appropriate summary statistics for ABC in a semi-automatic manner. We aim for summary statistics which will enable inference about certain parameters of interest to be as accurate as possible. Theoretical results show that optimal summary statistics are the posterior means of the parameters. Although these cannot be calculated analytically, we use an extra stage of simulation to estimate how the posterior means vary as a function of the data; and we then use these estimates of our summary statistics within ABC. Empirical results show that our approach is a robust method for choosing summary statistics that can result in substantially more accurate ABC analyses than the ad hoc choices of summary statistics that have been proposed in the literature. We also demonstrate advantages over two alternative methods of simulation-based inference."
to:NB
indirect_inference
estimation
statistics
approximate_bayesian_computation
computational_statistics
to_teach:complexity-and-inference
re:stacs
13 days ago
Estimating the Causal Effects of Social Interaction with Endogenous Networks
13 days ago
"Identifying causal effects attributable to network membership is a key challenge in empirical studies of social networks. In this article, we examine the consequences of endogeneity for inferences about the effects of networks on network members’ behavior. Using the House office lottery (in which newly elected members select their office spaces in a randomly chosen order) as an instrumental variable to estimate the causal impact of legislative networks on roll call behavior and cosponsorship decisions in the 105th–112th Houses, we find no evidence that office proximity affects patterns of legislative behavior. These results contrast with decades of congressional scholarship and recent empirical studies. Our analysis demonstrates the importance of accounting for selection processes and omitted variables in estimating the causal impact of networks."
to:NB
causal_inference
re:critique_of_diffusion
social_influence
congress
network_data_analysis
social_networks
homophily
re:homophily_and_confounding
13 days ago
[1205.2609] Which Spatial Partition Trees are Adaptive to Intrinsic Dimension?
13 days ago
"Recent theory work has found that a special type of spatial partition tree - called a random projection tree - is adaptive to the intrinsic dimension of the data from which it is built. Here we examine this same question, with a combination of theory and experiments, for a broader class of trees that includes k-d trees, dyadic trees, and PCA trees. Our motivation is to get a feel for (i) the kind of intrinsic low dimensional structure that can be empirically verified, (ii) the extent to which a spatial partition can exploit such structure, and (iii) the implications for standard statistical tasks such as regression, vector quantization, and nearest neighbor search."
to:NB
decision_trees
prediction
regression
statistics
dimension_reduction
machine_learning
13 days ago
[1205.2736] How Visibility and Divided Attention Constrain Social Contagion
13 days ago
"How far and how fast does information spread in social media? Researchers have recently examined a number of factors that affect information diffusion in online social networks, including: the novelty of information, users' activity levels, who they pay attention to, and how they respond to friends' recommendations. Using URLs as markers of information, we carry out a detailed study of retweeting, the primary mechanism by which information spreads on the Twitter follower graph. Our empirical study examines how users respond to an incoming stimulus, i.e., a tweet (message) from a friend, and reveals that %retweeting behavior is constrained by a few simple principles. the "principle of least effort" combined with limited attention plays a dominant role in retweeting behavior. Specifically, we observe that users retweet information when it is most visible, such as when it near the top of their Twitter stream. Moreover, our measurements quantify how a user's limited attention is divided among incoming tweets, providing novel evidence that highly connected individuals are less likely to propagate an arbitrary tweet. Our study indicates that the finite ability to process incoming information constrains social contagion, and we conclude that rapid decay of visibility is the primary barrier to information propagation online."
to:NB
social_contagion
networked_life
epidemiology_of_representations
13 days ago
Likelihood inference for discriminating between long-memory and change-point models - Yau - 2012 - Journal of Time Series Analysis - Wiley Online Library
13 days ago
"We develop a likelihood ratio (LR) test procedure for discriminating between a short-memory time series with a change-point (CP) and a long-memory (LM) time series. Under the null hypothesis, the time series consists of two segments of short-memory time series with different means and possibly different covariance functions. The location of the shift in the mean is unknown. Under the alternative, the time series has no shift in mean but rather is LM. The LR statistic is defined as the normalized log-ratio of the Whittle likelihood between the CP model and the LM model, which is asymptotically normally distributed under the null. The LR test provides a parametric alternative to the CUSUM test proposed by Berkes et al. (2006). Moreover, the LR test is more general than the CUSUM test in the sense that it is applicable to changes in other marginal or dependence features other than a change-in-mean. We show its good performance in simulations and apply it to two data examples."
to:NB
time_series
change-point_problem
long-range_dependence
statistics
to_teach:undergrad-ADA
hypothesis_testing
13 days ago
Unfogged: Look for it in the latest DSM
17 days ago
"There's a certain kind of insanity, not a feeling of not being oneself any longer, but a loss of a grip on what it would be to feel, or not feel, oneself at all, which is apt to be brought on at family reunions, especially those at which one encounters distant relatives for the first time. At such gatherings one gets a keen sense of the family "look", and the general family mode of behavior. One sees not only one's features return to one in the features of countless others; one sees, too, one's squabbles with one's siblings (say) reflected in the bickering of great-aunt so-and-so and her husband. There seems to vanish any sense of oneself as an individual, oneself as something substantial, rather than a congeries of free-floating traits that happen to be combined, here, in these proportions. But even this is too much; it's not as if one can say that one is so much of Cousin X in this respect, so much of Great-Aunt Y in this other respect, and so on, for X and Y stand revealed as having no more solidity in themselves than one oneself has (as we know, a family reunion is a system with no positive elements). Nor can one console oneself with the thought that at least one is the only one to have these traits in this precise proportion, because, even if that is true (and how can one be sure?) it is only accidental.
"At its most extreme, someone who suffers from this delusion will not only see himself as just a congeries of shared traits; he will also begin to see in every person one encounters nothing more than a collection of traits previously encountered in oneself or in others ... those others and that self in turn lacking any real substance. Nothing new under the sun, here or there, no potential for surprise; you've seen everything already.
"The name of this delusion is, of course, metempsychosis."
funny:geeky
moral_philosophy
personal_identity
wolfson.ben
"At its most extreme, someone who suffers from this delusion will not only see himself as just a congeries of shared traits; he will also begin to see in every person one encounters nothing more than a collection of traits previously encountered in oneself or in others ... those others and that self in turn lacking any real substance. Nothing new under the sun, here or there, no potential for surprise; you've seen everything already.
"The name of this delusion is, of course, metempsychosis."
17 days ago
Measures of mutual and causal dependence between two time series
17 days ago
"New measures are proposed for mutual and causal dependence between two time series, based on information theoretical ideas. The measure of mutual dependence is shown to be the sum of the measure of unidirectional causal dependence from the first time series to the second, the measure of unidirectional causal dependence from the second to the first, and the measure of instantaneous causal dependence. The measures are applicable to any kind of time series: continuous, discrete, or categorical."
to:NB
causality
information_theory
stochastic_processes
rissanen.jorma
via:coleman
17 days ago
[1205.2265] Efficient Constrained Regret Minimization
17 days ago
"Online learning constitutes a mathematical framework to analyze sequential decision making problems in adversarial environments. The learner repeatedly chooses an action, the environment responds with an outcome, and then the learner receives a reward for the played action. The goal of the learner is to maximize his total reward. However, there are situations in which, in addition to maximizing the cumulative reward, there are some additional constraints/goals on the sequence of decisions that must be satisfied by the learner. For example, in textit{online marketing}, simultaneously maximizing the cumulative reward and the number of buyers to take advantage of word-of-mouth advertising for future marketing seems to be a more ambitious goal than only maximizing cumulative reward. As another example, learning from costly expert advice captures more realistic settings than the original setting in applications such as routing in networks with power constraint. In this paper we study an extension to the online learning where the learner aims to maximize the total reward given that some additional constraints need to be satisfied. We propose Lagrangian exponentially weighted average (textbf{LEWA}) algorithm, an efficient algorithm to solve constrained online learning, which is a primal dual variant of the well known exponentially weighted average algorithm and inspired by the theory of Lagrangian method in constrained optimization. We establish the regret and the violation of the constraint bounds in full information and bandit feedback models."
to:NB
low-regret_learning
optimization
machine_learning
17 days ago
Neural Circuit Reconfiguration by Social Status
17 days ago
"The social rank of an animal is distinguished by its behavior relative to others in its community. Although social-status-dependent differences in behavior must arise because of differences in neural function, status-dependent differences in the underlying neural circuitry have only begun to be described. We report that dominant and subordinate crayfish differ in their behavioral orienting response to an unexpected unilateral touch, and that these differences correlate with functional differences in local neural circuits that mediate the responses. The behavioral differences correlate with simultaneously recorded differences in leg depressor muscle EMGs and with differences in the responses of depressor motor neurons recorded in reduced, in vitro preparations from the same animals. The responses of local serotonergic interneurons to unilateral stimuli displayed the same status-dependent differences as the depressor motor neurons. These results indicate that the circuits and their intrinsic serotonergic modulatory components are configured differently according to social status, and that these differences do not depend on a continuous descending signal from higher centers."
to:NB
neuroscience
experimental_biology
experimental_sociology
crustaceans
social_neuroscience
17 days ago
[1204.3863] The mechanics of stochastic slowdown in evolutionary games
17 days ago
"We study the stochastic dynamics of evolutionary games, and focus on the so-called `stochastic slowdown' effect, previously observed in (Altrock et. al, 2010) for simple evolutionary dynamics. Slowdown here refers to the fact that a beneficial mutation may take longer to fixate than a neutral one. More precisely, the fixation time conditioned on the mutant taking over can show a maximum at intermediate selection strength. We show that this phenomenon is present in the prisoner's dilemma, and also discuss counterintuitive slowdown and speedup in coexistence games. In order to establish the microscopic origins of these phenomena, we calculate the average sojourn times. This allows us to identify the transient states which contribute most to the slowdown effect, and enables us to provide an understanding of slowdown in the takeover of a small group of cooperators by defectors: Defection spreads quickly initially, but the final steps to takeover can be delayed significantly. The analysis of coexistence games reveals even more intricate behavior. In small populations, the conditional average fixation time can show multiple extrema as a function of the selection strength, e.g., slowdown, speedup, and slowdown again. We classify two-player games with respect to the possibility to observe non-monotonic behavior of the conditional average fixation time as a function of selection strength."
to:NB
evolutionary_game_theory
re:do-institutions-evolve
17 days ago
[1205.0241] Counterfactual Graphical Models for Mediation Analysis via Path-Specific Effects
17 days ago
"Potential outcome counterfactuals represent variation in the outcome of interest after a hypothetical treatment or intervention is performed. Causal graphical models are a concise, intuitive way of representing causal assumptions, including independence constraints among such counterfactuals. Much of modern causal inference is concerned with expressing cause effect relationships of interest in counterfactual form, showing how the resulting counterfactuals can be identified (that is expressed in terms of available data, using domain-specific causal assumptions), and subsequently estimated using statistical methods. In this paper we will use causal graphical models to analyze the identification problem of the so-called emph{path-specific effects}, that is effects of treatment on outcome along certain specified causal paths. Such effects arise in mediation analysis settings where it's important to distinguish direct and indirect effects of treatment. We review existing results on path-specific effects in the fully observable, static treatment setting, and extend them to settings with time-varying treatments, and latent variables."
to:NB
causal_inference
shpister.ilya
graphical_models
17 days ago
[1204.5421] Epidemics on a stochastic model of temporal network
17 days ago
"Contacts between individuals serve as pathways where infections may propagate. These contact patterns can be represented by network structures. Static structures have been the common modeling paradigm but recent results suggest that temporal structures play different roles to regulate the spread of infections or infection-like dynamics. On temporal networks a vertex is active only at certain moments and inactive otherwise such that a contact is not continuously available. In several empirical networks, the time between two consecutive vertex-activation events typically follows heterogeneous activity (e.g. bursts). In this chapter, we present a simple and intuitive stochastic model of a temporal network and investigate how epidemics co-evolves with the temporal structures, focusing on the growth dynamics of the epidemics. The model assumes no underlying topological structure and is only constrained by the time between two consecutive events of vertex activation. The main observation is that the speed of the infection spread is different in case of heterogeneous and homogeneous temporal patterns but the differences depend on the stage of the epidemics. In comparison to the homogeneous scenario, the power law case results in a faster growth in the beginning but turns out to be slower after a certain time, taking several time steps to reach the whole network."
to:NB
networks
epidemic_models
re:social-networks-as-sensor-networks
17 days ago
[1205.1828] The Natural Gradient by Analogy to Signal Whitening, and Recipes and Tricks for its Use
18 days ago
"The natural gradient allows for more efficient gradient descent by removing dependencies and biases inherent in a function's parameterization. Several papers present the topic thoroughly and precisely. It remains a very difficult idea to get your head around however. The intent of this note is to provide simple intuition for the natural gradient and its use. We review how an ill conditioned parameter space can undermine learning, introduce the natural gradient by analogy to the more widely understood concept of signal whitening, and present tricks and specific prescriptions for applying the natural gradient to learning problems."
Does this ever mention the phrase "Fisher information"?
to:NB
optimization
statistics
estimation
fisher_information
information_geometry
Does this ever mention the phrase "Fisher information"?
18 days ago
Cryfield Grange
18 days ago
Home sweet home for the next two weeks. (And yes, I'm bookmarking this so I'll have access to the address.)
warwick
travel
18 days ago
[1203.3504] On Measurement Bias in Causal Inference
18 days ago
"This paper addresses the problem of measurement errors in causal inference and highlights several algebraic and graphical methods for eliminating systematic bias induced by such errors. In particulars, the paper discusses the control of partially observable confounders in parametric and non parametric models and the computational problem of obtaining bias-free effect estimates in such models."
to:NB
causal_inference
inference_to_latent_objects
pearl.judea
to_teach:undergrad-ADA
statistics
error_in_variables
via:arthegall
18 days ago
Long story; short pier: Clews
18 days ago
"And the time ... in physics class, when we were doing these basic (very basic) labs on probability, and I had a little handheld pachinko machine? With a bunch of balls, and evenly spaced rods, and stalls at the bottom? And you tilt it down, and all the balls roll to the top, and you tilt it back, and they come cascading down, and hit the rods, and either bounce left or right, and in the end you’ve got this lovely little bell curve of balls at the bottom, because law of averages and such most balls bounce left, then right, then left, or some combination thereof, and end up in the middle? And only a few go left-left-left-left, or right-right-right-right, and end up on either end? —Anyway, it’s my turn, so I tilt it down, then back again, and click-clack-click-clack-click, and wouldn’t you know it, I’ve got an almost perfect reverse bell curve. Towering stacks of balls to the left and right, and almost nothing at all in the middle.
"So I go to the teacher running the show and hold it out to him and say, okay, now what, smart guy? (“If it fails to agree, under novel experiments or with refined measuring techniques, it is not said that one should not be happy.”)
"And the teacher looks at the little handheld pachinko machine, cocks an eyebrow, tilts it down, tilts it back, clack-click-clack-click-clack. Perfect bell curve.
"“There,” he says. “Fixed it for you.”
"—And I can’t for the life of me tell you which of those gestures is the argument with the universe, and which the sermon on the way things ought to be, dammit. —And that might just be my problem."
funny:geeky
probability
central_limit_theorem
to_teach
at_that_moment_the_student_was_enlightened
"So I go to the teacher running the show and hold it out to him and say, okay, now what, smart guy? (“If it fails to agree, under novel experiments or with refined measuring techniques, it is not said that one should not be happy.”)
"And the teacher looks at the little handheld pachinko machine, cocks an eyebrow, tilts it down, tilts it back, clack-click-clack-click-clack. Perfect bell curve.
"“There,” he says. “Fixed it for you.”
"—And I can’t for the life of me tell you which of those gestures is the argument with the universe, and which the sermon on the way things ought to be, dammit. —And that might just be my problem."
18 days ago
Schlozman, K. and Verba, S., Brady, H.: The Unheavenly Chorus: Unequal Political Voice and the Broken Promise of American Democracy.
18 days ago
"The Unheavenly Chorus is the first book to look at the political participation of individual citizens alongside the political advocacy of thousands of organized interests--membership associations such as unions, professional associations, trade associations, and citizens groups, as well as organizations like corporations, hospitals, and universities. Drawing on numerous in-depth surveys of members of the public as well as the largest database of interest organizations ever created--representing more than thirty-five thousand organizations over a twenty-five-year period--this book conclusively demonstrates that American democracy is marred by deeply ingrained and persistent class-based political inequality. The well educated and affluent are active in many ways to make their voices heard, while the less advantaged are not. This book reveals how the political voices of organized interests are even less representative than those of individuals, how political advantage is handed down across generations, how recruitment to political activity perpetuates and exaggerates existing biases, how political voice on the Internet replicates these inequalities--and more."
to:NB
inequality
democracy
us_politics
political_science
re:democratic_cognition
books:noted
18 days ago
Exploring Space: Don't Sell Robots Short : Uncertain Principles
19 days ago
"So, while the pro-human arguments based on science sound convincing at first, they're ultimately not that great, and fall back on the same sort of vague and airy platitudes as the general "inspiration" argument. And given the gigantic cost multiplier involved in sending humans rather than robots (or in addition to robots), I'd really like to see more than that."
space_exploration
robots_and_robotics
19 days ago
Lai , Huang , Lee : Fixed and random effects selection in nonparametric additive mixed models
19 days ago
"This paper considers the problem of model selection in a nonparametric additive mixed modeling framework. The fixed effects are modeled nonparametrically using truncated series expansions with B-spline basis. Estimation and selection of such nonparametric fixed effects are simultaneously achieved by using the adaptive group lasso methodology, while the random effects are selected by a traditional backward selection mechanism. To facilitate the automatic selection of model dimension, computable expressions for the degrees of freedom for both the fixed and random effects components are derived, and the Bayesian Information criterion (BIC) is used to select the final model choice. Theoretically it is shown that this BIC model selection method is consistent, while computationally a practical algorithm is developed for solving the optimization problem involved. Simulation results show that the proposed methodology is often capable of selecting the correct significant fixed and random effects components, especially when the sample size and/or signal to noise ratio are not too small. The new method is also applied to two real data sets."
to:NB
regression
additive_models
statistics
19 days ago
[1205.1406] Graph Prediction in a Low-Rank and Autoregressive Setting
19 days ago
"We study the problem of prediction for evolving graph data. We formulate the problem as the minimization of a convex objective encouraging sparsity and low-rank of the solution, that reflect natural graph properties. The convex formulation allows to obtain oracle inequalities and efficient solvers. We provide empirical results for our algorithm and comparison with competing methods, and point out two open questions related to compressed sensing and algebra of low-rank and sparse matrices."
to:NB
network_data_analysis
prediction
statistics
low-rank_approximation
19 days ago
Using Internet Data for Economic Research
20 days ago
"The data used by economists can be broadly divided into two categories. First, structured datasets arise when a government agency, trade association, or company can justify the expense of assembling records. The Internet has transformed how economists interact with these datasets by lowering the cost of storing, updating, distributing, finding, and retrieving this information. Second, some economic researchers affirmatively collect data of interest. For researcher-collected data, the Internet opens exceptional possibilities both by increasing the amount of information available for researchers to gather and by lowering researchers' costs of collecting information. In this paper, I explore the Internet's new datasets, present methods for harnessing their wealth, and survey a sampling of the research questions these data help to answer. The first section of this paper discusses "scraping" the Internet for data—that is, collecting data on prices, quantities, and key characteristics that are already available on websites but not yet organized in a form useful for economic research. A second part of the paper considers online experiments, including experiments that the economic researcher observes but does not control (for example, when Amazon or eBay alters site design or bidding rules); and experiments in which a researcher participates in design, including those conducted in partnership with a company or website, and online versions of laboratory experiments. Finally, I discuss certain limits to this type of data collection, including both "terms of use" restrictions on websites and concerns about privacy and confidentiality."
to:NB
economics
data_sets
web
re:your_favorite_dsge_sucks
20 days ago
Accurately estimating neuronal correlation requires a new spike-sorting paradigm
20 days ago
"Neurophysiology is increasingly focused on identifying coincident activity among neurons. Strong inferences about neural computation are made from the results of such studies, so it is important that these results be accurate. However, the preliminary step in the analysis of such data, the assignment of spike waveforms to individual neurons (“spike-sorting”), makes a critical assumption which undermines the analysis: that spikes, and hence neurons, are independent. We show that this assumption guarantees that coincident spiking estimates such as correlation coefficients are biased. We also show how to eliminate this bias. Our solution involves sorting spikes jointly, which contrasts with the current practice of sorting spikes independently of other spikes. This new “ensemble sorting” yields unbiased estimates of coincident spiking, and permits more data to be analyzed with confidence, improving the quality and quantity of neurophysiological inferences. These results should be of interest outside the context of neuronal correlations studies. Indeed, simultaneous recording of many neurons has become the rule rather than the exception in experiments, so it is essential to spike sort correctly if we are to make valid inferences about any properties of, and relationships between, neurons."
to:NB
heard_the_talk
neuroscience
neural_data_analysis
ventura.valerie
kith_and_kin
statistics
inference_to_latent_objects
20 days ago
Clarke , Clarke : Prediction in several conventional contexts
20 days ago
"We review predictive techniques from several traditional branches of statistics. Starting with prediction based on the normal model and on the empirical distribution function, we proceed to techniques for various forms of regression and classification. Then, we turn to time series, longitudinal data, and survival analysis. Our focus throughout is on the mechanics of prediction more than on the properties of predictors."
(to_teach tags are tentative.)
to:NB
prediction
statistics
classifiers
regression
to_teach:undergrad-ADA
to_teach:data-mining
(to_teach tags are tentative.)
20 days ago
No-Regret Learning and a Mechanism for Distributed Multiagent Planning
20 days ago
"We develop a novel mechanism for coordinated, distributed multiagent planning. We consider problems stated as a col- lection of single-agent planning problems coupled by com- mon soft constraints on resource consumption. (Resources may be real or fictitious, the latter introduced as a tool for factoring the problem). A key idea is to recast the dis- tributed planning problem as learning in a repeated game between the original agents and a newly introduced group of adversarial agents who influence prices for the resources. The adversarial agents benefit from arbitrage: that is, their incentive is to uncover violations of the resource usage con- straints and, by selfishly adjusting prices, encourage the original agents to avoid plans that cause such violations. If all agents employ no-regret learning algorithms in the course of this repeated interaction, we are able to show that our mechanism can achieve design goals such as social op- timality (efficiency), budget balance, and Nash-equilibrium convergence to within an error which approaches zero as the agents gain experience. In particular, the agents’ average plans converge to a socially optimal solution for the original planning task. We present experiments in a simulated net- work routing domain demonstrating our method’s ability to reliably generate sound plans."
online_learning
economics
markets_as_collective_calculating_devices
re:knightian_uncertainty
gordon.geoff
to:NB
low-regret_learning
20 days ago
Noahpinion: Down with particle physics, up with Big Energy Research!
20 days ago
Noah speaks hyperbolically, as usual, but I find myself agreeing more than is comfortable. (Or, to put it more optimistically: if we solve the energy problem, it will be, relative to society's resources, _much cheaper_ to build really powerful accelerators in a generation or three.)
science_policy
particle_physics
20 days ago
The Echo Park Time Travel Mart
21 days ago
Must get some of those posters.
funny:geeky
time_travel
feel-good_consumerism
via:minoli
21 days ago
Ehm , Gneiting : Local proper scoring rules of order two
21 days ago
"Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if it encourages truthful reporting. It is local of order k if the score depends on the predictive density only through its value and the values of its derivatives of order up to k at the realizing event. Complementing fundamental recent work by Parry, Dawid and Lauritzen, we characterize the local proper scoring rules of order 2 relative to a broad class of Lebesgue densities on the real line, using a different approach. In a data example, we use local and nonlocal proper scoring rules to assess statistically postprocessed ensemble weather forecasts."
to:NB
prediction
scoring_rules
statistics
gneiting.tilmann
21 days ago
Dawid , Lauritzen , Parry : Proper local scoring rules on discrete sample spaces
21 days ago
"A scoring rule is a loss function measuring the quality of a quoted probability distribution Q for a random variable X, in the light of the realized outcome x of X; it is proper if the expected score, under any distribution P for X, is minimized by quoting Q = P. Using the fact that any differentiable proper scoring rule on a finite sample space is the gradient of a concave homogeneous function, we consider when such a rule can be local in the sense of depending only on the probabilities quoted for points in a nominated neighborhood of x. Under mild conditions, we characterize such a proper local scoring rule in terms of a collection of homogeneous functions on the cliques of an undirected graph on the space . A useful property of such rules is that the quoted distribution Q need only be known up to a scale factor. Examples of the use of such scoring rules include Besag’s pseudo-likelihood and Hyvärinen’s method of ratio matching."
to:NB
prediction
scoring_rules
statistics
lauritzen.steffen
dawid.philip
21 days ago
Parry , Dawid , Lauritzen : Proper local scoring rules
21 days ago
"We investigate proper scoring rules for continuous distributions on the real line. It is known that the log score is the only such rule that depends on the quoted density only through its value at the outcome that materializes. Here we allow further dependence on a finite number m of derivatives of the density at the outcome, and describe a large class of such m-local proper scoring rules: these exist for all even m but no odd m. We further show that for m ≥ 2 all such m-local rules can be computed without knowledge of the normalizing constant of the distribution."
to:NB
prediction
scoring_rules
lauritzen.steffen
dawid.philip
statistics
21 days ago
The case of the 500-mile email
22 days ago
I guessed what the cause must've been, and this was still hilarious. (Also, it's old enough that I might have heard it before.) Still, this is great.
funny:geeky
networked_life
via:wilkins
22 days ago
[0805.1404] Adaptive estimation of a distribution function and its density in sup-norm loss by wavelet and spline projections
22 days ago
"Given an i.i.d. sample from a distribution $F$ on $mathbb{R}$ with uniformly continuous density $p_0$, purely data-driven estimators are constructed that efficiently estimate $F$ in sup-norm loss and simultaneously estimate $p_0$ at the best possible rate of convergence over H"older balls, also in sup-norm loss. The estimators are obtained by applying a model selection procedure close to Lepski's method with random thresholds to projections of the empirical measure onto spaces spanned by wavelets or $B$-splines. The random thresholds are based on suprema of Rademacher processes indexed by wavelet or spline projection kernels. This requires Bernstein-type analogs of the inequalities in Koltchinskii [Ann. Statist. 34 (2006) 2593-2656] for the deviation of suprema of empirical processes from their Rademacher symmetrizations."
to:NB
density_estimation
wavelets
splines
statistics
empirical_processes
22 days ago
Testing parametric conditional distributions using the nonparametric smoothing method
22 days ago
"This paper proposes a new goodness-of-fit test for parametric conditional probability distributions using the nonparametric smoothing methodology. An asymptotic normal distribution is established for the test statistic under the null hypothesis of correct specification of the parametric distribution. The test is shown to have power against local alternatives converging to the null at certain rates. The test can be applied to testing for possible misspecifications in a wide variety of parametric models. A bootstrap procedure is provided for obtaining more accurate critical values for the test. Monte Carlo simulations show that the test has good power against some common alternatives."
to:NB
misspecification
density_estimation
smoothing
statistics
to_teach:undergrad-ADA
22 days ago
Nina Strohminger reviews _The Meaning of Disgust_ (Colin McGinn)
23 days ago
This is one of the most beautifully annihilating book reviews I have ever seen, and I say that with deep jealousy.
book_reviews
moral_psychology
emotion
disgust
moral_philosophy
evisceration
mcginn.colin
strohminger.nina
via:themonkeycage
23 days ago
[1204.6441] "I Wanted to Predict Elections with Twitter and all I got was this Lousy Paper" -- A Balanced Survey on Election Prediction using Twitter Data
24 days ago
"Predicting X from Twitter is a popular fad within the Twitter research subculture. It seems both appealing and relatively easy. Among such kind of studies, electoral prediction is maybe the most attractive, and at this moment there is a growing body of literature on such a topic. This is not only an interesting research problem but, above all, it is extremely difficult. However, most of the authors seem to be more interested in claiming positive results than in providing sound and reproducible methods. It is also especially worrisome that many recent papers seem to only acknowledge those studies supporting the idea of Twitter predicting elections, instead of conducting a balanced literature review showing both sides of the matter. After reading many of such papers I have decided to write such a survey myself. Hence, in this paper, every study relevant to the matter of electoral prediction using social media is commented. From this review it can be concluded that the predictive power of Twitter regarding elections has been greatly exaggerated, and that hard research problems still lie ahead."
to:NB
social_media
data_mining
prediction
have_read
24 days ago
Brad DeLong: The Macroeconomic Equiibrium-Restoring Forces of the Market Are Nowheresville:
24 days ago
"Thus I find myself becoming more Paleokeynesian by the hour, as the world keeps hitting me on the head with a brick, pausing after each blow to say: "Now do you understand?!" "
economics
macroeconomics
financial_crisis_of_2007--
delong.brad
mugged_by_reality
24 days ago
Amanda Palmer, Kickstarter, and Everything – Whatever
24 days ago
"In sum: It’s awesome that Palmer’s Kickstarter has done so well — but look at what it’s entailed. It’s entailed time, effort, planning and work both backward and forward in time. That currently $439,000 isn’t a windfall for her; it’s a marker of what all that commitment to the work has earned.
"If you’re one of the people looking at her Kickstarter money with stars in your eyes and awesome plans of your own in your head, ask yourself first: Have you put in the time? Earned the credibility? Scoped out the financial balance sheet? Made the commitment to fulfill every single thing you have promised? Palmer has. If you haven’t — on any of this — be aware that your results, shall we say, may vary."
(The earlier part, about the very long process of reputation- and network- building which got Palmer to this point, is quite astute, but too long to excerpt.)
kickstarter
art
networked_life
palmer.amanda
music
scalzi.john
"If you’re one of the people looking at her Kickstarter money with stars in your eyes and awesome plans of your own in your head, ask yourself first: Have you put in the time? Earned the credibility? Scoped out the financial balance sheet? Made the commitment to fulfill every single thing you have promised? Palmer has. If you haven’t — on any of this — be aware that your results, shall we say, may vary."
(The earlier part, about the very long process of reputation- and network- building which got Palmer to this point, is quite astute, but too long to excerpt.)
24 days ago
The Success of Stack Exchange: Crowdsourcing + Reputation Systems « Permutations
24 days ago
Odd that he doesn't mention slashdot. (Not odd that he doesn't mention Sterling's _Distraction_, with its rival confederations of nomadic biker-gangs, oriented around competing reputation systems.)
networked_life
internet
social_life_of_the_mind
reputation_systems
re:democratic_cognition
24 days ago
Uncovering Structure in High-Dimensions: Networks and Multi-task Learning Problems
25 days ago
"Extracting knowledge and providing insights into complex mechanisms underlying noisy high-dimensional data sets is of utmost importance in many scientific domains. Statistical modeling has become ubiquitous in the analysis of high-dimensional functional data in search of better understanding of cognition mechanisms, in the exploration of large-scale gene regulatory networks in hope of developing drugs for lethal diseases, and in prediction of volatility in stock market in hope of beating the market. Statistical analysis in these high-dimensional data sets is possible only if an estimation procedure exploits hidden structures underlying data.
"This thesis develops flexible estimation procedures with provable theoretical guarantees for uncovering unknown hidden structures underlying data generating process. Of particular interest are procedures that can be used on high-dimensional data sets where the number of samples n much smaller than the ambient dimension p. Learning in high-dimensions is difficult due to the curse of dimensionality, however, the special problem structure makes inference possible. Due to its importance for scientific discovery, we put emphasis on consistent structure recovery throughout the thesis. Particular focus is given to two important problems, semi-parametric estimation of networks and feature selection in multi-task learning."
to_read
network_data_analysis
machine_learning
high-dimensional_statistics
kolar.mladen
kith_and_kin
relational_learning
"This thesis develops flexible estimation procedures with provable theoretical guarantees for uncovering unknown hidden structures underlying data generating process. Of particular interest are procedures that can be used on high-dimensional data sets where the number of samples n much smaller than the ambient dimension p. Learning in high-dimensions is difficult due to the curse of dimensionality, however, the special problem structure makes inference possible. Due to its importance for scientific discovery, we put emphasis on consistent structure recovery throughout the thesis. Particular focus is given to two important problems, semi-parametric estimation of networks and feature selection in multi-task learning."
25 days ago
Lost in Transcription: Blue-eyed-people-are-all-related zombie news
25 days ago
"So, to recap, 1) Cool paper. 2) Sex between blue-eyed people is not incest. 3) We have no idea when or where this mutation came from, but it is now conceivable that we could ask the question. 4) Embarrassingly bad science reporting spontaneously rises from the grave four years later and tries to eat your brain. "
human_genetics
historical_genetics
bad_science_journalism
why_oh_why_cant_we_have_a_better_press_corps
wilkins.jon
25 days ago
Towards Integrative Causal Analysis of Heterogeneous Data Sets and Studies
25 days ago
"We present methods able to predict the presence and strength of conditional and unconditional dependencies (correlations) between two variables Y and Z never jointly measured on the same samples, based on multiple data sets measuring a set of common variables. The algorithms are specializations of prior work on learning causal structures from overlapping variable sets. This problem has also been addressed in the field of statistical matching. The proposed methods are applied to a wide range of domains and are shown to accurately predict the presence of thousands of dependencies. Compared against prototypical statistical matching algorithms and within the scope of our experiments, the proposed algorithms make predictions that are better correlated with the sample estimates of the unknown parameters on test data ; this is particularly the case when the number of commonly measured variables is low.
"The enabling idea behind the methods is to induce one or all causal models that are simultaneously consistent with (fit) all available data sets and prior knowledge and reason with them. This allows constraints stemming from causal assumptions (e.g., Causal Markov Condition, Faithfulness) to propagate. Several methods have been developed based on this idea, for which we propose the unifying name Integrative Causal Analysis (INCA). A contrived example is presented demonstrating the theoretical potential to develop more general methods for co-analyzing heterogeneous data sets. The computational experiments with the novel methods provide evidence that causally-inspired assumptions such as Faithfulness often hold to a good degree of approximation in many real systems and could be exploited for statistical inference. Code, scripts, and data are available at www.mensxmachina.org."
to:NB
to_read
causal_inference
graphical_models
to_teach:undergrad-ADA
"The enabling idea behind the methods is to induce one or all causal models that are simultaneously consistent with (fit) all available data sets and prior knowledge and reason with them. This allows constraints stemming from causal assumptions (e.g., Causal Markov Condition, Faithfulness) to propagate. Several methods have been developed based on this idea, for which we propose the unifying name Integrative Causal Analysis (INCA). A contrived example is presented demonstrating the theoretical potential to develop more general methods for co-analyzing heterogeneous data sets. The computational experiments with the novel methods provide evidence that causally-inspired assumptions such as Faithfulness often hold to a good degree of approximation in many real systems and could be exploited for statistical inference. Code, scripts, and data are available at www.mensxmachina.org."
25 days ago
Consistent Model Selection Criteria on High Dimensions
25 days ago
"Asymptotic properties of model selection criteria for high-dimensional regression models are studied where the dimension of covariates is much larger than the sample size. Several sufficient conditions for model selection consistency are provided. Non-Gaussian error distributions are considered and it is shown that the maximal number of covariates for model selection consistency depends on the tail behavior of the error distribution. Also, sufficient conditions for model selection consistency are given when the variance of the noise is neither known nor estimated consistently. Results of simulation studies as well as real data analysis are given to illustrate that finite sample performances of consistent model selection criteria can be quite different."
to:NB
model_selection
statistics
high-dimensional_probability
25 days ago
"The huge Package for High-dimensional Undirected Graph Estimation in R"
25 days ago
"We describe an R package named huge which provides easy-to-use functions for estimating high dimensional undirected graphs from data. This package implements recent results in the literature, including Friedman et al. (2007), Liu et al. (2009, 2012) and Liu et al. (2010). Compared with the existing graph estimation package glasso, the huge package provides extra features: (1) instead of using Fortan, it is written in C, which makes the code more portable and easier to modify; (2) besides fitting Gaussian graphical models, it also provides functions for fitting high dimensional semiparametric Gaussian copula models; (3) more functions like data-dependent model selection, data generation and graph visualization; (4) a minor convergence problem of the graphical lasso algorithm is corrected; (5) the package allows the user to apply both lossless and lossy screening rules to scale up large-scale problems, making a tradeoff between computational and statistical efficiency."
to:NB
to_teach:undergrad-ADA
graphical_models
statistics
kith_and_kin
wasserman.larry
roeder.kathryn
liu.han
25 days ago
Unconscious Relational Inference Recruits the Hippocampus
25 days ago
"Relational inference denotes the capacity to encode, flexibly retrieve, and integrate multiple memories to combine past experiences to update knowledge and improve decision-making in new situations. Although relational inference is thought to depend on the hippocampus and consciousness, we now show in young, healthy men that it may occur outside consciousness but still recruits the hippocampus. In temporally distinct and unique subliminal episodes, we presented word pairs that either overlapped (“winter–red”, “red–computer”) or not. Effects of unconscious relational inference emerged in reaction times recorded during unconscious encoding and in the outcome of decisions made 1 min later at test, when participants judged the semantic relatedness of two supraliminal words. These words were either episodically related through a common word (“winter–computer” related through “red”) or unrelated. Hippocampal activity increased during the unconscious encoding of overlapping versus nonoverlapping word pairs and during the unconscious retrieval of episodically related versus unrelated words. Furthermore, hippocampal activity during unconscious encoding predicted the outcome of decisions made at test. Hence, unconscious inference may influence decision-making in new situations."
Relations represented spatially?
to:NB
neuroscience
experimental_psychology
Relations represented spatially?
25 days ago
danah boyd | apophenia » Microsoft Research opens New York City lab
25 days ago
Well, this is good news for a change.
kith_and_kin
25 days ago
Misuse of hierarchical linear models overstates the significance of a reported association between OXTR and prosociality
27 days ago
Going from a p-value of 10^-16 to 0.027 is --- painful. IFrom the lack of a response, I tend to infer that there's no arguing back...
Prediction: the original association will continue to be cited without correction.
bad_data_analysis
hierarchical_models
human_genetics
evisceration
Prediction: the original association will continue to be cited without correction.
27 days ago
README: installing Rgraphviz
27 days ago
Install graphviz, then Rgraphviz, then (?) re-start R. Or at least that worked with the student in office hours. (I swear it's painless on a Mac.)
to_teach:undergrad-ADA
27 days ago
[1204.6703] Two SVDs Suffice: Spectral decompositions for probabilistic topic modeling and latent Dirichlet allocation
27 days ago
"Topic models can be seen as a generalization of the clustering problem, in that they posit that observations are generated due to multiple latent factors (e.g. the words in each document are generated as a mixture of several active topics, as opposed to just one). This increased representational power comes at the cost of a more challenging unsupervised learning problem of estimating the topic probability vectors (the distributions over words for each topic), when only the words are observed and the corresponding topics are hidden.
"We provide a simple and efficient learning procedure that is guaranteed to recover the parameters for a wide class of mixture models, including the popular latent Dirichlet allocation (LDA) model. For LDA, the procedure correctly recovers both the topic probability vectors and the prior over the topics, using only trigram statistics (i.e. third order moments, which may be estimated with documents containing just three words). The method, termed Excess Correlation Analysis (ECA), is based on a spectral decomposition of low order moments (third and fourth order) via two singular value decompositions (SVDs). Moreover, the algorithm is scalable since the SVD operations are carried out on k by k matrices, where k is the number of latent factors (e.g. the number of topics), rather than in the d-dimensional observed space (typically d >> k)."
That's a really remarkable claim, and I'd tag it to_be_shot_after_a_fair_trial if it weren't being made by genuinely serious people.
in_NB
to_read
latent_variables
topic_models
text_mining
mixture_models
statistics
machine_learning
cool_if_true
spectral_clustering
"We provide a simple and efficient learning procedure that is guaranteed to recover the parameters for a wide class of mixture models, including the popular latent Dirichlet allocation (LDA) model. For LDA, the procedure correctly recovers both the topic probability vectors and the prior over the topics, using only trigram statistics (i.e. third order moments, which may be estimated with documents containing just three words). The method, termed Excess Correlation Analysis (ECA), is based on a spectral decomposition of low order moments (third and fourth order) via two singular value decompositions (SVDs). Moreover, the algorithm is scalable since the SVD operations are carried out on k by k matrices, where k is the number of latent factors (e.g. the number of topics), rather than in the d-dimensional observed space (typically d >> k)."
That's a really remarkable claim, and I'd tag it to_be_shot_after_a_fair_trial if it weren't being made by genuinely serious people.
27 days ago
Larger than Life: Digital Creatures in a Family of Two-Dimensional Cellular Automata (Evans, 2001)
28 days ago
"We introduce the Larger than Life family of two-dimensional two-state cellular automata that generalize certain nearest neighbor outer totalistic cellular automaton rules to large neighborhoods. We describe linear and quadratic rescalings of John Conway's celebrated Game of Life to these large neighborhood cellular automaton rules and present corresponding generalizations of Life's famous gliders and spaceships. We show that, as is becoming well known for nearest neighbor cellular automaton rules, these ``digital creatures'' are ubiquitous for certain parameter values."
(Meta-comment: jeez, guys, how hard is it to re-direct old URLs? Or at least to have a working search box?)
cellular_automata
conways_life
have_read
to:NB
evans.kellie_m.
(Meta-comment: jeez, guys, how hard is it to re-direct old URLs? Or at least to have a working search box?)
28 days ago
UnderstandingSociety: Dewey on habits
28 days ago
A good summary, except for the very end. I don't think Dewey _does_ ignore deliberation in _Human Nature and Conduct_; everything he has to say about intelligence (and, to some extent, imagination) has to do with this, which is to say, the whole of Part 3 of the book.
dewey.john
habit
moral_psychology
social_psychology
cultural_transmission_of_cognitive_tools
little.daniel
28 days ago
[1204.6265] Statistical inference for dynamical systems: a review
28 days ago
"The topic of statistical inference for dynamical systems has been studied extensively across several fields. In this survey we focus on the problem of parameter estimation for non-linear dynamical systems. Our objective is to place results across distinct disciplines in a common setting and highlight opportunities for further research."
to:NB
to_read
statistical_inference_for_stochastic_processes
dynamical_systems
statistics
time_series
state-space_models
state-space_reconstruction
pillai.natesh
via:ded-maxim
28 days ago
See this user's network
academia
afghanistan
agent-based_models
american_hegemony
american_history
anthropology
archaeology
architecture
art
bad_data_analysis
bad_science
bad_science_journalism
banking
bayesianism
biochemical_networks
blogged
blogging
book_reviews
books:noted
books:recommended
bootstrap
burke.timothy
cartoons
cats
causal_inference
causality
central_asia
central_limit_theorem
class_struggles_in_america
classifiers
climate_change
clustering
cognitive_science
collective_cognition
comics
community_discovery
complexity
computer_networks_as_provinces_of_the_commonwealth_of_letters
concentration_of_measure
conferences
confidence_sets
corruption
coveted
creeping_authoritarianism
cthulhiana
cultural_criticism
cultural_exchange
data_analysis
data_mining
debunking
decision_theory
delong.brad
democracy
density_estimation
design
dimension_reduction
dynamical_systems
econometrics
economic_history
economic_policy
economics
education
empirical_processes
ensemble_methods
entropy_estimation
epidemic_models
ergodic_theory
estimation
evisceration
evolutionary_biology
evolutionary_psychology
experimental_psychology
finance
financial_crisis_of_2007--
financial_markets
financial_speculation
fmri
food
fraud
funny
funny:academic
funny:geeky
funny:laughing_instead_of_screaming
funny:malicious
funny:pointed
funny:sad
graphical_models
great_transformation
have_read
heard_the_talk
heavy_tails
history_of_ideas
history_of_science
hypothesis_testing
imperialism
in_NB
inequality
inference_to_latent_objects
information_retrieval
information_theory
institutions
intellectual_property
internet
kith_and_kin
krugman.paul
large_deviations
lasso
learning_theory
liberman.mark
libertarianism
linguistics
literary_criticism
lolcats
machine_learning
macro_from_micro
macroeconomics
market_failures_in_everything
markov_models
mixing
model_selection
modeling
modern_ruins
moral_psychology
moral_responsibility
mortgage_crisis
natural_history_of_truthiness
network_data_analysis
networked_life
networks
neuroscience
non-equilibrium
nonparametrics
obama.barack
our_decrepit_institutions
our_national_shame
philosophy
philosophy_of_science
photos
physics
pittsburgh
poetry
political_economy
political_science
practices_relating_to_the_transmission_of_genetic_information
prediction
pretty_pictures
probability
programming
progressive_forces
psychoceramics
psychology
R
racism
random_fields
re:almost_none
re:AoS_project
re:democratic_cognition
re:do-institutions-evolve
re:g_paper
re:homophily_and_confounding
re:phil-of-bayes_paper
re:social-networks-as-sensor-networks
re:stacs
re:XV_for_mixing
re:your_favorite_dsge_sucks
recipes
regression
regulation
rhetoric
risk_assessment
running_dogs_of_reaction
science_fiction
simulation
social_life_of_the_mind
social_media
social_networks
social_science_methodology
sociology
something_about_america
sparsity
statistical_inference_for_stochastic_processes
statistical_mechanics
statistics
stochastic_processes
stupid_security
text_mining
the_american_dilemma
the_continuing_crises
time_series
to:blog
to:NB
to_be_shot_after_a_fair_trial
to_read
to_teach:complexity-and-inference
to_teach:data-mining
to_teach:undergrad-ADA
torture
track_down_references
us-iraq_war
us_politics
utter_stupidity
vast_right-wing_conspiracy
via:?
via:arthegall
via:jbdelong
via:klk
visual_display_of_quantitative_information
whats_gone_wrong_with_america
why_oh_why_cant_we_have_a_better_academic_publishing_system
why_oh_why_cant_we_have_a_better_press_corps
yglesias.matthew