Vaguery + inference   14

[1109.2215] Finding missing edges and communities in incomplete networks
Many algorithms have been proposed for predicting missing edges in networks, but they do not usually take account of which edges are missing. We focus on networks which have missing edges of the form that is likely to occur in real networks, and compare algorithms that find these missing edges. We also investigate the effect of this kind of missing data on community detection algorithms.
network-theory  algorithms  inference  statistics  nudge-targets 
january 2012 by Vaguery
[1110.5063] Recovering a Clipped Signal in Sparseland
In many data acquisition systems it is common to observe signals whose amplitudes have been clipped. We present two new algorithms for recovering a clipped signal by leveraging the model assumption that the underlying signal is sparse in the frequency domain. Both algorithms employ ideas commonly used in the field of Compressive Sensing; the first is a modified version of Reweighted $ell_1$ minimization, and the second is a modification of a simple greedy algorithm known as Trivial Pursuit. An empirical investigation shows that both approaches can recover signals with significant levels of clipping
signal-processing  inference  compressive-sensing  algorithms  nudge-targets 
january 2012 by Vaguery
[1008.2555] Succinct Data Structures for Assembling Large Genomes
"Motivation: Second generation sequencing technology makes it feasible for many researches to obtain enough sequence reads to attempt the de novo assembly of higher eukaryotes (including mammals). De novo assembly not only provides a tool for understanding wide scale biological variation, but within human bio-medicine, it offers a direct way of observing both large scale structural variation and fine scale sequence variation. Unfortunately, improvements in the computational feasibility for de novo assembly have not matched the improvements in the gathering of sequence data. This is for two reasons: the inherent computational complexity of the problem, and the in-practice memory requirements of tools."
genomics  bioinformatics  discrete-mathematics  algorithms  nudge-targets  inference  data-driven 
august 2010 by Vaguery
[1008.1136] Recovering magnetization distributions from their noisy diffraction data
"We study, using simulated experiments inspired by thin film magnetic domain patterns, the feasibility of phase retrieval in X-ray diffractive imaging in the presence of intrinsic charge scattering given only photon-shot-noise limited diffraction data. We detail a reconstruction algorithm to recover the sample's magnetization distribution under such conditions, and compare its performance with that of Fourier transform holography. Concerning the design of future experiments, we also chart out the reconstruction limits of diffractive imaging when photon- shot-noise and the intensity of charge scattering noise are independently varied. This work is directly relevant to the time-resolved imaging of magnetic dynamics using coherent and ultrafast radiation from X-ray free electron lasers and also to broader classes of diffractive imaging experiments which suffer noisy data, missing data or both."
image-processing  materials-science  nudge-targets  inference  signal-processing 
august 2010 by Vaguery
[1008.1663] Learning Residual Finite-State Automata Using Observation Tables
"We define a two-step learner for RFSAs based on an observation table by using an algorithm for minimal DFAs to build a table for the reversal of the language in question and showing that we can derive the minimal RFSA from it after some simple modifications. We compare the algorithm to two other table-based ones of which one (by Bollig et al. 2009) infers a RFSA directly, and the other is another two-step learner proposed by the author. We focus on the criterion of query complexity."
finite-state-machine  machine-learning  algorithms  nudge-targets  learning-from-data  inference 
august 2010 by Vaguery
[1008.1414] Statistically validated networks in bipartite complex systems
"Many complex systems present an intrinsic bipartite nature and are often described and modeled in terms of networks [1-5]. Examples include movies and actors [1, 2, 4], authors and scientific papers [6-9], email accounts and emails [10], plants and animals that pollinate them [11, 12]. Bipartite networks are often very heterogeneous in the number of relationships that the elements of one set establish with the elements of the other set. … Here we introduce an unsupervised method to statistically validate each link of the projected network against a null hypothesis taking into account the heterogeneity of the system. We apply our method to three different systems…. In all these systems, both different in size and level of heterogeneity, we find that our method is able to detect network structures which are informative about the system…"
complexology  network-theory  algorithms  machine-learning  nudge-targets  inference  statistics 
august 2010 by Vaguery
[1007.1829] Topological reversibility and causality in feed-forward networks
"Systems whose organization displays causal asymmetry constraints, from evolutionary trees to river basins or transport networks, can be often described in terms of directed paths (causal flows) on a discrete state space. Such a set of paths defines a feed-forward, acyclic network. A key problem associated with these systems involves characterizing their intrinsic degree of path reversibility: given an end node in the graph, what is the uncertainty of recovering the process backwards until the origin? Here we propose a novel concept, \textit{topological reversibility}, which rigorously weigths such uncertainty in path dependency quantified as the minimum amount of information required to successfully revert a causal path.…"
complexology  network-theory  inference  heuristics  modeling 
july 2010 by Vaguery
[1007.2467] Parametric Level Set Methods for Inverse Problems
"In this paper, a parametric level set method for reconstruction of obstacles in general inverse problems is considered. General evolution equations for the reconstruction of unknown obstacles are derived in terms of the underlying level set parameters. We show that using the appropriate form of parameterizing the level set function results a significantly lower dimensional problem, which bypasses many difficulties with traditional level set methods, such as regularization, re-initialization and use of signed distance function.…"
image-processing  radiology  numerical-methods  inverse-problems  inference  nudge-targets 
july 2010 by Vaguery
[0906.5321] Efficient statistical inference for stochastic reaction processes
"We address the problem of estimating unknown model parameters and state variables in stochastic reaction processes when only sparse and noisy measurements are available. Using an asymptotic system size expansion for the backward equation we derive an efficient approximation for this problem. We demonstrate the validity of our approach on model systems and generalize our method to the case when some state variables are not observed."
models  statistics  inference  inverse-problems  nudge-targets  dynamical-systems 
july 2010 by Vaguery
[1005.5636] Astrocladistics: Multivariate Evolutionary Analysis in Astrophysics
"It is now clear that cladistics can be applied and be useful to the study of galaxy diversification. Many difficulties, conceptual and practical, have been solved,. Significant astrophysical results have been obtained and will be extended to larger samples of galaxies and globular clusters. However, many paths remain in the exploration of this new and large field of research."
astronomy  classification  cladistics  inference  nudge-targets  learning-from-data  model-discovery 
june 2010 by Vaguery
[1005.3694] Dynamics and Performance of Susceptibility Propagation on Synthetic Data
"The inverse Ising problem is a difficult combinatorial optimization problem in the class known as “NP-hard”. In theory, only approximate schemes, or methods that take more than polynomial time to find the answer are possible. Boltzmann Learning [1] is an iterative method where in one step the correlation functions are computed given an Ising model, and in another step the Ising model couplings are modified to adjust to data. In principle, Boltzmann learning can be employed to find the couplings with arbi- trary accuracy given accurate data and sufficient time, but the slow convergence of the Boltzmann learning makes it a very inefficient algorithm for most practical purposes."
inverse-problems  inference  complex-systems  ising-model  nudge-targets 
may 2010 by Vaguery
[0911.2651] Optimal map of the modular structure of complex networks
"…Generally speaking, modules are islands of highly connected nodes separated by a relatively small number of links. Every module can have contributions of links from any node in the network. The challenge is to disentangle these contributions to understand how the modular structure is built. The main problem is that the analysis of a certain partition into modules involves, in principle, as many data as number of modules times number of nodes. To confront this challenge, here we first define the contribution matrix, the mathematical object containing all the information about the partition of interest, and after, we use a Truncated Singular Value Decomposition to extract the best representation of this matrix in a plane. The analysis of this projection allow us to scrutinize the skeleton of the modular structure, revealing the structure of individual modules and their interrelations."
network-thinking  complexology  inference  modeling-is-not-mathematics  learning-from-data 
may 2010 by Vaguery
[1005.1327] Statistical Model Checking : An Overview
"Quantitative properties of stochastic systems are usually specified in logics that allow one to compare the measure of executions satisfying certain temporal properties with thresholds. The model checking problem for stochastic systems with respect to such logics is typically solved by a numerical approach that iteratively computes (or approximates) the exact measure of paths satisfying relevant subformulas; the algorithms themselves depend on the class of systems being analyzed as well as the logic used for specifying the properties. Another approach to solve the model checking problem is to \emph{simulate} the system for finitely many runs, and use \emph{hypothesis testing} to infer whether the samples provide a \emph{statistical} evidence for the satisfaction or violation of the specification. In this short paper, we survey the statistical approach, and outline its main advantages in terms of efficiency, uniformity, and simplicity."
complexology  simulation  statistics  models  modeling-is-not-mathematics  inference  explanatory-power 
may 2010 by Vaguery

Copy this bookmark:



description:


tags: