[1109.2215] Finding missing edges and communities in incomplete networks
january 2012 by Vaguery
Many algorithms have been proposed for predicting missing edges in networks, but they do not usually take account of which edges are missing. We focus on networks which have missing edges of the form that is likely to occur in real networks, and compare algorithms that find these missing edges. We also investigate the effect of this kind of missing data on community detection algorithms.
network-theory
algorithms
inference
statistics
nudge-targets
january 2012 by Vaguery
[1110.5063] Recovering a Clipped Signal in Sparseland
january 2012 by Vaguery
In many data acquisition systems it is common to observe signals whose amplitudes have been clipped. We present two new algorithms for recovering a clipped signal by leveraging the model assumption that the underlying signal is sparse in the frequency domain. Both algorithms employ ideas commonly used in the field of Compressive Sensing; the first is a modified version of Reweighted $ell_1$ minimization, and the second is a modification of a simple greedy algorithm known as Trivial Pursuit. An empirical investigation shows that both approaches can recover signals with significant levels of clipping
signal-processing
inference
compressive-sensing
algorithms
nudge-targets
january 2012 by Vaguery
[1008.2555] Succinct Data Structures for Assembling Large Genomes
august 2010 by Vaguery
"Motivation: Second generation sequencing technology makes it feasible for many researches to obtain enough sequence reads to attempt the de novo assembly of higher eukaryotes (including mammals). De novo assembly not only provides a tool for understanding wide scale biological variation, but within human bio-medicine, it offers a direct way of observing both large scale structural variation and fine scale sequence variation. Unfortunately, improvements in the computational feasibility for de novo assembly have not matched the improvements in the gathering of sequence data. This is for two reasons: the inherent computational complexity of the problem, and the in-practice memory requirements of tools."
genomics
bioinformatics
discrete-mathematics
algorithms
nudge-targets
inference
data-driven
august 2010 by Vaguery
[1008.1136] Recovering magnetization distributions from their noisy diffraction data
august 2010 by Vaguery
"We study, using simulated experiments inspired by thin film magnetic domain patterns, the feasibility of phase retrieval in X-ray diffractive imaging in the presence of intrinsic charge scattering given only photon-shot-noise limited diffraction data. We detail a reconstruction algorithm to recover the sample's magnetization distribution under such conditions, and compare its performance with that of Fourier transform holography. Concerning the design of future experiments, we also chart out the reconstruction limits of diffractive imaging when photon- shot-noise and the intensity of charge scattering noise are independently varied. This work is directly relevant to the time-resolved imaging of magnetic dynamics using coherent and ultrafast radiation from X-ray free electron lasers and also to broader classes of diffractive imaging experiments which suffer noisy data, missing data or both."
image-processing
materials-science
nudge-targets
inference
signal-processing
august 2010 by Vaguery
[1008.1663] Learning Residual Finite-State Automata Using Observation Tables
august 2010 by Vaguery
"We define a two-step learner for RFSAs based on an observation table by using an algorithm for minimal DFAs to build a table for the reversal of the language in question and showing that we can derive the minimal RFSA from it after some simple modifications. We compare the algorithm to two other table-based ones of which one (by Bollig et al. 2009) infers a RFSA directly, and the other is another two-step learner proposed by the author. We focus on the criterion of query complexity."
finite-state-machine
machine-learning
algorithms
nudge-targets
learning-from-data
inference
august 2010 by Vaguery
[1008.1414] Statistically validated networks in bipartite complex systems
august 2010 by Vaguery
"Many complex systems present an intrinsic bipartite nature and are often described and modeled in terms of networks [1-5]. Examples include movies and actors [1, 2, 4], authors and scientific papers [6-9], email accounts and emails [10], plants and animals that pollinate them [11, 12]. Bipartite networks are often very heterogeneous in the number of relationships that the elements of one set establish with the elements of the other set. … Here we introduce an unsupervised method to statistically validate each link of the projected network against a null hypothesis taking into account the heterogeneity of the system. We apply our method to three different systems…. In all these systems, both different in size and level of heterogeneity, we find that our method is able to detect network structures which are informative about the system…"
complexology
network-theory
algorithms
machine-learning
nudge-targets
inference
statistics
august 2010 by Vaguery
[1007.1829] Topological reversibility and causality in feed-forward networks
july 2010 by Vaguery
"Systems whose organization displays causal asymmetry constraints, from evolutionary trees to river basins or transport networks, can be often described in terms of directed paths (causal flows) on a discrete state space. Such a set of paths defines a feed-forward, acyclic network. A key problem associated with these systems involves characterizing their intrinsic degree of path reversibility: given an end node in the graph, what is the uncertainty of recovering the process backwards until the origin? Here we propose a novel concept, \textit{topological reversibility}, which rigorously weigths such uncertainty in path dependency quantified as the minimum amount of information required to successfully revert a causal path.…"
complexology
network-theory
inference
heuristics
modeling
july 2010 by Vaguery
[1007.2467] Parametric Level Set Methods for Inverse Problems
july 2010 by Vaguery
"In this paper, a parametric level set method for reconstruction of obstacles in general inverse problems is considered. General evolution equations for the reconstruction of unknown obstacles are derived in terms of the underlying level set parameters. We show that using the appropriate form of parameterizing the level set function results a significantly lower dimensional problem, which bypasses many difficulties with traditional level set methods, such as regularization, re-initialization and use of signed distance function.…"
image-processing
radiology
numerical-methods
inverse-problems
inference
nudge-targets
july 2010 by Vaguery
[0906.5321] Efficient statistical inference for stochastic reaction processes
july 2010 by Vaguery
"We address the problem of estimating unknown model parameters and state variables in stochastic reaction processes when only sparse and noisy measurements are available. Using an asymptotic system size expansion for the backward equation we derive an efficient approximation for this problem. We demonstrate the validity of our approach on model systems and generalize our method to the case when some state variables are not observed."
models
statistics
inference
inverse-problems
nudge-targets
dynamical-systems
july 2010 by Vaguery
[1005.5636] Astrocladistics: Multivariate Evolutionary Analysis in Astrophysics
june 2010 by Vaguery
"It is now clear that cladistics can be applied and be useful to the study of galaxy diversification. Many difficulties, conceptual and practical, have been solved,. Significant astrophysical results have been obtained and will be extended to larger samples of galaxies and globular clusters. However, many paths remain in the exploration of this new and large field of research."
astronomy
classification
cladistics
inference
nudge-targets
learning-from-data
model-discovery
june 2010 by Vaguery
[1005.3694] Dynamics and Performance of Susceptibility Propagation on Synthetic Data
may 2010 by Vaguery
"The inverse Ising problem is a difficult combinatorial optimization problem in the class known as “NP-hard”. In theory, only approximate schemes, or methods that take more than polynomial time to find the answer are possible. Boltzmann Learning [1] is an iterative method where in one step the correlation functions are computed given an Ising model, and in another step the Ising model couplings are modified to adjust to data. In principle, Boltzmann learning can be employed to find the couplings with arbi- trary accuracy given accurate data and sufficient time, but the slow convergence of the Boltzmann learning makes it a very inefficient algorithm for most practical purposes."
inverse-problems
inference
complex-systems
ising-model
nudge-targets
may 2010 by Vaguery
[0911.2651] Optimal map of the modular structure of complex networks
may 2010 by Vaguery
"…Generally speaking, modules are islands of highly connected nodes separated by a relatively small number of links. Every module can have contributions of links from any node in the network. The challenge is to disentangle these contributions to understand how the modular structure is built. The main problem is that the analysis of a certain partition into modules involves, in principle, as many data as number of modules times number of nodes. To confront this challenge, here we first define the contribution matrix, the mathematical object containing all the information about the partition of interest, and after, we use a Truncated Singular Value Decomposition to extract the best representation of this matrix in a plane. The analysis of this projection allow us to scrutinize the skeleton of the modular structure, revealing the structure of individual modules and their interrelations."
network-thinking
complexology
inference
modeling-is-not-mathematics
learning-from-data
may 2010 by Vaguery
[1005.1327] Statistical Model Checking : An Overview
may 2010 by Vaguery
"Quantitative properties of stochastic systems are usually specified in logics that allow one to compare the measure of executions satisfying certain temporal properties with thresholds. The model checking problem for stochastic systems with respect to such logics is typically solved by a numerical approach that iteratively computes (or approximates) the exact measure of paths satisfying relevant subformulas; the algorithms themselves depend on the class of systems being analyzed as well as the logic used for specifying the properties. Another approach to solve the model checking problem is to \emph{simulate} the system for finitely many runs, and use \emph{hypothesis testing} to infer whether the samples provide a \emph{statistical} evidence for the satisfaction or violation of the specification. In this short paper, we survey the statistical approach, and outline its main advantages in terms of efficiency, uniformity, and simplicity."
complexology
simulation
statistics
models
modeling-is-not-mathematics
inference
explanatory-power
may 2010 by Vaguery
related tags
algorithms ⊕ astronomy ⊕ Bayesian ⊕ bioinformatics ⊕ cladistics ⊕ classification ⊕ collaborative-filtering ⊕ complex-systems ⊕ complexology ⊕ compressive-sensing ⊕ crowdsourcing ⊕ data ⊕ data-driven ⊕ discrete-mathematics ⊕ dynamical-systems ⊕ explanatory-power ⊕ finite-state-machine ⊕ genomics ⊕ heuristics ⊕ image-processing ⊕ inference ⊖ inverse-problems ⊕ ising-model ⊕ learning-from-data ⊕ machine-learning ⊕ materials-science ⊕ model-discovery ⊕ modeling ⊕ modeling-is-not-mathematics ⊕ models ⊕ network-theory ⊕ network-thinking ⊕ nudge-targets ⊕ numerical-methods ⊕ radiology ⊕ research ⊕ signal-processing ⊕ simulation ⊕ statistics ⊕Copy this bookmark: