cshalizi + automata_theory 15
Efficiently identifying deterministic real-time automata from labeled data
february 2012 by cshalizi
"We develop a novel learning algorithm RTI for identifying a deterministic real-time automaton (DRTA) from labeled time-stamped event sequences. The RTI algorithm is based on the current state of the art in deterministic finite-state automaton (DFA) identification, called evidence-driven state-merging (EDSM). In addition to having a DFA structure, a DRTA contains time constraints between occurrences of consecutive events. Although this seems a small difference, we show that the problem of identifying a DRTA is much more difficult than the problem of identifying a DFA: identifying only the time constraints of a DRTA given its DFA structure is already NP-complete. In spite of this additional complexity, we show that RTI is a correct and complete algorithm that converges efficiently (from polynomial time and data) to the correct DRTA in the limit. To the best of our knowledge, this is the first algorithm that can identify a timed automaton model from time-stamped event sequences.
A straightforward alternative to identifying DRTAs is to identify a DFA that models time implicitly, i.e., a DFA that uses different states for different points in time. Such a DFA can be identified by first sampling the timed sequences using a fixed frequency, and subsequently applying EDSM to the resulting non-timed event sequences. We evaluate the performance of both RTI and this sampling approach experimentally on artificially generated data. In these experiments RTI outperforms the sampling approach significantly. Thus, we show that if we obtain data from a real-time system, it is easier to identify a DRTA from this data than to identify an equivalent DFA."
in_NB
automata_theory
machine_learning
grammar_induction
A straightforward alternative to identifying DRTAs is to identify a DFA that models time implicitly, i.e., a DFA that uses different states for different points in time. Such a DFA can be identified by first sampling the timed sequences using a fixed frequency, and subsequently applying EDSM to the resulting non-timed event sequences. We evaluate the performance of both RTI and this sampling approach experimentally on artificially generated data. In these experiments RTI outperforms the sampling approach significantly. Thus, we show that if we obtain data from a real-time system, it is easier to identify a DRTA from this data than to identify an equivalent DFA."
february 2012 by cshalizi
Stochastic Automata: Stability, Nondeterminism and Prediction
january 2011 by cshalizi
From 1981 but perhaps still worth looking at?
books:noted
automata_theory
markov_models
re:AoS_project
coveted
january 2011 by cshalizi
[1004.3246] The Complexity of Finding Reset Words in Finite Automata
june 2010 by cshalizi
For "reset word", read "synchronizing word".
automata_theory
synchronizing_words
re:AoS_project
via:vaguery
june 2010 by cshalizi
Grammatical Inference: Learning Automata and Grammars - Cambridge University Press
april 2010 by cshalizi
I was one of the referees for the MS. proposal. This will be good.
books:recommended
grammar_induction
machine_learning
automata_theory
april 2010 by cshalizi
APPLICATIONS OF AUTOMATA THEORY AND ALGEBRA
february 2010 by cshalizi
I actually read chunks of this while in graduate school. It was... strange.
complexity
automata_theory
borderline_psychoceramica
books:noted
february 2010 by cshalizi
Statistical Estimation Requires Unbounded Memory
july 2009 by cshalizi
"We investigate the existence of bounded-memory consistent estimators of various statistical functionals. This question is resolved in the negative in a rather strong sense. We propose various bounded-memory approximations, using techniques from automata theory and stochastic processes. Some questions of potential interest are raised for future work."
In other words: you need an unbounded memory even to estimate the bias of coin-flips, even if you know that there are only just two possible biases. This is pretty sweet (in a thoroughly negative way). --- A lot of work is done here by the strength of the notion of consistency employed (see e.g. the counter-examples with finite but small limiting error probabilities), but this is the standard one!
automata_theory
statistics
statistical_inference_for_stochastic_processes
computational_complexity
kontorovich.aryeh
kith_and_kin
have_read
In other words: you need an unbounded memory even to estimate the bias of coin-flips, even if you know that there are only just two possible biases. This is pretty sweet (in a thoroughly negative way). --- A lot of work is done here by the strength of the notion of consistency employed (see e.g. the counter-examples with finite but small limiting error probabilities), but this is the standard one!
july 2009 by cshalizi
FSA utilities: A toolbox to manipulate finite-state automata
june 2009 by cshalizi
"This paper describes the FSA Utilities toolbox: a collection of utilities to manipulate finite-state automata and finite-state transducers. Manipulations include determinization (both for finite-state acceptors and finite-state transducers), minimization, composition, complementation, intersection, Kleene closure, etc. Furthermore, various visualization tools are available to browse finite-state automata. The toolbox is implemented in SICStus Prolog." Where's something comparable in a sane language? (N.B., must handle transducers.)
automata_theory
programming
re:AoS_project
june 2009 by cshalizi
[0812.1949] Prediction with Restricted Resources and Finite Automata
december 2008 by cshalizi
"We obtain an index of the complexity of a random sequence by allowing the role of the measure in classical probability theory to be played by a function we call the generating mechanism. Typically, this generating mechanism will be a finite automata. We generate a set of biased sequences by applying a finite state automata with a specified number, $m$, of states to the set of all binary sequences. Thus we can index the complexity of our random sequence by the number of states of the automata. We detail optimal algorithms to predict sequences generated in this way."
prediction
automata_theory
complexity_measures
to_read
to_be_shot_after_a_fair_trial
december 2008 by cshalizi
Complexity: Hierarchical Structure and Scaling in Phyiscs [Badii and Politi] Labyrinth Books
june 2008 by cshalizi
Unquestionably still the best book on the mathematical foundations of complex systems theory, and the measurement of complexity.
books:recommended
complexity
cellular_automata
dynamical_systems
ergodic_theory
information_theory
automata_theory
formal_languages
chaos
algorithmic_information_theory
hierarchical_structure
badii.remo
politi.antonio
june 2008 by cshalizi
related tags
algorithmic_information_theory ⊕ approximation ⊕ automata_theory ⊖ badii.remo ⊕ books:noted ⊕ books:recommended ⊕ borderline_psychoceramica ⊕ cellular_automata ⊕ chaos ⊕ complexity ⊕ complexity_measures ⊕ computational_complexity ⊕ coveted ⊕ decision_theory ⊕ de_deo.simon ⊕ dynamical_systems ⊕ education ⊕ emergence ⊕ epistemology ⊕ ergodic_theory ⊕ formal_languages ⊕ game_theory ⊕ grammar_induction ⊕ have_read ⊕ hierarchical_structure ⊕ information_theory ⊕ in_NB ⊕ kith_and_kin ⊕ kontorovich.aryeh ⊕ logic ⊕ machine_learning ⊕ macro_from_micro ⊕ markov_models ⊕ methodology ⊕ mohri.mehryar ⊕ pereira.fernando ⊕ philosophy ⊕ philosophy_of_mind ⊕ philosophy_of_science ⊕ politi.antonio ⊕ prediction ⊕ programming ⊕ re:AoS_project ⊕ reinforcement_learning ⊕ riley.michael ⊕ statistical_inference_for_stochastic_processes ⊕ statistics ⊕ suppes.ptrick ⊕ synchronizing_words ⊕ to:NB ⊕ to_be_shot_after_a_fair_trial ⊕ to_read ⊕ transducers ⊕ via:leo ⊕ via:vaguery ⊕Copy this bookmark: