Bill Katz
november 2010 by nico.ash
Simple Full Text Search for App Engine
Revised 2009-07-12 to add stemming, multi-word exact matching, multiple search entities, and stowing of designated parent titles in index entity keys.
algorithms
appengine
article
code
cs
datastore
development
django
engine
foss
library
googleappengine
google
gae
fulltext
search
model
programming
text
web
python
resource
indexing
google_app_engine
searchablemodel
Damn Cool Algorithms: Levenshtein Automata - Nick's Blog
july 2010 by nico.ash
The basic insight behind Levenshtein automata is that it's possible to construct a finite state automaton that recognizes exactly the set of strings within a given Levenshtein distance of a target word. We can then feed in any word, and the automaton will accept or reject it based on whether the Levenshtein distance to the target word is at most the distance specified when we constructed the automaton. Further, due to the nature of FSAs, it will do so in O(n) time, where n is the length of the string being tested. Compare this to the standard dynamic programming Levenshtein algorithm, which takes O(mn) time, where m and n are the lengths of the two input words! It's thus immediately apparent that Levenshtein automata provide, at a minimum, a faster way for us to check many words against a single target word and maximum distance - not a bad improvement to start with!
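The accept/reject behaviour described above can be sketched in Python. This is a minimal illustration, not the code from the linked article: the class name `LevenshteinAutomaton` and the helper `accepts` are hypothetical, and the automaton is simulated on the fly by using one capped row of the edit-distance table as the state. Each step here costs O(m) in the target length, so reaching the O(n) bound would additionally require precompiling the reachable states into a DFA transition table.

```python
class LevenshteinAutomaton:
    """Simulated automaton accepting words within `k` edits of `target`.

    A state is one row of the edit-distance table, with every value
    capped at k + 1 so that the number of distinct states stays finite.
    """

    def __init__(self, target, k):
        self.target = target
        self.k = k

    def start(self):
        # Distance from the empty input prefix to each prefix of the target.
        cap = self.k + 1
        return tuple(min(i, cap) for i in range(len(self.target) + 1))

    def step(self, state, ch):
        # Consume one input character and compute the next table row.
        cap = self.k + 1
        new = [min(state[0] + 1, cap)]
        for i, t in enumerate(self.target):
            cost = 0 if t == ch else 1
            new.append(min(new[i] + 1,        # skip a target character
                           state[i] + cost,   # match or substitute
                           state[i + 1] + 1,  # skip the input character
                           cap))
        return tuple(new)

    def is_match(self, state):
        # Accepting state: the whole target is within k edits.
        return state[-1] <= self.k

    def can_match(self, state):
        # Dead state check: no entry can ever drop back to k or below.
        return min(state) <= self.k


def accepts(automaton, word):
    """Feed `word` through the automaton and report accept/reject."""
    state = automaton.start()
    for ch in word:
        state = automaton.step(state, ch)
        if not automaton.can_match(state):
            return False
    return automaton.is_match(state)
```

For example, `accepts(LevenshteinAutomaton("food", 1), "fod")` is accepted because a single edit separates the two words, while `"fd"` is rejected at distance 1 and accepted at distance 2.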
algorithms
algorithm
search
python
levenshtein
distance
toread
programming
article
fuzzy
automata
Trending Low-Volume Google Search Terms - Introducing Gootrude
june 2008 by nico.ash
The basic strategy is to take a collection of search terms, automatically query Google for the number of results associated with each of these search terms (displayed by Google when doing a web search), and graph these numbers over time with Gnuplot.
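The graphing half of that strategy might look like the following sketch. It is written in Python rather than the Perl the tool itself is tagged with, it leaves out the step of querying Google, and the function names `gnuplot_data` and `gnuplot_script`, the file names, and the date format are all assumptions made for illustration. It only shows how recorded (date, result count) pairs could be turned into a Gnuplot data file and plot script.

```python
def gnuplot_data(samples):
    """Render (date, count) pairs as whitespace-separated data lines."""
    return "".join(f"{day} {count}\n" for day, count in samples)


def gnuplot_script(term, data_path, png_path):
    """Build a Gnuplot script that plots result counts over time."""
    return "\n".join([
        "set xdata time",                 # treat the x column as dates
        'set timefmt "%Y-%m-%d"',         # input date format (assumed)
        'set format x "%m/%d"',           # tick label format
        "set terminal png",
        f'set output "{png_path}"',
        f'set title "Result count for: {term}"',
        f'plot "{data_path}" using 1:2 with linespoints title "{term}"',
    ]) + "\n"


if __name__ == "__main__":
    # Placeholder counts, invented purely to show the file layout.
    samples = [("2008-06-01", 120), ("2008-06-02", 135)]
    with open("counts.dat", "w") as fh:
        fh.write(gnuplot_data(samples))
    with open("counts.gp", "w") as fh:
        fh.write(gnuplot_script("example term", "counts.dat", "counts.png"))
```

Running `gnuplot counts.gp` on the generated files would then produce the PNG, assuming Gnuplot is installed.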
google
imported
opensource
perl
search
software
tools
trends
visualization