jm + algorithms   26

Open Data Structures
A free-as-in-speech as well as -beer textbook of data structures, covering a great range, including some I hadn't heard of before. Here's the full list: ArrayStack, FastArrayStack, ArrayQueue, ArrayDeque, DualArrayDeque, RootishArrayStack, SLList, DLList,
SEList, SkiplistSSet, SkiplistList, ChainedHashTable, LinearHashTable, BinaryTree, BinarySearchTree, Treap, ScapegoatTree, RedBlackTree, BinaryHeap, MeldableHeap, AdjacencyMatrix, AdjacencyLists, BinaryTrie, XFastTrie, and YFastTrie
algorithms  books  data-structures  computer-science  coding  tries  skiplists  arrays  queues  heap  trees  graphs  hashtables 
16 days ago by jm
Probabilistic Data Structures for Web Analytics and Data Mining « Highly Scalable Blog
Stream summary, count-min sketches, loglog counting, linear counters. Some nifty algorithms for probabilistic estimation of element frequencies and data-set cardinality (via proggit)
via:proggit  algorithms  probability  probabilistic  count-min  stream-summary  loglog-counting  linear-counting  estimation  big-data 
26 days ago by jm
InfoQ: Lock-free Algorithms
Michael Barker and Martin Thompson's talk at the last QCon on the LMAX Disruptor, and other nifty lock-free techniques and patterns. 'Martin Thompson and Michael Barker explain how Intel x86_64 processors and their memory model work, along with low-level techniques that help creating lock-free software.'
lock-free  locking  mutexes  algorithms  lmax  disruptor  infoq  slides  presentations  qcon  java 
4 weeks ago by jm
HN on "What it takes to build great machine learning products"
TBH, I think this discussion thread is more useful than the article itself. It's still remarkably difficult to successfully apply ML techniques to real-world problems :(
machine-learning  hacker-news  discussion  commentary  ai  algorithms 
6 weeks ago by jm
Near Neighbor Search in High Dimensional Data [PDF]
Detect near-duplicates; would be good for future Razor-like efficient near-duplicate detection. (slides)
slides  algorithms  email  performance  programming  near-neighbour-search  search 
february 2012 by jm
MapReduce Patterns, Algorithms, and Use Cases
'I digested a number of MapReduce patterns and algorithms to give a systematic view of the different techniques that can be found in the web or scientific articles. Several practical case studies are also provided. All descriptions and code snippets use the standard Hadoop’s MapReduce model with Mappers, Reduces, Combiners, Partitioners, and sorting.'
algorithms  hadoop  java  mapreduce  patterns  distcomp 
february 2012 by jm
algorithm - Generating shuffled range using a PRNG rather than shuffling - Stack Overflow
some reasonably good answers on using an LFSR or LCG to generate a full-cycle permutation with no repeats
lfsr  lcg  algorithms  permutation  shuffling 
december 2011 by jm
Using a Feistel Network for full-cycle permutation
nice algorithm. requires that the permuted set's size be a power of 2 however - although for smaller sets you can just skip to the next output value, since they're not going to repeat
feistel-network  full-cycle  permutation  shuffling  algorithms 
december 2011 by jm
Golomb-coded sets
'a probabilistic data structure conceptually similar to a Bloom filter, but with a more compact in-memory representation, and a slower query time.' could come in handy
gcs  bloom-filters  probabilistic  data-structures  memory  algorithms 
september 2011 by jm
/~colmmacc/ » Prime and Proper
algorithm to perform set membership tests on enumerated sets quickly and memory-efficiently, using multiplication by primes. Nice trick
hacks  colmmacc  prime-numbers  set-membership  bloom-filters  bignums  algorithms  programming  from delicious
september 2010 by jm
Introduction to parallel & distributed algorithms
really interesting parallel algorithm concepts. I'd seen parallel merge sort before from the map-reduce world, but some others are new to me and worth thinking about (via Hacker News)
via:hackernews  algorithms  distributed  parallel  map-reduce  merge-sort  sorting  from delicious
august 2010 by jm
jwz - What different sorting algorithms sound like
in the style of BBC Radiophonics Workshop, with copious flange -- my favourite is heap sort. this is brilliant (via jwz)
via:jwz  sound  music  sorting  algorithms  from delicious
august 2010 by jm
Interpolation search
neat search algo, via Jeremy Zawodny; can be more efficient than binary search (O(log log n)), for indexed, ordered arrays, at the cost of more computation per iteration
algorithms  programming  search  via:jzawodny  from delicious
july 2010 by jm
Chatroulette Working On Genital Recognition Algorithm
just *male* genitalia, mind. I dread to think of what the training corpus looks like
chatroulette  algorithms  machine-learning  genitalia  nsfw  slashdot  from delicious
june 2010 by jm
Damn Cool Algorithms: Spatial indexing
quadtrees, Hilbert curves, and geohashing, as seen in Google's new Closure library. useful for multidimensional addressing in general
algorithms  mapping  gis  indexing  quadtree  datastructures  spatial  geometry  from delicious
november 2009 by jm
Hacking a Google Interview
course notes from a 4-day MIT course on tech interviewing (via Hacker News)
interviews  google  hiring  puzzles  mit  questions  coding  computer-science  algorithms 
august 2009 by jm
iPhone Sudoku Grab: How does it all work?
lovely run-through of the computer-vision algorithms this iPhone app uses (via Waxy)
via:waxy  ai  image  programming  algorithms  graphics  iphone  ocr  computervision  opencv  sudoku 
august 2009 by jm
A short history of btrfs [LWN.net]
wow, sounds good! looking forward to this hitting production-ready status
btrfs  history  zfs  linux  open-source  licensing  storage  sysadmin  b-trees  b+trees  algorithms  fs  filesystems 
august 2009 by jm

related tags

ai  algorithms  arrays  b+trees  b-trees  big-data  bignums  bloom-filters  books  btrfs  burrows-wheeler-transform  c  caching  cassandra  chatroulette  coding  colmmacc  commentary  compression  computer-science  computervision  count-min  cs  data  data-structures  database  datamining  datastructures  dirichlet  discussion  disruptor  distcomp  distributed  dna  email  estimation  false-positives  feistel-network  filesystems  fs  full-cycle  gcs  geek  genitalia  genome  geometry  gis  google  graphics  graphs  hacker-news  hacks  hadoop  hashing  hashtables  heap  hiring  history  image  indexing  infoq  interviews  iphone  java  lcg  lcs  lectures  lfsr  licensing  linear-counting  linux  lmax  lock-free  locking  loglog-counting  machine-learning  mailinator  map-reduce  mapping  mapreduce  memory  merge-sort  mit  music  mutexes  near-neighbour-search  nsfw  ocr  open-source  opencv  optimization  parallel  patterns  performance  permutation  presentations  prime-numbers  probabilistic  probability  programming  puzzles  qcon  quadtree  questions  queues  rank  ranking  recommendations  scoring  search  sequencing  set-membership  shuffling  skiplists  slashdot  slides  sort  sorting  sorting-networks  sound  spatial  stack-overflow  storage  stream-summary  string-matching  strings  sudoku  sysadmin  tips  trees  tries  via:hackernews  via:jwz  via:jzawodny  via:proggit  via:waxy  zfs 

Copy this bookmark:



description:


tags: