threedaymonk + search 27
CommonCrawl
11 weeks ago by threedaymonk
‘Common Crawl produces and maintains a repository of web crawl data that is openly accessible to everyone.’ Free access from EC2.
search
internet
crawl
Google's decreasingly useful, spam-filled web search
january 2011 by threedaymonk
‘Searching Google is now like asking a question in a crowded flea market of hungry, desperate, sleazy salesmen who all claim to have the answer to every question you ask.’
google
search
spam
seo
Content-Search Deals Make Twitter Profitable - BusinessWeek
december 2009 by threedaymonk
So that’s how they’re going to make money.
twitter
internet
business
search
google
microsoft
Compiling Sphinx on OSX Leopard
may 2009 by threedaymonk
If it’s a 64-bit machine, you might need this.
sphinx
search
osx
gigapedia.com
april 2009 by threedaymonk
Apparently, this is a large repository of, and search engine for, bookwarez.
bookwarez
ebooks
reference
search
Give Google feedback on “noresults” pages
january 2009 by threedaymonk
‘The #1 complaint (20+ comments) was “empty review” sites.’
google
search
reviews
seo
The Xapian Project
july 2008 by threedaymonk
‘Xapian is an Open Source Search Engine Library, released under the GPL.’
search
c++
library
text
open-source
NZBs(dot)ORG
july 2008 by threedaymonk
Simple, though not comprehensive, index of Usenet in the form of NZB files.
usenet
search
nzb
nntp
Ruby/Bsearch: a binary search library for Ruby
march 2008 by threedaymonk
Potentially useful.
algorithm
ruby
search
library
thrudb
january 2008 by threedaymonk
‘Thrudb is a set of simple services built on top of Facebook’s Thrift framework that provides indexing and document storage services for building and scaling websites.’
thrift
database
queue
search
reevoo-developer
Grep in Project command for TextMate
august 2007 by threedaymonk
An excellent replacement for the unsatisfactory built-in search.
textmate
plugin
search
osx
My life without Google
may 2007 by threedaymonk
Something I’ve been thinking about myself.
google
privacy
search
Hyper Estraier: a full-text search system for communities
march 2007 by threedaymonk
Uses both n-grams and morphological analysis.
search
library
open-source
acts_as_solr Rails plugin
february 2007 by threedaymonk
‘[A]dds full text search capabilities and many other nifty features from Apache’s Solr to any Rails model.’
ruby
rails
solr
search
plugin
webdev
lucene
Google, Yahoo and Microsoft Agree to Standard Sitemaps Protocol
november 2006 by threedaymonk
Or, rather, Yahoo and Microsoft agree to standardise on Google’s format.
webdev
search
sitemaps
seo
reevoo
Introducing acts_as_searchable
august 2006 by threedaymonk
Full-text search for Rails applications, using HyperEstraier (which provides morphological analysis and n-gram search).
ruby
rails
database
search
webdev