Changing Bits: Lucene's FuzzyQuery is 100 times faster in 4.0
april 2011 by mcroydon
"There are many exciting improvements in Lucene's eventual 4.0 (trunk) release, but the awesome speedup to FuzzyQuery really stands out, not only from its incredible gains but also because of the amazing behind-the-scenes story of how it all came to be."
lucene
solr
apache
java
search
april 2011 by mcroydon
elasticsearch - blog - 0.15.0 Released
march 2011 by mcroydon
Rolling out some interesting new features.
search
information-retrieval
march 2011 by mcroydon
Yelp Engineering Blog: mrjob: Distributed Computing for Everybody
november 2010 by mcroydon
A really nice wrapper around EMR.
algorithms
amazon
aws
cloudcomputing
computing
data
hadoop
framework
distributed
dist
development
datamining
library
map-reduce
map
mapreduce
nosql
opensource
yelp
webservices
search
reduce
python
programming
aa
elasticmapreduce
logs
emr
mrjob
november 2010 by mcroydon
SpotCloud - Cloud Capacity Clearing House / Spot Market: Home
november 2010 by mcroydon
An excellent example of the "have hay? need hay?" design pattern.
aggregator
cloud
cloud-computing
computing
market
gateway
search
platform
marketplace
iaas
november 2010 by mcroydon
wiki.dbpedia.org : About
october 2010 by mcroydon
Quite a lot of things in the linked open data universe point to dbpedia.
analysis
data
api
database
datamining
datasets
db
free
opensource
ontology
mashup
knowledge
information
programming
rdf
reference
search
semantic
semantic-web
web2.0
web
tools
sparql
semanticweb
semantic_web
wiki
wikipedia
xml
dbpedia
october 2010 by mcroydon
Official Google Blog: Google Instant, behind the scenes
september 2010 by mcroydon
A high-level look at Google Instant.
blog
design
development
engine
experience
search
google
history
optimization
future
gfs2
explanation
searchengine
seo
technology
to-read
web
instant
google
autosuggest
september 2010 by mcroydon
Google search index splits with MapReduce • The Register
september 2010 by mcroydon
Teach the world to Zig then Zag.
algorithms
architecture
bigdata
article
bigtable
gfs
distributed
database
computing
computers
caffeine
google
grid
hadoop
index
indexing
mapreduce
technology
search
scalability
research
programming
colossus
gfs2
news
september 2010 by mcroydon
WhoSampled - Discover and Discuss Music Samples, Remixes and Cover Songs
september 2010 by mcroydon
For some day in the future when I have free time.
archive
copyright
audio
cover
database
fun
dj
hip-hop
remix
reference
music
mp3
history
internet
resource
hiphop
sample
samples
search
songs
video
web2.0
youtube
september 2010 by mcroydon
Living Stories
july 2010 by mcroydon
Really quite a nice, useful, and "sticky" interface.
aggregation
aggregator
google
article
innovation
newspaper
media
news
journalism
labs
internet
interesting
newspapers
nytimes
nyt
onlinejournalism
search
publishing
washingtonpost
timeline
storytelling
stories
startup
july 2010 by mcroydon
Lucid Imagination » [UPDATE] Spatial Search in Apache Lucene and Solr
july 2010 by mcroydon
A look at the state of solr geosearch.
apache
geo
geospatial
lucene
search
solr
spatial
july 2010 by mcroydon
A fast, fuzzy, full-text index using Redis | PlayNice.ly
may 2010 by mcroydon
I really love Redis' primitive types and operations on them.
lucene
rails
redis
search
semanticweb
python
may 2010 by mcroydon
Lucid Imagination » State of Spatial Support in Apache Solr
march 2010 by mcroydon
Native spatial search in Solr is getting *really close*.
geo
lucene
search
solr
spatial
march 2010 by mcroydon
CKAN - Comprehensive Knowledge Archive Network - Home
march 2010 by mcroydon
Powers search at data.co.uk.
archive
community
database
dataset
data
datasets
free
open_source
open
online
knowledge
library
information
opencontent
opendata
opensource
reference
repository
software
semanticweb
search
resources
resource
research
statistics
tools
web
ckan
directory
openknowledge
march 2010 by mcroydon
Deduplication - Solr Wiki
march 2010 by mcroydon
Allowing solr to spot dupes in exact match or near match scenarios.
hash
information_retrieval
lucene
java
programming
search
solr
deduplication
march 2010 by mcroydon
ElasticSearch - Open Source, Distributed, RESTful Search Engine
february 2010 by mcroydon
A document store that happens to also do full-text search. Quite the powerful combination, methinks.
software
open-source
webdev
opensource
java
free
search
distributed
http
rest
open
cloud
source
tool
json
lucene
solr
cloudcomputing
engine
nosql
elastic
query
ha
fulltext
restful
searchengine
service
indexing
elasticsearch
search-engine
february 2010 by mcroydon
Meet The First Miners of the New Social Graph
february 2010 by mcroydon
Have you heard the one about soylent green?
data
tools
research
search
article
networking
analysis
social
twitter
list
graph
mining
metrics
socialmedia
networks
press
contacts
friends
socialgraph
hivemind
february 2010 by mcroydon
Lucid Imagination » The Seven Deadly Sins of Solr
january 2010 by mcroydon
An amusing take on the many things that you might be doing wrong.
search
performance
apache
tips
lucene
solr
ir
january 2010 by mcroydon
toastdriven's queued_search at master - GitHub
january 2010 by mcroydon
Daniel knocks out a reusable app that makes it easy to have Haystack queue updates and deletes so that you get the benefits of near-realtime updates (depending on how you cron up the consumer) without the delays on object save. If you're using Haystack in production, you probably want to be using this.
haystack
search
queues
python
django
solr
january 2010 by mcroydon
Seeking Profit in Open-Source Search Software - Bits Blog - NYTimes.com
january 2010 by mcroydon
This Lucid Imagination fanboi is happy to see them doing so well.
business
opensource
search
analysis
todo
lucene
startups
ip
archive
businessmodels
trend
january 2010 by mcroydon
The On-Line Encyclopedia of Integer Sequences
december 2009 by mcroydon
What it says on the tin really.
programming
reference
tools
database
research
science
algorithms
search
cool
algorithm
interesting
math
education
geek
tool
resource
academic
resources
encyclopedia
sequence
puzzle
maths
mathematics
sequences
integer
numbers
number
integers
series
combinatorics
december 2009 by mcroydon
Faceted Search Nycto Talk
december 2009 by mcroydon
CTOs realize how awesome faceted search is.
search
presentation
solr
information-retrieval
faceted
december 2009 by mcroydon
sbff - Project Hosting on Google Code
december 2009 by mcroydon
Takes the brute force approach but it'll be interesting to see what this comes up with.
web
software
database
opensource
api
java
search
computing
social
datamining
del.icio.us
recommendations
socialmedia
delicious
script
filtering
collaborative
users
bookmarks
bookmarking
experiments
friend
similar
december 2009 by mcroydon
Python Package Index : django-haystack 1.0.0-rc1
november 2009 by mcroydon
Haystack 1.0.0-rc1 on the streets, baby!
haystack
django
search
november 2009 by mcroydon
Commit 80681906d241b6fbfc75fa8022c2bb13f32d5a21 to toastdriven's django-haystack - GitHub
november 2009 by mcroydon
Daniel pushes Haystack 1.0.0rc1 out the door and the crowd goes wild!
django
haystack
search
november 2009 by mcroydon
Spatial Solr / JTeam: Enterprise Java, Open Source, software solutions
november 2009 by mcroydon
Spatial solr bits as an add-on.
solr
search
gis
november 2009 by mcroydon
zoie - Project Hosting on Google Code
november 2009 by mcroydon
Built on top of Apache Lucene and focuses on solving several problems around real-time search and indexing performance.
programming
development
google
library
opensource
free
java
search
architecture
tech
apache
db
lucene
solr
project
indexing
realtime
index
fulltext
mq
linkedin
zoie
november 2009 by mcroydon
Hadoop, Pig, and Twitter (NoSQL East 2009)
november 2009 by mcroydon
An awesome deck showing of Pig and how Twitter uses it.
programming
data
database
howto
server
dev
search
statistics
hadoop
presentation
mapreduce
datamining
twitter
analytics
slides
presentations
nosql
bigdata
socialnetworks
slideshare
hdfs
pig
big
functional
data_mining
november 2009 by mcroydon
A Machine Learning Approach for Improved BM25 Retrieval - Microsoft Research
november 2009 by mcroydon
I love that BM25 is alive and kicking even after all these years.
ranking
paper
search
ir
information-retrieval
november 2009 by mcroydon
Internet Archive: Free Downloads: Text Archive
october 2009 by mcroydon
Recently made available on XO laptops but this is a pretty impressive archive on its own.
web
reference
library
history
opensource
audiobooks
free
book
books
research
search
internet
online
education
audio
text
ebook
resources
download
reading
e-books
literature
ebooks
libraries
archive
digital
archives
texts
openaccess
etexts
october 2009 by mcroydon
String Similarity Metrics for Information Integration
october 2009 by mcroydon
A fine collection of string similarity algorithms.
programming
library
tools
reference
opensource
java
research
search
algorithm
algorithms
statistics
math
computing
analysis
text
comparison
datamining
nlp
similarity
theory
bioinformatics
metrics
mathematics
ir
string
matching
distance
strings
metric
sequence
october 2009 by mcroydon
Training to Climb an Everest of Digital Data
october 2009 by mcroydon
Big data is big and almost always requires a completely different mindset than the one that is taught in computer science programs.
data
database
processing
google
news
toread
ibm
energy
datasets
mining
search
research
science
internet
algorithms
storage
scaling
education
hadoop
analysis
computer-science
datacuration
october 2009 by mcroydon
Syllabus: Information Organization and Retrieval
september 2009 by mcroydon
IO and IR together at last.
search
academic
information-retrieval
ir
september 2009 by mcroydon
"pants on fire" | Search / LJWorld.com
august 2009 by mcroydon
My favorite new Ellington Haystack search example.
ellington
django
search
example
daniellindsleyrocksdahouse
august 2009 by mcroydon
[#SOLR-773] Incorporate Local Lucene/Solr - ASF JIRA
august 2009 by mcroydon
It looks like LocalSolr is tentatively slated to land in Solr 1.5.
gis
search
geo
map
lucene
geography
solr
august 2009 by mcroydon
Python Package Index : topia.termextract 1.1.0
august 2009 by mcroydon
This looks like a solid term tagger similar to tagging functionality in NLTK.
python
programming
software
development
api
tools
library
search
opensource
language
analysis
text
nlp
parsing
linguistics
parser
tagging
english
extraction
taxonomy
grammar
termextraction
term
termextract
libs
pos
topia
license:zpl
august 2009 by mcroydon
New Flickr Search « Flickr Blog
august 2009 by mcroydon
Excellent search interface enhancements.
tools
search
photography
photos
flickr
images
presentations
august 2009 by mcroydon
SIGIR: Meet the Who’s Who of Search and Information Retrieval | The Noisy Channel
july 2009 by mcroydon
Sounds like quite the party.
search
people
july 2009 by mcroydon
A Comparison of Open Source Search Engines
july 2009 by mcroydon
A look at Lucene, Zettair, Sphinx, SQLite, and Xaipan and some benchmarks.
database
opensource
search
comparison
lucene
engine
oss
benchmark
searchengine
searchengines
july 2009 by mcroydon
NUCULAR fielded text searchable indexing: Documentation
june 2009 by mcroydon
Another lightweight Python full-text search engine with a silly if not unfortunate name.
python
software
django
development
database
tools
library
api
xml
opensource
search
text
application
lucene
db
concurrency
indexing
solr
oss
ir
searchengine
fulltext
whoosh
nucular
june 2009 by mcroydon
Code: Flickr Developer Blog » Building Fast Client-side Searches
june 2009 by mcroydon
Fast == good.
programming
web
development
code
data
javascript
webdev
api
dev
search
xml
ajax
scalability
performance
cache
flickr
optimization
json
caching
parsing
regex
js
fast
yui
bestpractices
speed
autocomplete
clientside
optimisation
eval
june 2009 by mcroydon
mcroydon's django-tumbleweed at master - GitHub
june 2009 by mcroydon
Tumbleweed is essentially a framework for writing your own tumblelog using data denormalized in Haystack. It leans heavily on Haystack and the underlying search backend and is currently only recommended to be used with Solr.
django
tumblelog
tumble
haystack
solr
search
june 2009 by mcroydon
ghetto-fts - Google Code
june 2009 by mcroydon
<cartman>In the ghetto...</cartman>
python
search
text
filesystem
indexing
index
fulltext
june 2009 by mcroydon
katta - distributed lucene
june 2009 by mcroydon
"Katta serves large, replicated, Lucene indexes as shards to serve high loads and very large data sets."
software
development
java
search
scalability
performance
scaling
distributed
apache
hadoop
clustering
cloud
grid
lucene
tool
indexing
project
ir
searchengine
information-retrieval
dist
package
hdfs
katta
june 2009 by mcroydon
Cloudera's Distribution for Hadoop | Cloudera
june 2009 by mcroydon
Includes lots of feature tickets that are pretty stable but not yet in a Hadoop release. It reminds me a lot of Debian unstable or Ubuntu a month or so before release. Good stuff indeed. Includes RPM and APT package management options.
software
data
linux
google
search
aws
distributed
computing
ec2
hadoop
cloud
mapreduce
cloudcomputing
distribution
clusters
packaging
cloudera
cloud-computing
rpm
apt
june 2009 by mcroydon
Full-text search across multiple Django models using Djapian/Xapian | Nomad Journey
may 2009 by mcroydon
On full-text search using Xapian and Djapian.
django
search
may 2009 by mcroydon
Buy women's shoes boots handbags | Shop visually for women men's shoes clothing | Like.com
may 2009 by mcroydon
Very nice interface with hardcore compsci underpinnings.
design
web
tools
business
visualization
technology
search
ajax
photo
image
interface
photos
comparison
shopping
engine
inspiration
aggregator
shop
clothing
images
searchengine
ecommerce
shoes
clothes
fashion
like
visual
e-commerce
may 2009 by mcroydon
FastBit: An Efficient Compressed Bitmap Index Technology
may 2009 by mcroydon
What it says on the tin.
programming
software
database
data
research
library
opensource
search
datamining
grid
db
compression
indexing
index
hashing
bit
algo
data-mining
compress
fastbit
olap
bitmap
multidimensional
columndb
may 2009 by mcroydon
See Wolfram Alpha in Action: Our Screenshots - ReadWriteWeb
may 2009 by mcroydon
Pretty impressive stuff. Can't wait to poke holes in it.
reference
tools
google
technology
search
internet
interesting
web2.0
awesome
inspiration
searchengine
semanticweb
preview
wolfram
wolframalpha
web3.0
alpha
may 2009 by mcroydon
Apache Mahout - Taste Documentation
april 2009 by mcroydon
Collaborative filtering as part of the Mahout project. Also includes a web services interface for interfacing with non-Java stuff.
software
java
search
algorithm
cluster
apache
mapreduce
machinelearning
engine
filtering
recommendation
webservice
recommendations
mahout
april 2009 by mcroydon
Trie - Wikipedia, the free encyclopedia
april 2009 by mcroydon
There's CS in my tabs that has to go.
development
programming
reference
data
database
wikipedia
science
computer
wiki
search
ajax
algorithms
algorithm
cs
hash
binary
data-structures
tree
academics
datastructure
structure
datastructures
theory
trie
data-structure
trees
structures
data_structure
april 2009 by mcroydon
Slides from "MySQL and Search at Craigslist" (by Jeremy Zawodny)
april 2009 by mcroydon
A nice peek at what makes Craigslist tick.
video
search
scalability
presentation
mysql
optimization
slides
slide
craigslist
sphinx
april 2009 by mcroydon
Cloudera Hadoop & Big Data Blog » Blog Archive » Using Hadoop to Annotate Billions of Web Documents with Semantics
april 2009 by mcroydon
Not sure how much meat there is in this article, but what they're doing sounds awesome.
search
scaling
hadoop
mapreduce
examples
semantic
april 2009 by mcroydon
pHash.org: Home of pHash, the open source perceptual hash library
april 2009 by mcroydon
To play with later.
license:gpl
license:
gplv3
programming
development
open-source
code
video
media
library
api
graphics
search
opensource
algorithms
algorithm
audio
ai
multimedia
image
c++
c
hash
sound
hashing
copyright
similarity
images
fingerprint
imaging
perceptual
phash
perceptualhash
april 2009 by mcroydon
Toast Driven - Announcing Haystack - Modular search for Django
april 2009 by mcroydon
Daniel announces haystack. Yay Daniel!
django
search
license:mit
haystack
april 2009 by mcroydon
Add Full-Text Search to your Django project with Whoosh
march 2009 by mcroydon
Pretty straightforward.
python
programming
howto
django
search
blog
tips
fulltext
whoosh
march 2009 by mcroydon
Scaling Lucene and Solr | Lucid Imagination
march 2009 by mcroydon
Wow, I thought I had seen all of the goodies on the lucid imagination site. Nice find, Joseph.
programming
development
linux
java
search
sysadmin
scalability
scaling
article
performance
optimization
lucene
solr
oss
ir
information-retrieval
tomcat
tuning
march 2009 by mcroydon
KMWorld.com: Designing for faceted search
march 2009 by mcroydon
Some thoughts when designing for faceted search.
design
search
march 2009 by mcroydon
Pragmatic Programming Techniques: Solving TF-IDF using Map-Reduce
february 2009 by mcroydon
This is pretty simple and would apply to CouchDB pretty easily.
programming
search
algorithm
mapreduce
ir
february 2009 by mcroydon
duetopia - Google Code
february 2009 by mcroydon
Interesting, though I don't know exactly what it all means.
geo
gis
search
metadata
geodjango
february 2009 by mcroydon
Building and Scaling a Startup on Rails: 12 Things We Learned the Hard Way - Axon Flux - A Ruby on Rails Blog
february 2009 by mcroydon
This has fun illustrations, therefore I shall read it later.
programming
web
development
database
howto
webdev
ruby
amazon
rails
search
scalability
coding
storage
ruby-on-rails
scaling
s3
startup
performance
deployment
article
optimization
tips
cloud
advice
startups
rubyonrails
memcache
scale
ror
february 2009 by mcroydon
Xapian performance comparision with Whoosh « Searching with Xapian
february 2009 by mcroydon
Whoosh still rocks my world.
python
search
information-retrieval
february 2009 by mcroydon
related tags
3d ⊕ aa ⊕ academic ⊕ academics ⊕ acquisition ⊕ advice ⊕ aggregation ⊕ aggregator ⊕ ai ⊕ airlines ⊕ airplane ⊕ airport ⊕ ajax ⊕ algo ⊕ algorithims ⊕ algorithm ⊕ algorithms ⊕ alpha ⊕ amazon ⊕ america ⊕ analysis ⊕ analytics ⊕ analyzers ⊕ aol ⊕ apache ⊕ api ⊕ apis ⊕ app ⊕ appengine ⊕ application ⊕ apt ⊕ architecture ⊕ archive ⊕ archives ⊕ article ⊕ articles ⊕ audio ⊕ audiobooks ⊕ autocomplete ⊕ autosuggest ⊕ awesome ⊕ aws ⊕ bar ⊕ barcode ⊕ barcodes ⊕ basics ⊕ benchmark ⊕ bestpractices ⊕ big ⊕ bigdata ⊕ bigtable ⊕ binary ⊕ bioinformatics ⊕ bit ⊕ bitmap ⊕ blog ⊕ book ⊕ bookmarking ⊕ bookmarks ⊕ books ⊕ boss ⊕ browser ⊕ business ⊕ businessmodels ⊕ c ⊕ c++ ⊕ cache ⊕ caching ⊕ caffeine ⊕ census ⊕ cities ⊕ ckan ⊕ classification ⊕ clientside ⊕ clothes ⊕ clothing ⊕ cloud ⊕ cloud-computing ⊕ cloudcomputing ⊕ cloudera ⊕ cluster ⊕ clustering ⊕ clusters ⊕ cms ⊕ code ⊕ codes ⊕ coding ⊕ collaboration ⊕ collaborative ⊕ collective ⊕ colocation ⊕ colossus ⊕ columndb ⊕ combinatorics ⊕ community ⊕ comparison ⊕ compress ⊕ compression ⊕ compsci ⊕ computer ⊕ computer-science ⊕ computers ⊕ computerscience ⊕ computervision ⊕ computing ⊕ concurrency ⊕ conference ⊕ conferences ⊕ contacts ⊕ content ⊕ conversion ⊕ cool ⊕ cooliris ⊕ copyright ⊕ corpus ⊕ couchdb ⊕ cover ⊕ craigslist ⊕ cs ⊕ cuecat ⊕ daniellindsleyrocksdahouse ⊕ data ⊕ data-mining ⊕ data-structure ⊕ data-structures ⊕ database ⊕ databases ⊕ datacuration ⊕ datamining ⊕ dataset ⊕ datasets ⊕ datastructure ⊕ datastructures ⊕ data_mining ⊕ data_structure ⊕ db ⊕ dbpedia ⊕ dbs ⊕ deduplication ⊕ dekstop ⊕ del.icio.us ⊕ delicious ⊕ demographics ⊕ deployment ⊕ design ⊕ designpatterns ⊕ dev ⊕ development ⊕ digital ⊕ directory ⊕ dist ⊕ distance ⊕ distributed ⊕ distribution ⊕ dj ⊕ django ⊕ django-apps ⊕ doc ⊕ document ⊕ download ⊕ e-books ⊕ e-commerce ⊕ ean ⊕ ebook ⊕ ebooks ⊕ ec2 ⊕ ecommerce ⊕ education ⊕ elastic ⊕ elasticmapreduce ⊕ elasticsearch ⊕ ellington ⊕ emacs ⊕ emr ⊕ encyclopedia ⊕ energy ⊕ engine ⊕ english ⊕ entity-extraction ⊕ erlang ⊕ etexts ⊕ eval ⊕ example ⊕ examples ⊕ experience ⊕ experiments ⊕ explanation ⊕ extension ⊕ extensions ⊕ extraction ⊕ faceted ⊕ facetedsearch ⊕ facets ⊕ family ⊕ fashion ⊕ fast ⊕ fastbit ⊕ fcc ⊕ feedster ⊕ ferrett ⊕ filesystem ⊕ filesystems ⊕ filtering ⊕ find ⊕ fingerprint ⊕ firefox ⊕ flickr ⊕ flight ⊕ flights ⊕ flv ⊕ food ⊕ foss ⊕ framework ⊕ free ⊕ freebase ⊕ friend ⊕ friends ⊕ fte ⊕ fulltext ⊕ fun ⊕ function ⊕ functional ⊕ future ⊕ gadgets ⊕ gae ⊕ gallery ⊕ game ⊕ garden ⊕ gate ⊕ gateway ⊕ geek ⊕ genealogy ⊕ geo ⊕ geocoding ⊕ geodjango ⊕ geography ⊕ geolocation ⊕ geometry ⊕ georss ⊕ geospatial ⊕ geowanking ⊕ gfs ⊕ gfs2 ⊕ gis ⊕ gnome ⊕ google ⊕ gov ⊕ government ⊕ gplv3 ⊕ grammar ⊕ graph ⊕ graphics ⊕ grid ⊕ groonga ⊕ guide ⊕ guidelines ⊕ ha ⊕ hacking ⊕ hadoop ⊕ hash ⊕ hashing ⊕ haystack ⊕ hdfs ⊕ headlines ⊕ help ⊕ hip-hop ⊕ hiphop ⊕ history ⊕ hivemind ⊕ hosting ⊕ house ⊕ howto ⊕ html ⊕ http ⊕ hyperlocal ⊕ ia ⊕ iaas ⊕ ibm ⊕ ils ⊕ image ⊕ images ⊕ imaging ⊕ import ⊕ imported ⊕ index ⊕ indexing ⊕ inex ⊕ information ⊕ information-extraction ⊕ information-retrieval ⊕ informationretrieval ⊕ information_extraction ⊕ information_retrieval ⊕ innovation ⊕ inspiration ⊕ instant ⊕ integer ⊕ integers ⊕ intelligence ⊕ interesting ⊕ interface ⊕ internet ⊕ ip ⊕ ir ⊕ jacob ⊕ java ⊕ javascript ⊕ jellyroll ⊕ journalism ⊕ js ⊕ json ⊕ jsonp ⊕ katta ⊕ knowledge ⊕ labs ⊕ language ⊕ language:c ⊕ LBS ⊕ learning ⊕ libraries ⊕ library ⊕ libs ⊕ license: ⊕ license:gpl ⊕ license:lgpl ⊕ license:mit ⊕ license:zpl ⊕ like ⊕ lingpipe ⊕ linguistics ⊕ linkeddata ⊕ linkedin ⊕ linux ⊕ list ⊕ literature ⊕ local ⊕ location ⊕ logs ⊕ lsa ⊕ lucene ⊕ machine ⊕ machine-learning ⊕ machinelearning ⊕ mahout ⊕ manning ⊕ map ⊕ map-reduce ⊕ mapping ⊕ mapreduce ⊕ maps ⊕ market ⊕ marketplace ⊕ markup ⊕ mashup ⊕ matching ⊕ math ⊕ mathematics ⊕ maths ⊕ media ⊕ memcache ⊕ metadata ⊕ metaweb ⊕ metric ⊕ metrics ⊕ microsoft ⊕ mining ⊕ mobile ⊕ model ⊕ mono ⊕ morelikethis ⊕ motion ⊕ mowser ⊕ mp3 ⊕ mq ⊕ mrjob ⊕ multidimensional ⊕ multimedia ⊕ music ⊕ mysql ⊕ natural ⊕ natural-language ⊕ navigation ⊕ netflix ⊕ network ⊕ networking ⊕ networks ⊕ news ⊕ newsmedia ⊕ newspaper ⊕ newspapers ⊕ newyorktimes ⊕ nlp ⊕ nosql ⊕ nucular ⊕ number ⊕ numbers ⊕ nyt ⊕ nytimes ⊕ olap ⊕ online ⊕ onlinejournalism ⊕ ontology ⊕ open ⊕ open-source ⊕ openaccess ⊕ opencontent ⊕ opendata ⊕ openknowledge ⊕ opensource ⊕ open_source ⊕ optimisation ⊕ optimization ⊕ optimize ⊕ oss ⊕ p2p ⊕ package ⊕ packaging ⊕ pagerank ⊕ palm ⊕ paper ⊕ papers ⊕ parser ⊕ parsing ⊕ pathfinding ⊕ pda ⊕ pdf ⊕ people ⊕ perceptual ⊕ perceptualhash ⊕ performance ⊕ phash ⊕ phone ⊕ phones ⊕ photo ⊕ photography ⊕ photos ⊕ physics ⊕ pig ⊕ planes ⊕ planning ⊕ platform ⊕ plugin ⊕ plugins ⊕ population ⊕ porn ⊕ portal ⊕ pos ⊕ postgresql ⊕ prediction ⊕ presentation ⊕ presentations ⊕ press ⊕ preview ⊕ processing ⊕ product ⊕ profiling ⊕ programming ⊕ project ⊕ projects ⊕ publishing ⊕ puzzle ⊕ python ⊕ quadtree ⊕ query ⊕ queues ⊕ rails ⊕ ranking ⊕ rdbms ⊕ rdf ⊕ read-later ⊕ reading ⊕ realtime ⊕ recommendation ⊕ recommendations ⊕ redis ⊕ reduce ⊕ reference ⊕ regex ⊕ related ⊕ remix ⊕ replace ⊕ repository ⊕ research ⊕ resource ⊕ resources ⊕ rest ⊕ restful ⊕ reviews ⊕ robotics ⊕ robots ⊕ roogle ⊕ ror ⊕ rpm ⊕ rss ⊕ ruby ⊕ ruby-on-rails ⊕ rubyonrails ⊕ s3 ⊕ safari ⊕ sample ⊕ samples ⊕ scala ⊕ scalability ⊕ scale ⊕ scaling ⊕ science ⊕ scripps ⊕ script ⊕ search ⊖ search-engine ⊕ searchengine ⊕ searchengines ⊕ searching ⊕ semantic ⊕ semantic-web ⊕ semanticweb ⊕ semantic_web ⊕ seo ⊕ sequence ⊕ sequences ⊕ series ⊕ series-60 ⊕ server ⊕ service ⊕ shoes ⊕ shop ⊕ shopping ⊕ similar ⊕ similarity ⊕ simpledb ⊕ slide ⊕ slides ⊕ slideshare ⊕ small ⊕ sms ⊕ SOAP ⊕ social ⊕ socialgraph ⊕ socialmedia ⊕ socialnetworks ⊕ socialsoftware ⊕ software ⊕ solr ⊕ songs ⊕ sound ⊕ source ⊕ sparql ⊕ spatial ⊕ speed ⊕ spellcheck ⊕ spelling ⊕ sphinx ⊕ sql ⊕ standards ⊕ startup ⊕ startups ⊕ statistics ⊕ stats ⊕ storage ⊕ stories ⊕ storytelling ⊕ string ⊕ strings ⊕ structure ⊕ structures ⊕ svd ⊕ symbian ⊕ sysadmin ⊕ tagging ⊕ taxonomy ⊕ tech ⊕ technology ⊕ telephone ⊕ tentacle ⊕ tentacle-porn ⊕ term ⊕ termextract ⊕ termextraction ⊕ text ⊕ text-mining ⊕ textmining ⊕ texts ⊕ theory ⊕ timeline ⊕ times ⊕ tips ⊕ to-read ⊕ todo ⊕ tokenizer ⊕ tokyo ⊕ tokyocabinet ⊕ tomcat ⊕ tool ⊕ tools ⊕ topia ⊕ toread ⊕ travel ⊕ tree ⊕ trees ⊕ trend ⊕ trie ⊕ tumble ⊕ tumblelog ⊕ tuning ⊕ tutorial ⊕ tutorials ⊕ twitter ⊕ ucc ⊕ ui ⊕ upc ⊕ usability ⊕ users ⊕ utilities ⊕ ux ⊕ via:chl ⊕ via:jacobian ⊕ video ⊕ virtualization ⊕ visual ⊕ visualization ⊕ vps ⊕ wap ⊕ washingtonpost ⊕ web ⊕ web-services ⊕ web2.0 ⊕ web3.0 ⊕ webdesign ⊕ webdev ⊕ weblogs ⊕ webservice ⊕ webservices ⊕ whoosh ⊕ wiki ⊕ wikipedia ⊕ wolfram ⊕ wolframalpha ⊕ work ⊕ xaipan ⊕ xhtml ⊕ xml ⊕ yahoo ⊕ yelp ⊕ youtube ⊕ yui ⊕ zoie ⊕ 全文検索 ⊕Copy this bookmark: