howthebodyworks + search 205
RapidLibrary - Your Media Search Engine
6 weeks ago by howthebodyworks
Rule 44 is that if you can think of it, you can download it from rapidshare. This search engine helps you do that.
avcontent
ebook
search
from delicious
6 weeks ago by howthebodyworks
Terrier IR Platform v3.5 - Homepage
december 2011 by howthebodyworks
Terrier is a highly flexible, efficient, and effective open source search engine, readily deployable on large-scale collections of documents. Terrier implements state-of-the-art indexing and retrieval functionalities, and provides an ideal platform for the rapid development and evaluation of large-scale retrieval applications.
Terrier is open source, and is a comprehensive, flexible and transparent platform for research and experimentation in text retrieval. Research can easily be carried out on standard TREC and CLEF test collections.
nlp
java
search
from delicious
Terrier is open source, and is a comprehensive, flexible and transparent platform for research and experimentation in text retrieval. Research can easily be carried out on standard TREC and CLEF test collections.
december 2011 by howthebodyworks
Apache Lucy
november 2011 by howthebodyworks
python-happy fultext search lucene-style, without java. "Apache Lucy is full-text search engine library written in C and targeted at dynamic languages. It is a "loose C" port of Apache Lucene™, a search engine library for Java."
lucene
opensource
search
from delicious
november 2011 by howthebodyworks
SimString - A fast and simple algorithm for approximate string matching/retrieval
june 2011 by howthebodyworks
SimString has the following features:
Fast algorithm for approximate string retrieval. For example, SimString can find strings in Google Web1T unigrams (13,588,391 strings) that have cosine similarity ≧0.7 in 1.10 [ms] per query (on Intel Xeon 5140 2.33 GHz CPU).
100% exact retrieval. Although some algorithms allow misses (false positives) for faster query response, SimString is guaranteed to achieve 100% correct retrieval with fast query response.
Unicode (wchar_t) support. For languages using multi-byte characters, developers can use Unicode characters (wchar_t) instead of single-byte characters (char) as a character representation.
Implementation in C++ header files. Developers can add the funtionality of approximate string retrieval into C++ programs just by including a header file.
Python and Ruby bindings via SWIG. Developers can easily perform approximate string retrieval in scripting languages.
nlp
search
python
ruby
c++
performance
Fast algorithm for approximate string retrieval. For example, SimString can find strings in Google Web1T unigrams (13,588,391 strings) that have cosine similarity ≧0.7 in 1.10 [ms] per query (on Intel Xeon 5140 2.33 GHz CPU).
100% exact retrieval. Although some algorithms allow misses (false positives) for faster query response, SimString is guaranteed to achieve 100% correct retrieval with fast query response.
Unicode (wchar_t) support. For languages using multi-byte characters, developers can use Unicode characters (wchar_t) instead of single-byte characters (char) as a character representation.
Implementation in C++ header files. Developers can add the funtionality of approximate string retrieval into C++ programs just by including a header file.
Python and Ruby bindings via SWIG. Developers can easily perform approximate string retrieval in scripting languages.
june 2011 by howthebodyworks
Nullege: A Search Engine for Python source code
may 2011 by howthebodyworks
search for python code, kaggle style.
python
search
may 2011 by howthebodyworks
Sally - A Tool for Embedding Strings in Vector Spaces
may 2011 by howthebodyworks
string embedding, fast. ouputs to libsvm and matlab etc formats
nlp
search
c
may 2011 by howthebodyworks
Glassdoor.com – an inside look at jobs & companies
march 2011 by howthebodyworks
curious. collaborative corporate spying for mutual gain of employees and employers. So I'm told.
search
jobs
business
money
march 2011 by howthebodyworks
John Lamp's IS Pages ::
february 2011 by howthebodyworks
In Australia you need to know which code you are and which code the journals are. if you try to submit to one in the wrong field you endanger your whole unit's funding. This academic has made a nice interface to them all, although as it was not a publication in his journals, that time was putting his funding at risk by not going into publication quotas.
journal
academic
australia
search
february 2011 by howthebodyworks
PageRankGraph
january 2011 by howthebodyworks
pretty viz of where your traffic is from, more or less.
sea
visualization
search
january 2011 by howthebodyworks
DuckDuckGo
january 2011 by howthebodyworks
google without tracking or, er, features.
search
google
web
parsimony
ui
january 2011 by howthebodyworks
NUCULAR fielded text searchable indexing: Documentation
december 2010 by howthebodyworks
facet and field search library for python.
python
search
api
december 2010 by howthebodyworks
Find conferences to speak at with Lanyrd | The Lanyrd Blog
december 2010 by howthebodyworks
now you can find things to be invited to, not just at
conference
event
search
december 2010 by howthebodyworks
Thias の blog » Latex Spotlight Importer
november 2010 by howthebodyworks
purportedly non-ugly LaTeX indexing for spotlight
latex
spotlight
search
november 2010 by howthebodyworks
Cheap flights from easyJet, Air Asia, JetBlue, WestJet, AirBerlin, Jetstar and more
october 2010 by howthebodyworks
crzay-friendly travel search tool with all manner of nice visual ways of optimising your budget.
travel
search
via:jeanpoole
via:dpwolf
visualization
october 2010 by howthebodyworks
Catherine Devlin: The all-seeing milk carton
september 2010 by howthebodyworks
anti sex traffickers plant to crawl sex sites looking for kidnap victims
privacy
surveillance
computer_vision
search
sex
activism
september 2010 by howthebodyworks
Stack Exchange - Free, Community-Powered Q&A
september 2010 by howthebodyworks
teh stackexchange mega hub, whee the diaspora of questionably integrate sites that look a bit like the amazing stackoverflow.com site hang out and share expertise.
q&a
collaborative
search
learning
knowledge
september 2010 by howthebodyworks
Home - Quora
september 2010 by howthebodyworks
another Q&A site. unlike stackexchange it doesn't artificially segment communities but lets topics organically grow. unlike vark.com it emphasises collaboration and reputation, and integrates a bit better with social media. And there are lots of nerds i know on it. interesting.
collaborative
search
knowledge
learning
academic
crowdsourcing
q&a
september 2010 by howthebodyworks
Rebuilding Spotlight’s Index on OS X (Manually) | Walt-O-Matic
september 2010 by howthebodyworks
pretty much everything you need to know about fixing fucked spotlight indexes from the command like, the way it should be done.
spotlight
osx
howto
cli
search
september 2010 by howthebodyworks
Google Alarm | F.A.T.
september 2010 by howthebodyworks
a thingy which points out when google is monitoring your behaviour in your browser
privacy
google
search
chrome
browser
firefox
surveillance
september 2010 by howthebodyworks
Google Scribe
september 2010 by howthebodyworks
automatic writing cliche manufacturer, based on the most common phrases in use in the global datasphere at this point in time
google
english
writing
editor
search
markov
parkingsun
hiphop
september 2010 by howthebodyworks
Predicting Development of Research in Music Based on Parallels with Natural Language Processing
august 2010 by howthebodyworks
the future of music processing is nlp?
nlp
mir
audio
dsp
search
classification
august 2010 by howthebodyworks
ElasticSearch - Open Source, Distributed, RESTful Search Engine
june 2010 by howthebodyworks
REESTful auto-redundancy search engine. I think it still uses Lucene under the hood.
search
java
rest
http
june 2010 by howthebodyworks
A fast, fuzzy, full-text index using Redis | PlayNice.ly
may 2010 by howthebodyworks
fast and loose homophone happy fulltext search in redis.
search
language
redis
nosql
may 2010 by howthebodyworks
Visual Ack
may 2010 by howthebodyworks
cocoa/ osx ACK UI
search
coding
textmate
cocoa
osx
editor
ack
may 2010 by howthebodyworks
Basic text search with relevancy for MongoDB.
april 2010 by howthebodyworks
Basic text search with relevancy for MongoDB.
search
mapreduce
mongodb
april 2010 by howthebodyworks
HopStop.com - Subway Directions and Bus Directions for New York City (NYC)
february 2010 by howthebodyworks
universal US public transport directions
urban
travel
reference
search
usa
february 2010 by howthebodyworks
Spezify
january 2010 by howthebodyworks
pretty, visually smart search visualisation
via:datakid
search
visualization
ui
mashup
january 2010 by howthebodyworks
HINT.FM: Web Seer
january 2010 by howthebodyworks
google autocomplete tastiness
search
google
grammarthing
poetry
january 2010 by howthebodyworks
Aaron Johnson – How MoreLikeThis Works in Lucene
december 2009 by howthebodyworks
lucene's similarity matching
nlp
search
java
lucene
december 2009 by howthebodyworks
sitemap-generators - Project Hosting on Google Code
december 2009 by howthebodyworks
It's not clear, but it seems google's python sitemap generator is now hosted here, not in the 3 year stale version on sourceforge
python
xml
search
possumpalace
december 2009 by howthebodyworks
Simple Text Query Version 1.3
november 2009 by howthebodyworks
get DOI from fulltext string
reference
search
parser
citation
journal
academic
rest
november 2009 by howthebodyworks
The Third Bit » Blog Archive » Reverse Engineering a Bibliography
november 2009 by howthebodyworks
some nice tips here for automating tedious citation metadata parsing
academic
howto
search
citation
productivity
november 2009 by howthebodyworks
DocSavage's appengine-search at master - GitHub
november 2009 by howthebodyworks
porter stem your way to happines with a home-baked google app engine search
gae
search
november 2009 by howthebodyworks
UsingMac.com - Mac Tricks and Tips, Wallpapers and Applications for Mac Users
november 2009 by howthebodyworks
spotlight supports google-esque term faceting
spotlight
search
osx
howto
november 2009 by howthebodyworks
USEFULL MUSIC APPLICATIONS chords
november 2009 by howthebodyworks
what chord is that?
music
composition
search
november 2009 by howthebodyworks
DeepDyve: Like iTunes for Scientific Papers
november 2009 by howthebodyworks
missing the sweet, sweet journal access of academia? This claims to have better search than most journals and allows discount 1-day article rentals. I'd be curious to hear how good it was in fact.
academic
search
tagging
reference
science
november 2009 by howthebodyworks
Proxy
october 2009 by howthebodyworks
"Proxy develops and extends sophisticated architectural tools to conceive of design as a search. Designing "options" in parallel is nothing new, but by innovating flexible tools for design, we are able to massively expand the information being considered and generated. This enriches the design process and provides valuable feedback and contingency. With computation, we consider an incredible number of variations and options. We actively search through a space of possibilities to find unique solutions."
design
technology
architecture
3d
modeling
search
simulation
october 2009 by howthebodyworks
Welcome to Shazam
october 2009 by howthebodyworks
I keep forgetting its name. this is the service that uses your phone microphone to tell you what track you are listening to in the background
music
search
mp3
portable
october 2009 by howthebodyworks
Suburb View : Australian Real Estate Search for Google Earth and Google Maps
october 2009 by howthebodyworks
very, very useful. real estate analytics from across australia in a reasonably parseable from. usage rights would be interesting to know too.
australia
geography
money
shopping
statistics
netcultures
search
mapping
mashup
october 2009 by howthebodyworks
Quick Search Box Plugins by Aaron Ecay
october 2009 by howthebodyworks
general scripty quick search box plugins
qsb
plugin
search
google
osx
october 2009 by howthebodyworks
80Legs: A Web Crawler as a Service
october 2009 by howthebodyworks
pay someone to crawl the web for you. neat.
search
http
aggregator
october 2009 by howthebodyworks
Google Public Sector
september 2009 by howthebodyworks
There's some hidden gems in google's trite re-branding as a government service provider. Their ideascale+forum thing, "moderator" looks interesting.
via:cogat
gov2.0
google
search
democracy
collaborative
september 2009 by howthebodyworks
Full Text Search in Mongo - MongoDB
september 2009 by howthebodyworks
mongodb seems to have perfunctory support for full-text indices, but it does look easily extensible. (or at least, you can extend it in an easy language without learning some weird vendor-specific SQL variant.)
mongodb
nlp
search
parsing
javascript
nosql
september 2009 by howthebodyworks
Google PageRank Powers Endangered Species Hierarchy
september 2009 by howthebodyworks
Hm - an intersting use of google's pagerank algorithm. But I wonder how it performs under conditions of low information? (there is no food-web-spider)
search
google
ecology
sustainability
september 2009 by howthebodyworks
Language Log » The Google Books Settlement
september 2009 by howthebodyworks
Some choice snippets about he google books settlement as an IP watershed - both as an economics-of-books-issue, and as intersting problem for, say, researching language corpii.
ip
library
book
google
search
language
netcultures
september 2009 by howthebodyworks
How I Located a Camera in your Back Yard | Huy Nguyen
august 2009 by howthebodyworks
An interesting combo google-hack plus python hack that locates unsecured webcamera on the web and in reality.
security
privacy
search
geocoding
netcultures
python
august 2009 by howthebodyworks
Python Package Index : topia.termextract 1.1.0
august 2009 by howthebodyworks
from my perspective, a yahoo term extraction replacement. Also apparently light POS tagging.
nlp
language
python
opensource
search
parser
tagging
august 2009 by howthebodyworks
GeoCommons Finder!
august 2009 by howthebodyworks
awesome search on public geographic datasets
netcultures
search
gis
mapping
mashup
august 2009 by howthebodyworks
Google Maps Mania
august 2009 by howthebodyworks
some kind of twisted fan site for google maps.
netcultures
mapping
google
visualization
search
gps
august 2009 by howthebodyworks
SEO Boost from 404 Plugin? Focus on humans
august 2009 by howthebodyworks
A nice hack - this searches your site for alternative content shoudl punters link somewhere incorrect...
apache
cpod
php
google
search
seo
august 2009 by howthebodyworks
fizy / videomusic.
july 2009 by howthebodyworks
find instant links to tracks online
search
music
mp3
july 2009 by howthebodyworks
Scryve - Home
july 2009 by howthebodyworks
"Want to know who you're buying from?
Want to know who owns them?Want to buy from people who are good to the world?
We do. That's why we created this site
Scryve is a research and vetting engine for environmental and social responsibility. We rate companies and provide you with alternatives options to use whenever you don't like a company. We've got research on over 3500 companies from industry leading CSR firm, KLD, paid researchers and people like you."
money
sustainability
ethics
policy
search
shopping
wiki
Want to know who owns them?Want to buy from people who are good to the world?
We do. That's why we created this site
Scryve is a research and vetting engine for environmental and social responsibility. We rate companies and provide you with alternatives options to use whenever you don't like a company. We've got research on over 3500 companies from industry leading CSR firm, KLD, paid researchers and people like you."
july 2009 by howthebodyworks
Scirus - for scientific information
july 2009 by howthebodyworks
quasi-structured metadata search across scientific publications databases.
search
academic
commercial
research
metadata
july 2009 by howthebodyworks
Google Quick Search Box
june 2009 by howthebodyworks
it seems that google are building quicksilver. except not open. and google intergrated. and not abandonware...
opensource
osx
ui
search
quicksilver
june 2009 by howthebodyworks
Django snippets: Full-Text Searchable Models
may 2009 by howthebodyworks
outstanding postgres search snippet, bringing it to be very close to a first-class ORM citizen.
django
search
phm
orm
postgresql
may 2009 by howthebodyworks
Google Torrent Search
april 2009 by howthebodyworks
google custom search for torrents. seems rather comprehensive.
google
search
p2p
torrent
april 2009 by howthebodyworks
bfirsh's django at search - GitHub
april 2009 by howthebodyworks
native db-backed fulltext search queries for django
django
search
orm
phm
mysql
postgresql
april 2009 by howthebodyworks
Haystack - Modular search for Django
april 2009 by howthebodyworks
har. yet another fucking django search option. claims to be the greatest thus far and the thoughtfulness feature list seems to support that. (similarity ranking?) me, I'm stoked that I don't have to investigate this one, blessed be outsourcing. includes pure-python Whoosh backend.
python
django
search
phm
via:cogat
solr
april 2009 by howthebodyworks
staticICE :: Australia's comprehensive computer hardware and gadget price comparison search engine
april 2009 by howthebodyworks
price comparator for australian gizmos
search
shopping
geek
technology
hardware
australia
april 2009 by howthebodyworks
Ideological Search
april 2009 by howthebodyworks
the terrifying thing about this april fools joke is that it appears to actually function.
yahoo
search
policy
april 2009 by howthebodyworks
openmeta - Google Code
march 2009 by howthebodyworks
awesome - API access to file tagging from your objc apps., and get spotlight indexing.
opensource
osx
framework
cocoa
tagging
metadata
spotlight
search
objc
march 2009 by howthebodyworks
Ironic Software
march 2009 by howthebodyworks
nice looking bundles of tagging, search and analysis apps for your files. (search fiels by tags, images by colour and so on). Also includes an opensource metadata/tagging suite.
commercial
osx
productivity
search
tagging
pdf
archives
march 2009 by howthebodyworks
Aardvark
march 2009 by howthebodyworks
social question answering, where you trade answering questions for answers to questions. fast and interacts with mobile devices, IM, etc, and found me the best pastries in Adelaide.
social
community
knowledge
web2.0
q&a
collaborative
search
learning
crowdsourcing
march 2009 by howthebodyworks
djangosearch - Google Code
march 2009 by howthebodyworks
another slightly different django search backend.
opensource
django
search
phm
solr
march 2009 by howthebodyworks
Python Library for Google Search - good coders code, great reuse
march 2009 by howthebodyworks
python google search scraping bypassing API limits
search
google
python
opensource
march 2009 by howthebodyworks
HubLog: Guardian Lucene = Similar Articles Categorisation
march 2009 by howthebodyworks
the reverse of what we are doing at PHM, and devilishly cunning, although not the first time it's been done. Generates similarity ranging *and tags* by parsing a large corpus of text and using similarity indices.
via:simonw
lucene
solr
phm
classification
language
tagging
search
march 2009 by howthebodyworks
Google Agency Toolkit
march 2009 by howthebodyworks
one stop google-style analytics/SEO shop
search
google
marketing
seo
analytics
march 2009 by howthebodyworks
String Similarity Metrics for Information Integration
march 2009 by howthebodyworks
java-based string metrics implementations
opensource
phm
search
language
nlp
java
march 2009 by howthebodyworks
ichm - Google Code
march 2009 by howthebodyworks
leopardy search-friendy chm GUI. Still maintained, unlike CHMOX, and has a superior feature-set
chm
osx
opensource
cocoa
search
march 2009 by howthebodyworks
SourceForge.net: CHMporter
march 2009 by howthebodyworks
"CHMporter is an CHM importer for Spotlight. It use Chmlib to get the text inside every CHM files and index it."
predates intel macs but the source version compiles without issue
spotlight
search
chm
osx
predates intel macs but the source version compiles without issue
march 2009 by howthebodyworks
precipitate - Google Code
february 2009 by howthebodyworks
search you google docs and bookmarks from the mac os search interface
opensource
osx
productivity
search
google
february 2009 by howthebodyworks
django-springsteen and Distributed Search @ Irrational Exuberance
february 2009 by howthebodyworks
search result aggregating thing, providing combining multiple internal and externally provided search results. (in theory - only yahoo BOSS so far)
api
django
python
search
february 2009 by howthebodyworks
related tags
3d ⊕ @todo ⊕ academic ⊕ Accessibility ⊕ ack ⊕ activism ⊕ aggregator ⊕ ai ⊕ ajax ⊕ amazon ⊕ analytics ⊕ apache ⊕ api ⊕ architecture ⊕ archives ⊕ atom ⊕ audio ⊕ australia ⊕ avcontent ⊕ bibtex ⊕ bikes ⊕ bioinformatics ⊕ biology ⊕ blog ⊕ book ⊕ bookmark ⊕ browser ⊕ business ⊕ c ⊕ c# ⊕ c++ ⊕ cakephp ⊕ cd ⊕ chat ⊕ china ⊕ chm ⊕ chrome ⊕ citation ⊕ citeulike ⊕ classification ⊕ cli ⊕ cms ⊕ cocoa ⊕ coding ⊕ collaborative ⊕ commercial ⊕ commons ⊕ community ⊕ comparison ⊕ composition ⊕ compsci ⊕ computer_vision ⊕ conference ⊕ cpod ⊕ creativecommons ⊕ crowdsourcing ⊕ data ⊕ db ⊕ del.icio.us ⊕ democracy ⊕ design ⊕ diagram ⊕ dictionary ⊕ diy ⊕ django ⊕ dsp ⊕ dublincore ⊕ ebook ⊕ ecology ⊕ editor ⊕ english ⊕ ethics ⊕ event ⊕ extension ⊕ feed ⊕ firefox ⊕ framework ⊕ free ⊕ freebase ⊕ friend ⊕ fs ⊕ futures ⊕ gae ⊕ geek ⊕ generator ⊕ geo ⊕ geocoding ⊕ geography ⊕ gis ⊕ google ⊕ gov2.0 ⊕ gps ⊕ grammarthing ⊕ greasemonkey ⊕ greens ⊕ hadoop ⊕ hardware ⊕ hiphop ⊕ history ⊕ howto ⊕ html ⊕ http ⊕ icon ⊕ image ⊕ india ⊕ indonesia ⊕ industrial ⊕ innovation ⊕ Internet ⊕ ip ⊕ japan ⊕ java ⊕ javascript ⊕ jobs ⊕ journal ⊕ journalism ⊕ json ⊕ jython ⊕ knowledge ⊕ korea ⊕ language ⊕ last.fm ⊕ latex ⊕ law ⊕ learning ⊕ library ⊕ literature ⊕ lucene ⊕ mail ⊕ mapping ⊕ mapreduce ⊕ marketing ⊕ markov ⊕ mashup ⊕ mathematics ⊕ media ⊕ metadata ⊕ microformats ⊕ mind ⊕ mir ⊕ modeling ⊕ money ⊕ mongodb ⊕ mp3 ⊕ music ⊕ mysql ⊕ netcultures ⊕ netlabel ⊕ network ⊕ networks ⊕ nlp ⊕ nonfree ⊕ nosql ⊕ nsw ⊕ oai ⊕ objc ⊕ ocr ⊕ ontology ⊕ opac ⊕ openaccess ⊕ opensearch ⊕ opensource ⊕ OpenURL ⊕ orm ⊕ osx ⊕ outlook ⊕ p2p ⊕ parkingsun ⊕ parser ⊕ parsimony ⊕ parsing ⊕ patent ⊕ pdf ⊕ performance ⊕ perl ⊕ phd ⊕ phm ⊕ photography ⊕ php ⊕ plugin ⊕ podcast ⊕ poetry ⊕ policy ⊕ portable ⊕ possumpalace ⊕ postgresql ⊕ privacy ⊕ productivity ⊕ python ⊕ q&a ⊕ qsb ⊕ quicksilver ⊕ rails ⊕ rdf ⊕ redis ⊕ reference ⊕ regex ⊕ remix ⊕ research ⊕ rest ⊕ rss ⊕ ruby ⊕ safari ⊕ scalability ⊕ schmooze ⊕ science ⊕ sea ⊕ search ⊖ security ⊕ semantic ⊕ semanticweb ⊕ seo ⊕ service ⊕ sex ⊕ shell ⊕ shopping ⊕ simulation ⊕ social ⊕ software ⊕ solr ⊕ sound ⊕ sparql ⊕ spider ⊕ spotlight ⊕ standards ⊕ statistics ⊕ streaming ⊕ surveillance ⊕ sustainability ⊕ sydney ⊕ syndication ⊕ tagging ⊕ technology ⊕ template ⊕ testing ⊕ text ⊕ textmate ⊕ torrent ⊕ translate ⊕ travel ⊕ twitter ⊕ ui ⊕ urban ⊕ url ⊕ usa ⊕ validation ⊕ via:cogat ⊕ via:crucialfelix ⊕ via:datakid ⊕ via:dpwolf ⊕ via:jeanpoole ⊕ via:miriaml ⊕ via:neant ⊕ via:simonw ⊕ video ⊕ visualization ⊕ vst ⊕ web ⊕ web2.0 ⊕ webdev ⊕ webservices ⊕ wiki ⊕ wikipedia ⊕ windows ⊕ writing ⊕ xml ⊕ yahoo ⊕ z39.50 ⊕ zotero ⊕Copy this bookmark: