howthebodyworks + search   205

RapidLibrary - Your Media Search Engine
Rule 44 is that if you can think of it, you can download it from rapidshare. This search engine helps you do that.
avcontent  ebook  search  from delicious
6 weeks ago by howthebodyworks
Terrier IR Platform v3.5 - Homepage
Terrier is a highly flexible, efficient, and effective open source search engine, readily deployable on large-scale collections of documents. Terrier implements state-of-the-art indexing and retrieval functionalities, and provides an ideal platform for the rapid development and evaluation of large-scale retrieval applications.
Terrier is open source, and is a comprehensive, flexible and transparent platform for research and experimentation in text retrieval. Research can easily be carried out on standard TREC and CLEF test collections.
nlp  java  search  from delicious
december 2011 by howthebodyworks
Apache Lucy
python-happy fultext search lucene-style, without java. "Apache Lucy is full-text search engine library written in C and targeted at dynamic languages. It is a "loose C" port of Apache Lucene™, a search engine library for Java."
lucene  opensource  search  from delicious
november 2011 by howthebodyworks
SimString - A fast and simple algorithm for approximate string matching/retrieval
SimString has the following features:

Fast algorithm for approximate string retrieval. For example, SimString can find strings in Google Web1T unigrams (13,588,391 strings) that have cosine similarity ≧0.7 in 1.10 [ms] per query (on Intel Xeon 5140 2.33 GHz CPU).
100% exact retrieval. Although some algorithms allow misses (false positives) for faster query response, SimString is guaranteed to achieve 100% correct retrieval with fast query response.
Unicode (wchar_t) support. For languages using multi-byte characters, developers can use Unicode characters (wchar_t) instead of single-byte characters (char) as a character representation.
Implementation in C++ header files. Developers can add the funtionality of approximate string retrieval into C++ programs just by including a header file.
Python and Ruby bindings via SWIG. Developers can easily perform approximate string retrieval in scripting languages.
nlp  search  python  ruby  c++  performance 
june 2011 by howthebodyworks
Sally - A Tool for Embedding Strings in Vector Spaces
string embedding, fast. ouputs to libsvm and matlab etc formats
nlp  search  c 
may 2011 by howthebodyworks
Glassdoor.com – an inside look at jobs & companies
curious. collaborative corporate spying for mutual gain of employees and employers. So I'm told.
search  jobs  business  money 
march 2011 by howthebodyworks
John Lamp's IS Pages ::
In Australia you need to know which code you are and which code the journals are. if you try to submit to one in the wrong field you endanger your whole unit's funding. This academic has made a nice interface to them all, although as it was not a publication in his journals, that time was putting his funding at risk by not going into publication quotas.
journal  academic  australia  search 
february 2011 by howthebodyworks
PageRankGraph
pretty viz of where your traffic is from, more or less.
sea  visualization  search 
january 2011 by howthebodyworks
DuckDuckGo
google without tracking or, er, features.
search  google  web  parsimony  ui 
january 2011 by howthebodyworks
Cheap flights from easyJet, Air Asia, JetBlue, WestJet, AirBerlin, Jetstar and more
crzay-friendly travel search tool with all manner of nice visual ways of optimising your budget.
travel  search  via:jeanpoole  via:dpwolf  visualization 
october 2010 by howthebodyworks
Catherine Devlin: The all-seeing milk carton
anti sex traffickers plant to crawl sex sites looking for kidnap victims
privacy  surveillance  computer_vision  search  sex  activism 
september 2010 by howthebodyworks
Stack Exchange - Free, Community-Powered Q&A
teh stackexchange mega hub, whee the diaspora of questionably integrate sites that look a bit like the amazing stackoverflow.com site hang out and share expertise.
q&a  collaborative  search  learning  knowledge 
september 2010 by howthebodyworks
Home - Quora
another Q&A site. unlike stackexchange it doesn't artificially segment communities but lets topics organically grow. unlike vark.com it emphasises collaboration and reputation, and integrates a bit better with social media. And there are lots of nerds i know on it. interesting.
collaborative  search  knowledge  learning  academic  crowdsourcing  q&a 
september 2010 by howthebodyworks
Rebuilding Spotlight’s Index on OS X (Manually) | Walt-O-Matic
pretty much everything you need to know about fixing fucked spotlight indexes from the command like, the way it should be done.
spotlight  osx  howto  cli  search 
september 2010 by howthebodyworks
Google Alarm | F.A.T.
a thingy which points out when google is monitoring your behaviour in your browser
privacy  google  search  chrome  browser  firefox  surveillance 
september 2010 by howthebodyworks
Google Scribe
automatic writing cliche manufacturer, based on the most common phrases in use in the global datasphere at this point in time
google  english  writing  editor  search  markov  parkingsun  hiphop 
september 2010 by howthebodyworks
ElasticSearch - Open Source, Distributed, RESTful Search Engine
REESTful auto-redundancy search engine. I think it still uses Lucene under the hood.
search  java  rest  http 
june 2010 by howthebodyworks
A fast, fuzzy, full-text index using Redis | PlayNice.ly
fast and loose homophone happy fulltext search in redis.
search  language  redis  nosql 
may 2010 by howthebodyworks
Spezify
pretty, visually smart search visualisation
via:datakid  search  visualization  ui  mashup 
january 2010 by howthebodyworks
sitemap-generators - Project Hosting on Google Code
It's not clear, but it seems google's python sitemap generator is now hosted here, not in the 3 year stale version on sourceforge
python  xml  search  possumpalace 
december 2009 by howthebodyworks
DocSavage's appengine-search at master - GitHub
porter stem your way to happines with a home-baked google app engine search
gae  search 
november 2009 by howthebodyworks
DeepDyve: Like iTunes for Scientific Papers
missing the sweet, sweet journal access of academia? This claims to have better search than most journals and allows discount 1-day article rentals. I'd be curious to hear how good it was in fact.
academic  search  tagging  reference  science 
november 2009 by howthebodyworks
Proxy
"Proxy develops and extends sophisticated architectural tools to conceive of design as a search. Designing "options" in parallel is nothing new, but by innovating flexible tools for design, we are able to massively expand the information being considered and generated. This enriches the design process and provides valuable feedback and contingency. With computation, we consider an incredible number of variations and options. We actively search through a space of possibilities to find unique solutions."
design  technology  architecture  3d  modeling  search  simulation 
october 2009 by howthebodyworks
Welcome to Shazam
I keep forgetting its name. this is the service that uses your phone microphone to tell you what track you are listening to in the background
music  search  mp3  portable 
october 2009 by howthebodyworks
Suburb View : Australian Real Estate Search for Google Earth and Google Maps
very, very useful. real estate analytics from across australia in a reasonably parseable from. usage rights would be interesting to know too.
australia  geography  money  shopping  statistics  netcultures  search  mapping  mashup 
october 2009 by howthebodyworks
Google Public Sector
There's some hidden gems in google's trite re-branding as a government service provider. Their ideascale+forum thing, "moderator" looks interesting.
via:cogat  gov2.0  google  search  democracy  collaborative 
september 2009 by howthebodyworks
Full Text Search in Mongo - MongoDB
mongodb seems to have perfunctory support for full-text indices, but it does look easily extensible. (or at least, you can extend it in an easy language without learning some weird vendor-specific SQL variant.)
mongodb  nlp  search  parsing  javascript  nosql 
september 2009 by howthebodyworks
Google PageRank Powers Endangered Species Hierarchy
Hm - an intersting use of google's pagerank algorithm. But I wonder how it performs under conditions of low information? (there is no food-web-spider)
search  google  ecology  sustainability 
september 2009 by howthebodyworks
Language Log » The Google Books Settlement
Some choice snippets about he google books settlement as an IP watershed - both as an economics-of-books-issue, and as intersting problem for, say, researching language corpii.
ip  library  book  google  search  language  netcultures 
september 2009 by howthebodyworks
How I Located a Camera in your Back Yard | Huy Nguyen
An interesting combo google-hack plus python hack that locates unsecured webcamera on the web and in reality.
security  privacy  search  geocoding  netcultures  python 
august 2009 by howthebodyworks
Python Package Index : topia.termextract 1.1.0
from my perspective, a yahoo term extraction replacement. Also apparently light POS tagging.
nlp  language  python  opensource  search  parser  tagging 
august 2009 by howthebodyworks
GeoCommons Finder!
awesome search on public geographic datasets
netcultures  search  gis  mapping  mashup 
august 2009 by howthebodyworks
Google Maps Mania
some kind of twisted fan site for google maps.
netcultures  mapping  google  visualization  search  gps 
august 2009 by howthebodyworks
SEO Boost from 404 Plugin? Focus on humans
A nice hack - this searches your site for alternative content shoudl punters link somewhere incorrect...
apache  cpod  php  google  search  seo 
august 2009 by howthebodyworks
fizy / videomusic.
find instant links to tracks online
search  music  mp3 
july 2009 by howthebodyworks
Scryve - Home
"Want to know who you're buying from?
Want to know who owns them?Want to buy from people who are good to the world?
We do. That's why we created this site

Scryve is a research and vetting engine for environmental and social responsibility. We rate companies and provide you with alternatives options to use whenever you don't like a company. We've got research on over 3500 companies from industry leading CSR firm, KLD, paid researchers and people like you."
money  sustainability  ethics  policy  search  shopping  wiki 
july 2009 by howthebodyworks
Scirus - for scientific information
quasi-structured metadata search across scientific publications databases.
search  academic  commercial  research  metadata 
july 2009 by howthebodyworks
Google Quick Search Box
it seems that google are building quicksilver. except not open. and google intergrated. and not abandonware...
opensource  osx  ui  search  quicksilver 
june 2009 by howthebodyworks
Django snippets: Full-Text Searchable Models
outstanding postgres search snippet, bringing it to be very close to a first-class ORM citizen.
django  search  phm  orm  postgresql 
may 2009 by howthebodyworks
Google Torrent Search
google custom search for torrents. seems rather comprehensive.
google  search  p2p  torrent 
april 2009 by howthebodyworks
bfirsh's django at search - GitHub
native db-backed fulltext search queries for django
django  search  orm  phm  mysql  postgresql 
april 2009 by howthebodyworks
Haystack - Modular search for Django
har. yet another fucking django search option. claims to be the greatest thus far and the thoughtfulness feature list seems to support that. (similarity ranking?) me, I'm stoked that I don't have to investigate this one, blessed be outsourcing. includes pure-python Whoosh backend.
python  django  search  phm  via:cogat  solr 
april 2009 by howthebodyworks
Ideological Search
the terrifying thing about this april fools joke is that it appears to actually function.
yahoo  search  policy 
april 2009 by howthebodyworks
openmeta - Google Code
awesome - API access to file tagging from your objc apps., and get spotlight indexing.
opensource  osx  framework  cocoa  tagging  metadata  spotlight  search  objc 
march 2009 by howthebodyworks
Ironic Software
nice looking bundles of tagging, search and analysis apps for your files. (search fiels by tags, images by colour and so on). Also includes an opensource metadata/tagging suite.
commercial  osx  productivity  search  tagging  pdf  archives 
march 2009 by howthebodyworks
Aardvark
social question answering, where you trade answering questions for answers to questions. fast and interacts with mobile devices, IM, etc, and found me the best pastries in Adelaide.
social  community  knowledge  web2.0  q&a  collaborative  search  learning  crowdsourcing 
march 2009 by howthebodyworks
djangosearch - Google Code
another slightly different django search backend.
opensource  django  search  phm  solr 
march 2009 by howthebodyworks
HubLog: Guardian Lucene = Similar Articles Categorisation
the reverse of what we are doing at PHM, and devilishly cunning, although not the first time it's been done. Generates similarity ranging *and tags* by parsing a large corpus of text and using similarity indices.
via:simonw  lucene  solr  phm  classification  language  tagging  search 
march 2009 by howthebodyworks
Google Agency Toolkit
one stop google-style analytics/SEO shop
search  google  marketing  seo  analytics 
march 2009 by howthebodyworks
@ findjango
vertical search for django-related topics
django  search 
march 2009 by howthebodyworks
ichm - Google Code
leopardy search-friendy chm GUI. Still maintained, unlike CHMOX, and has a superior feature-set
chm  osx  opensource  cocoa  search 
march 2009 by howthebodyworks
SourceForge.net: CHMporter
"CHMporter is an CHM importer for Spotlight. It use Chmlib to get the text inside every CHM files and index it."

predates intel macs but the source version compiles without issue
spotlight  search  chm  osx 
march 2009 by howthebodyworks
precipitate - Google Code
search you google docs and bookmarks from the mac os search interface
opensource  osx  productivity  search  google 
february 2009 by howthebodyworks
django-springsteen and Distributed Search @ Irrational Exuberance
search result aggregating thing, providing combining multiple internal and externally provided search results. (in theory - only yahoo BOSS so far)
api  django  python  search 
february 2009 by howthebodyworks
« earlier      

related tags

3d  @todo  academic  Accessibility  ack  activism  aggregator  ai  ajax  amazon  analytics  apache  api  architecture  archives  atom  audio  australia  avcontent  bibtex  bikes  bioinformatics  biology  blog  book  bookmark  browser  business  c  c#  c++  cakephp  cd  chat  china  chm  chrome  citation  citeulike  classification  cli  cms  cocoa  coding  collaborative  commercial  commons  community  comparison  composition  compsci  computer_vision  conference  cpod  creativecommons  crowdsourcing  data  db  del.icio.us  democracy  design  diagram  dictionary  diy  django  dsp  dublincore  ebook  ecology  editor  english  ethics  event  extension  feed  firefox  framework  free  freebase  friend  fs  futures  gae  geek  generator  geo  geocoding  geography  gis  google  gov2.0  gps  grammarthing  greasemonkey  greens  hadoop  hardware  hiphop  history  howto  html  http  icon  image  india  indonesia  industrial  innovation  Internet  ip  japan  java  javascript  jobs  journal  journalism  json  jython  knowledge  korea  language  last.fm  latex  law  learning  library  literature  lucene  mail  mapping  mapreduce  marketing  markov  mashup  mathematics  media  metadata  microformats  mind  mir  modeling  money  mongodb  mp3  music  mysql  netcultures  netlabel  network  networks  nlp  nonfree  nosql  nsw  oai  objc  ocr  ontology  opac  openaccess  opensearch  opensource  OpenURL  orm  osx  outlook  p2p  parkingsun  parser  parsimony  parsing  patent  pdf  performance  perl  phd  phm  photography  php  plugin  podcast  poetry  policy  portable  possumpalace  postgresql  privacy  productivity  python  q&a  qsb  quicksilver  rails  rdf  redis  reference  regex  remix  research  rest  rss  ruby  safari  scalability  schmooze  science  sea  search  security  semantic  semanticweb  seo  service  sex  shell  shopping  simulation  social  software  solr  sound  sparql  spider  spotlight  standards  statistics  streaming  surveillance  sustainability  sydney  syndication  tagging  technology  template  testing  text  textmate  torrent  translate  travel  twitter  ui  urban  url  usa  validation  via:cogat  via:crucialfelix  via:datakid  via:dpwolf  via:jeanpoole  via:miriaml  via:neant  via:simonw  video  visualization  vst  web  web2.0  webdev  webservices  wiki  wikipedia  windows  writing  xml  yahoo  z39.50  zotero 

Copy this bookmark:



description:


tags: