plindberg + wikipedia   7

datawrangling's trendingtopics at master - GitHub
This repository contains the full source code for Trendingtopics.org, built by Data Wrangling to demonstrate how Hadoop & EC2 can power a data driven website.
rubyonrails  trending  statistics  wikipedia  github  hadoop  amazonec2 
june 2009 by plindberg
Modernista!
(This is the company whose website simply is a tiny menu rendered atop Flickr, Wikipedia, and Facebook.)
company  facebook  wikipedia  flickr 
january 2009 by plindberg
BBC - Radio Labs - Wikipedia + Lucene's MoreLikeThis = useful bits about the bits?
"My proof-of-concept is based on vacuuming every Wikipedia article into the Lucene open source search engine to build a text categorisation tool prototype."
bbcradiolabs  textmining  lucene  categories  categorisation  wikipedia  bbc 
september 2008 by plindberg

Copy this bookmark:



description:


tags: