All Sorts - a linguistic experiment
june 2010 by infovore
Collecting casual and informal collective-nouns by scraping twitter. The "What Is This" page is very well done, explaining just what the scraper "sees" in a clear fashion. Fun.
linguistics
twitter
words
nouns
collective
scraping
june 2010 by infovore
The little page of TRANSPORT CHAOS
february 2009 by infovore
Scrapes lots of things, produces a useful page which actually manages to stay up. Also, it spells TRANSPORT CHAOS the only way it should be spelt: in capitals.
information
uk
funny
scraping
travel
useful
transportchaos
transport
february 2009 by infovore
Welcome to Sackbook! Social networking for LittleBigPlanet™
december 2008 by infovore
Oh gosh this is brilliant.
littlebigplanet
socialnetworking
games
play
ps3
web20
pastiche
scraping
december 2008 by infovore
Monkeys & Robots » Blog Archive » Code to scape CNN.com election results
november 2008 by infovore
"My election party tomorrow will feature DMX controlled RGB LED lighting. The color of the house should reflect the electoral balance. The color will start purple, and drift toward either red or blue, depending on who’s winning." Awesome.
hacking
making
scraping
electronics
hardware
politics
election
november 2008 by infovore
Data Scraping Wikipedia with Google Spreadsheets « OUseful.Info, the blog…
october 2008 by infovore
"So to recap, we have scraped some data from a wikipedia page into a Google spreadsheet using the =importHTML formula, published a handful of rows from the table as CSV, consumed the CSV in a Yahoo pipe and created a geocoded KML feed from it, and then displayed it in a Yahoo map." Wow, etc.
yahoo
google
wikipedia
cloudcomputing
web
tools
scraping
october 2008 by infovore
Geo Spidering » Blog » tomtaylor.co.uk
october 2008 by infovore
"The technology will probably improve, but in lieu of the promised emergent web AI, we need to build more small tools, more games to bootstrap datasets, and more simple ways of encouraging people to play their part in the semantic web without ever having to explain what it is." tt++.
geo
location
scraping
semantics
tools
small
little
data
parsing
tomtaylor
october 2008 by infovore
(theinfo)
january 2008 by infovore
"This is a site for large data sets and the people who love them: the scrapers and crawlers who collect them, the academics and geeks who process them, the designers and artists who visualize them." Aaron Swartz strikes again. This looks great.
dataviz
infoviz
visualisation
scraping
data
accumulation
processing
january 2008 by infovore
scRUBYt! - a Simple to Learn and Use, yet Powerful Web Scraping Toolkit Written in Ruby
august 2007 by infovore
"WWW::Mechanize and Hpricot on Steroids". An interesting looking screenscraping library; I'm a big fan of both HPricot and Mechanize, so it'll be interesting to put this to use at some point.
ruby
scraping
hpricot
mechanize
development
hacking
library
screenscraping
html
august 2007 by infovore
Introducing templatemaker | Holovaty.com
july 2007 by infovore
Template maker is "a Python library for extracting data from similarly formatted text strings." Looks super-useful.
python
scraping
text
processing
utility
july 2007 by infovore
related tags
accumulation ⊕ cinema ⊕ cloudcomputing ⊕ collective ⊕ data ⊕ dataviz ⊕ development ⊕ election ⊕ electronics ⊕ film ⊕ funny ⊕ games ⊕ geo ⊕ google ⊕ hacking ⊕ hardware ⊕ hpricot ⊕ html ⊕ independent ⊕ information ⊕ infoviz ⊕ library ⊕ linguistics ⊕ listings ⊕ little ⊕ littlebigplanet ⊕ location ⊕ london ⊕ making ⊕ mashup ⊕ mechanize ⊕ myspace ⊕ nouns ⊕ parser ⊕ parsing ⊕ pastiche ⊕ play ⊕ politics ⊕ processing ⊕ ps3 ⊕ python ⊕ ruby ⊕ scraping ⊖ screenscraping ⊕ semantics ⊕ small ⊕ socialnetworking ⊕ text ⊕ tomtaylor ⊕ tools ⊕ transport ⊕ transportchaos ⊕ travel ⊕ twitter ⊕ uk ⊕ useful ⊕ utility ⊕ visualisation ⊕ web ⊕ web20 ⊕ wikipedia ⊕ words ⊕ yahoo ⊕Copy this bookmark: