PhantomJS: Headless WebKit with JavaScript API
7 weeks ago by keimlink
PhantomJS is a headless WebKit with JavaScript API. It has fast and native support for various web standards: DOM handling, CSS selector, JSON, Canvas, and SVG.
browser
json
javascript
testing
svg
screenscraping
dom
programming
opensource
7 weeks ago by keimlink
ScraperWiki
july 2011 by keimlink
ScraperWiki is all the tools you need for Screen Scraping, Data Mining & visualisation.
database
datamining
opendata
php
programming
python
ruby
screenscraping
tools
wiki
july 2011 by keimlink
Python Package Index : grab
june 2011 by keimlink
Grab is the tool which simplifies writing sitescrapers.
python
html
dom
screenscraping
opensource
http
june 2011 by keimlink
pyquery: a jquery-like library for python
september 2009 by keimlink
pyquery allows you to make jquery queries on xml documents. The API is as much as possible the similar to jquery. pyquery uses lxml for fast xml and html manipulation.
python
opensource
xml
library
development
xhtml
programming
parser
screenscraping
browser
september 2009 by keimlink
Beautiful Soup: We called him Tortoise because he taught us.
january 2009 by keimlink
Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping.
python
opensource
programming
parser
xhtml
webdev
browser
xml
screenscraping
january 2009 by keimlink
Copy this bookmark: