scraping 3961
Needlebase
23 hours ago by qfwfq
A platform for acquiring, integrating, cleansing, analyzing, and publishing data on the web. Using Needlebase through a web browser, without programmers or DBAs, you can easily:
* acquire data from multiple sources
* merge, deduplicate and cleanse
* build and publish custom data views
Merging into Google and shutting down on June 1, 2012.
data
database
scraping
web
webapp
cloud
Scraping Made Easy with jQuery and SelectorGadget - David Trejo's Thoughts
3 days ago by bensheldon
A few days ago I was doing a TON of scraping, and as you know, without the right tools, scraping can be a REAL pain. Out of my pain comes your pleasure — here’s a list of scraping tools and resources which will make your life MUCH easier the next time you need some information from a crufty old website. If you’re short on time, skip to the end and read the tl;dr.
nodejs
javascript
jquery
scraping
PhantomJS: Headless WebKit with JavaScript API
4 days ago by rybesh
PhantomJS is a headless WebKit with a JavaScript API. It has fast, native support for various web standards: DOM handling, CSS selectors, JSON, Canvas, and SVG.
PhantomJS is well suited to fast headless testing, site scraping, page capture, SVG rendering, network monitoring, and many other use cases.
javascript
scraping
testing
JackDanger/hacker_news - GitHub
5 days ago by genieyclo
Hacker News
This tiny script is a scraper for http://news.ycombinator.com. It shows just how little Ruby is required to run a powerful web filter.
What it be doin':
* Fetches the YCombinator news page
* HPricot's it into pieces
* Builds an RSS feed
* Uses links to parse the destination article page and include the main text inline
* Runs as a stand-alone server thanks to Rack and Mongrel
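The parse-and-build-RSS steps above can be sketched roughly as follows. This is a minimal illustration in Python using only the standard library, not the Ruby/Hpricot code from the repository. The names `LinkCollector` and `build_rss`, the feed title, and the rule of keeping only absolute http(s) links are all assumptions made for the example. It takes HTML as a string, so fetching the page, pulling in the article text, and serving the feed are left out.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects (href, text) for <a> tags with absolute URLs."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently being read, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            # Assumption: only absolute links count as stories.
            if href and href.startswith(("http://", "https://")):
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            if title:
                self.links.append((self._href, title))
            self._href = None


def build_rss(html, feed_title="Hacker News (sketch)",
              feed_link="http://news.ycombinator.com"):
    """Parse the page HTML and return an RSS 2.0 document as a string."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()

    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = feed_link
    ET.SubElement(channel, "description").text = "Links scraped from the front page"

    # One <item> per collected link, in page order.
    for href, title in parser.links:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "link").text = href

    return ET.tostring(rss, encoding="unicode")
```

Calling `build_rss(page_html)` on a downloaded page returns the feed as a string; relative links such as comment pages are skipped under the assumption stated above.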
ruby
golden:ruby
examples
codeexamples
useful
scraping
links
parser
hpricot
mongrel
rack
rss
apexdodge/NASCAR-Screen-Scraper
9 days ago by heyitsnoah
"NASCAR.com, to my knowledge, does not provide an API for acquiring driver stats. Here is a screen scraper for NASCAR.com to acquire all the relevant stats and races."
nascar
scraping
code
python