keimlink + datamining   7

Pattern | CLiPS
Pattern is a web mining module for the Python programming language.
It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics) and data visualization (graph networks).
python  web  opensource  tools  datamining  google  twitter  wikipedia  text  graphs 
october 2011 by keimlink
ScraperWiki
ScraperWiki provides the tools you need for screen scraping, data mining and visualisation.
database  datamining  opendata  php  programming  python  ruby  screenscraping  tools  wiki 
july 2011 by keimlink
Data Brewery and Cubes
Brewery is a Python framework and collection of tools for analysing and mining data.
python  datamining  programming  opensource  opendata  csv  mongodb  database 
june 2011 by keimlink
[!] PippiLongstrings
A document database that knows the relations between its documents.
python  django  eu  politics  datamining  opensource  27c3 
december 2010 by keimlink
Carrot2 - Open Source Search Results Clustering Engine
Carrot2 is an Open Source Search Results Clustering Engine. It can automatically organize small collections of documents, e.g. search results, into thematic categories.
search  opensource  searchengine  java  clustering  datamining 
june 2010 by keimlink
Exomind
An experimental Python console and programmatic framework for building decorated graphs and developing open-source intelligence modules and ideas, centered on social network services, search engines and instant messaging.
python  opensource  datamining  socialnetwork  community  security 
november 2008 by keimlink
