mcroydon + datamining 64
Driving down the cost of Big-Data analytics - All Things Distributed
september 2011 by mcroydon
"The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud."
analytics
aws
bigdata
data
datamining
september 2011 by mcroydon
nathanmarz/storm - GitHub
september 2011 by mcroydon
Processing kit from Backtype.
data
datamining
processing
twitter
september 2011 by mcroydon
Mining of Massive Datasets
september 2011 by mcroydon
Looks like a fantastic book on data mining.
book
books
data
datamining
mapreduce
september 2011 by mcroydon
Overview: Extracting article text from HTML documents | My tech blog.
may 2011 by mcroydon
Extracting text from markup.
datamining
extraction
html
scraping
text
may 2011 by mcroydon
Stanford Vis Group | Wrangler: Interactive Visual Specification of Data Transformation Scripts
march 2011 by mcroydon
The paper behind the Wrangler project.
browser
data
datamining
visualization
web
march 2011 by mcroydon
Pattern | CLiPS
february 2011 by mcroydon
A Python NLP package with emphasis on retrieving and analyzing language found on the web.
analysis
data
datamining
nlp
python
february 2011 by mcroydon
Beyond Amazon: How to Make Recommendations Smarter | Fast Company
february 2011 by mcroydon
You put the chicken in the pot... bork bork bork.
amazon
analytics
cloud_computing
data
datamining
february 2011 by mcroydon
A Graph Processing Stack
december 2010 by mcroydon
Java-oriented graph stack.
algorithm
datamining
graph
programming
december 2010 by mcroydon
Yelp Engineering Blog: mrjob: Distributed Computing for Everybody
november 2010 by mcroydon
A really nice wrapper around EMR.
algorithms
amazon
aws
cloudcomputing
computing
data
hadoop
framework
distributed
dist
development
datamining
library
map-reduce
map
mapreduce
nosql
opensource
yelp
webservices
search
reduce
python
programming
aa
elasticmapreduce
logs
emr
mrjob
november 2010 by mcroydon
Machine Learning textbook slides
november 2010 by mcroydon
Great slides as companion to the book and classes built around it.
classifier
ai
artificial-intelligence
bayesian
book
books
intelligence
datamining
courses
machine
computer
class
learning
lectures
machine-learning
machinelearning
tutorial
science
slides
resources
reference
ml
mitchell
november 2010 by mcroydon
Silk - A Link Discovery Framework for the Web of Data
october 2010 by mcroydon
"The Silk framework is a tool for discovering relationships between data items within different Linked Data sources. Data publishers can use Silk to set RDF links from their data sources to other data sources on the Web."
app
code
applications
data
datamining
framework
library
owl
linking
lod
opensource
linkeddata
linked-data
programming
python
rdf
semantic
semantic-web
web
tools
tool
sparql
software
semweb
semanticweb
silk
october 2010 by mcroydon
wiki.dbpedia.org : About
october 2010 by mcroydon
Quite a lot of things in the linked open data universe point to DBpedia.
analysis
data
api
database
datamining
datasets
db
free
opensource
ontology
mashup
knowledge
information
programming
rdf
reference
search
semantic
semantic-web
web2.0
web
tools
sparql
semanticweb
semantic_web
wiki
wikipedia
xml
dbpedia
october 2010 by mcroydon
The Easy Way to Extract Useful Text from Arbitrary HTML - AI Depot
algorithms
content
ai
code
crawler
data
datamining
extract
math
machinelearning
machine
learning
extraction
html
mining
network
parser
parsing
programming
python
scraping
web
useful
tutorial
tools
textmining
text-mining
statistics
september 2010 by mcroydon
Gephi, graph exploration and manipulation software
march 2010 by mcroydon
Quite beautiful.
3d
analysis
animation
cartographie
cartography
data
graphviz
graphs
graphics
datavisualization
graph
datamining
infographics
infovis
infoviz
java
maps
network
networks
socialnetworks
socialnetwork
socialgraph
sna
opensource
software
tools
tool
visualisation
visualization
gephi
march 2010 by mcroydon
Data-Intensive Text Processing with MapReduce
february 2010 by mcroydon
I flipped through this during the conference and need to read through it more thoroughly.
programming
design
data
reference
book
free
books
geek
pdf
text
hadoop
distributed
online
algorithm
algorithms
mapreduce
datamining
to-read
ebooks
nlp
ir
textmining
distributedcomputing
text-mining
developers
draft
february 2010 by mcroydon
Pin Dancing: Learning about Machine Learning
january 2010 by mcroydon
Big book list recommendation / study order (part 2).
programming
data
book
books
computer
science
statistics
math
learning
ai
list
ebook
datamining
machinelearning
machine-learning
to-read
academic
mix
hackernews
lists
machine_learning
ml
mathematics
ebooks
january 2010 by mcroydon
Measuring Measures: Learning About Statistical Learning
january 2010 by mcroydon
Big list of book recommendations / study order (part 1).
programming
data
reference
toread
tutorial
research
book
books
statistics
math
education
ai
learning
list
cs
datamining
stats
machinelearning
analytics
probability
machine.learning
courses
machine_learning
ml
mathematics
machine-learning
statistical
statistical_learning
intro
suggestions
january 2010 by mcroydon
sbff - Project Hosting on Google Code
december 2009 by mcroydon
Takes the brute-force approach, but it'll be interesting to see what this comes up with.
web
software
database
opensource
api
java
search
computing
social
datamining
del.icio.us
recommendations
socialmedia
delicious
script
filtering
collaborative
users
bookmarks
bookmarking
experiments
friend
similar
december 2009 by mcroydon
Open source Clustering software
december 2009 by mcroydon
Including PyCluster, a Python interface to several of the C-based clustering algorithms.
python
programming
software
linux
code
tools
library
opensource
free
java
research
science
statistics
license:PSF
algorithm
algorithms
math
ai
cluster
analysis
open
kmeans
data_mining
bioinformatics
c++
perl
c
machinelearning
source
datamining
clustering
december 2009 by mcroydon
Welcome to Elefant — Elefant
november 2009 by mcroydon
"Elefant (Efficient Learning, Large-scale Inference, and Optimisation Toolkit) is an open source library for machine learning licensed under the Mozilla Public License"
python
programming
open-source
software
code
tools
library
opensource
research
computer
math
ai
app
datamining
learning
gui
optimization
machinelearning
check
scipy
machine.learning
toolkit
ml
frameworks
c++
machine-learning
nicta
scipy2007
november 2009 by mcroydon
Building a Data Intensive Web Application with Cloudera, Hadoop, Hive, Pig, and EC2 | Cloudera
november 2009 by mcroydon
A nice look at end-to-end data analysis of big datasets using things like Pig and Hive.
programming
web
data
database
tools
webdev
business
toread
howto
tutorial
amazon
dev
scalability
hadoop
architecture
aws
computing
cluster
ec2
pig
trends
hive
cloudera
cloudcomputing
analytics
datamining
mapreduce
cloud
application
november 2009 by mcroydon
Hadoop, Pig, and Twitter (NoSQL East 2009)
november 2009 by mcroydon
An awesome deck showing off Pig and how Twitter uses it.
programming
data
database
howto
server
dev
search
statistics
hadoop
presentation
mapreduce
datamining
twitter
analytics
slides
presentations
nosql
bigdata
socialnetworks
slideshare
hdfs
pig
big
functional
data_mining
november 2009 by mcroydon
An Overview of Data Mining Techniques
november 2009 by mcroydon
Data mining vs. statistics. Fight.
data
reference
database
toread
tutorial
research
books
algorithms
statistics
algorithm
education
ai
learning
clustering
datamining
paper
tutorials
analytics
mining
data-mining
datawarehouse
data_mining
bi
and
techniques
technique
november 2009 by mcroydon
LucidDB Home Page
october 2009 by mcroydon
"LucidDB is the first and only open-source RDBMS purpose-built entirely for data warehousing and business intelligence. It is based on architectural cornerstones such as column-store, bitmap indexing, hash join/aggregation, and page-level multiversioning."
programming
software
development
database
data
business
opensource
java
scalability
storage
distributed
cluster
databases
sql
db
datamining
olap
columndb
bi
datawarehouse
dbms
reporting
rdbms
luciddb
column
warehousing
column-store
data_warehouse
column-oriented
dwh
october 2009 by mcroydon
String Similarity Metrics for Information Integration
october 2009 by mcroydon
A fine collection of string similarity algorithms.
programming
library
tools
reference
opensource
java
research
search
algorithm
algorithms
statistics
math
computing
analysis
text
comparison
datamining
nlp
similarity
theory
bioinformatics
metrics
mathematics
ir
string
matching
distance
strings
metric
sequence
october 2009 by mcroydon
Elements of Statistical Learning: data mining, inference, and prediction. 2nd Edition.
october 2009 by mcroydon
Available as a PDF.
programming
data
reference
toread
free
book
research
statistics
books
math
ai
pdf
learning
ebook
stats
datamining
machinelearning
machine-learning
inference
r
machine.learning
prediction
data-mining
mathematics
ml
machine_learning
ebooks
mining
machine
academic
october 2009 by mcroydon
David MacKay: Information Theory, Inference, and Learning Algorithms: The Book
september 2009 by mcroydon
A freely available text on machine learning, AI, inference, etc.
machinelearning
programming
reference
toread
free
book
research
books
science
statistics
algorithms
math
ai
pdf
algorithm
information
learning
cs
ebook
datamining
probability
bayesian
theory
mathematics
machine_learning
ebooks
download
inference
bayes
informationtheory
september 2009 by mcroydon
Tracking Trends with Hadoop and Hive on EC2 » Cloudera Hadoop & Big Data Blog
august 2009 by mcroydon
A detailed run-through of data warehousing and creating trending results for Wikipedia data.
python
data
database
howto
tutorial
wikipedia
rails
scaling
hadoop
aws
resources
ec2
cloud
mapreduce
datamining
sparklines
rubyonrails
trends
hive
cloudera
bigdata
log
t
trendingtopics
august 2009 by mcroydon
Charu Aggarwal
august 2009 by mcroydon
Some pretty hefty thoughts on data mining and ML stuffs.
data
database
research
books
algorithms
management
db
datamining
academic
ibm
papers
mining
people
lists
data-mining
data_mining
privacy
august 2009 by mcroydon
Vowpal Wabbit (Fast Online Learning)
august 2009 by mcroydon
"This is a project at Yahoo! Research to design a fast, scalable, useful learning algorithm."
programming
software
code
tools
library
opensource
research
dev
algorithms
statistics
algorithm
computing
online
ai
learning
yahoo
compsci
tool
datamining
data_mining
artificialintelligence
lib
machine_learning
nlp
ml
machine-learning
machinelearning
online_learning
sgd
august 2009 by mcroydon
Kernel-Machines.Org — Kernel Machines
august 2009 by mcroydon
"This page is devoted to learning methods building on kernels, such as the support vector machine."
programming
software
reference
tutorial
research
science
statistics
algorithms
algorithm
ai
math
learning
clustering
cs
resources
datamining
machinelearning
svm
machine_learning
mathematics
ml
classification
papers
kernel
machine
academic
machine-learning
kernel-machines
kernels
pattern_recognition
august 2009 by mcroydon
Welcome to The Machine Learning Forum
july 2009 by mcroydon
This is worth keeping an eye on if it takes off.
web
data
research
community
math
review
cs
datamining
machinelearning
machine-learning
mathematics
mining
machine_learning
ml
vision
data-mining
forum
discussion
july 2009 by mcroydon
CS171
june 2009 by mcroydon
Data visualization class at Harvard.
programming
design
development
data
reference
video
tutorial
toread
visualization
research
free
graphics
science
statistics
education
math
learning
datamining
academic
course
lectures
processing
visualisation
lecture
courses
class
classes
infovis
harvard
processing.org
june 2009 by mcroydon
Balie - Baseline Information Extraction
june 2009 by mcroydon
Interesting machine learning/information extraction stuff.
programming
software
tools
library
java
research
opensource
language
information
ai
datamining
machinelearning
machine-learning
linguistics
textmining
nlp
machine_learning
semantic
semanticweb
extraction
ie
information_extraction
information_retrieval
uima
ner
entity
weka
balie
named
namedentityrecognition
june 2009 by mcroydon
Rise of the Data Scientist | FlowingData
june 2009 by mcroydon
Indeed.
web
design
data
journalism
business
toread
science
visualization
research
statistics
education
article
future
jobs
datamining
computerscience
visualisation
career
mathematics
data-mining
flowingdata
freelancing
halvarian
skills
scientist
bigdata
datascientist
analytics
june 2009 by mcroydon
FastBit: An Efficient Compressed Bitmap Index Technology
may 2009 by mcroydon
What it says on the tin.
programming
software
database
data
research
library
opensource
search
datamining
grid
db
compression
indexing
index
hashing
bit
algo
data-mining
compress
fastbit
olap
bitmap
multidimensional
columndb
may 2009 by mcroydon
Greenplum: the petabyte-scale database for data warehousing and business intelligence.
may 2009 by mcroydon
Petabyte data mining and data warehousing.
programming
software
development
data
database
business
technology
opensource
storage
scalability
performance
cluster
postgresql
startup
open
postgres
sql
mapreduce
datamining
db
analytics
oss
reporting
rdbms
intelligence
bi
businessintelligence
datawarehouse
greenplum
bizgres
may 2009 by mcroydon
Stephen Marsland
april 2009 by mcroydon
Machine learning + Python. !!!
python
programming
software
data
reference
code
book
books
algorithms
todo
algorithm
analysis
information
ai
learning
resources
cs
datamining
machine
academic
machine-learning
machinelearning
ml
examples
numpy
artificialintelligence
scipy
computer_science
machine.learning
e-books
april 2009 by mcroydon
lucasjosh's collectiverb at master - GitHub
march 2009 by mcroydon
Algorithms from PCI translated to Ruby.
programming
ruby
datamining
march 2009 by mcroydon
UC Berkeley Machine Learning Tea / Hadoop for Machine Learning Guide
march 2009 by mcroydon
Nice and simple example of ML in Java with Hadoop from the Berkeley ML tea.
research
hadoop
mapreduce
datamining
machinelearning
machine_learning
regression
march 2009 by mcroydon
Seng 474 - Spring2009, Course Outline, Alex Thomo
march 2009 by mcroydon
I love the smell of data mining in the spring.
python
code
mapreduce
slides
datamining
lectures
machinelearning
course
march 2009 by mcroydon
Directed Edge News » Blog Archive » On Building a Stupidly Fast Graph Database
february 2009 by mcroydon
Wow.
programming
design
database
visualization
blog
research
algorithms
scalability
algorithm
performance
scaling
coding
engineering
graph
databases
cs
graphs
recommendations
memory
scale
mathematics
datamining
db
hashing
concurrency
notes
february 2009 by mcroydon
Amazon Web Services Blog: New AWS Public Data Sets - Economics, DBpedia, Freebase, and Wikipedia
february 2009 by mcroydon
Lots of hot new data.
programming
data
blog
amazon
research
wikipedia
statistics
aws
compsci
ec2
resources
transportation
cloud
datamining
nlp
dataset
cloudcomputing
corpus
datasets
freebase
gov
publicdata
scraping
encyclopedia
february 2009 by mcroydon
Automating Data Visualization with Ruby and Graphviz - OmniNerd
february 2009 by mcroydon
Yay Graphviz!
programming
data
howto
tutorial
ruby
visualization
rails
graphics
network
statistics
math
charts
chart
graph
databases
graphs
tutorials
article
datamining
reporting
visualisation
diagram
networks
graphviz
scripts
graphing
data-visualization
dot
datavisualization
erp
february 2009 by mcroydon
An Exercise in Species Barcoding
february 2009 by mcroydon
Fantastic read, as always.
python
programming
science
algorithms
math
hacks
clustering
datamining
machine-learning
machinelearning
classification
evolution
dna
biology
bioinformatics
genetic
eol
genetics
genome
taxonomy
norvig
genomics
species
february 2009 by mcroydon
PyML
january 2009 by mcroydon
A machine learning library for Python. LGPL.
license:lgpl
python
programming
software
code
library
research
statistics
algorithms
opensource
framework
ai
learning
module
machine
datamining
classification
machine-learning
machinelearning
ml
svm
pyml
data-mining
knn
regression
data_mining
january 2009 by mcroydon
High-Performance Click Analysis with MySQL | MySQL Performance Blog
january 2009 by mcroydon
Good thoughts on write-heavy systems. A lot of it applies outside the MySQL ecosystem.
design
web
data
database
blog
statistics
storage
scalability
analysis
scaling
performance
stats
mysql
optimization
hadoop
sql
tracking
ad
article
analytics
datamining
english
reporting
db
sharding
seo
warehouse
click
lesen
realtime
datawarehouse
january 2009 by mcroydon
Main page - Introduction to Genetic Algorithms - Tutorial with Interactive Java Applets
december 2008 by mcroydon
Genetic programming in Python.
programming
development
reference
tutorial
java
book
books
algorithm
algorithms
architecture
analysis
computerscience
ai
datamining
dna
ga
genetic
geneticalgorithms
genetics
geneticprogramming
genetic_algorithm
alife
complexity
genetic_algorithms
math.optimization
december 2008 by mcroydon
LIBLINEAR -- A Library for Large Linear Classification
december 2008 by mcroydon
"LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM."
programming
software
library
statistics
math
opensource
clustering
download
c
ai
classification
datamining
matlab
machinelearning
machine-learning
machine_learning
uni
svm
liblinear
ml
december 2008 by mcroydon
Main Page - NLTK
october 2008 by mcroydon
I feel like I've been down this road before.
python
programming
development
software
reference
code
library
tools
book
api
books
science
computer
research
algorithms
language
freeware
text
toolkit
natural
textmining
nlp
languages
datamining
processing
linguistics
nltk
opensource
ai
wordnet
october 2008 by mcroydon
GATE, A General Architecture for Text Engineering
october 2008 by mcroydon
The tools. There they are.
programming
library
software
tools
research
java
search
language
architecture
information
framework
text
analysis
ai
opensource
linguistics
mining
tool
datamining
information_extraction
textmining
nlp
gate
semantic
text-mining
extraction
ontology
knowledge
information-extraction
october 2008 by mcroydon
LingPipe Home
october 2008 by mcroydon
Someday.
programming
software
development
library
code
tools
java
api
research
free
language
search
framework
text
analysis
clustering
opensource
ai
lucene
linguistics
parser
classification
datamining
ir
nlp
textmining
entity-extraction
lingpipe
information_extraction
natural
october 2008 by mcroydon
Colophon « Dataninja
august 2008 by mcroydon
Worth checking out. I was hoping to find more dataviz with LaTeX stuff, but that's okay.
programming
software
data
reference
blog
osx
tutorial
tools
science
research
statistics
latex
analysis
text
blogs
academic
analytics
tips
economy
r
sas
stata
datamining
methodology
august 2008 by mcroydon