mcroydon + datamining   64

Driving down the cost of Big-Data analytics - All Things Distributed
"The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud."
analytics  aws  bigdata  data  datamining 
september 2011 by mcroydon
Mining of Massive Datasets
Looks like a fantastic book on data mining.
book  books  data  datamining  mapreduce 
september 2011 by mcroydon
Pattern | CLiPS
A Python NLP package with emphasis on retrieving and analyzing language found on the web.
analysis  data  datamining  nlp  python 
february 2011 by mcroydon
Silk - A Link Discovery Framework for the Web of Data
"The Silk framework is a tool for discovering relationships between data items within different Linked Data sources. Data publishers can use Silk to set RDF links from their data sources to other data sources on the Web."
app  code  applications  data  datamining  framework  library  owl  linking  lod  opensource  linkeddata  linked-data  programming  python  rdf  semantic  semantic-web  web  tools  tool  sparql  software  semweb  semanticweb  silk 
october 2010 by mcroydon
Welcome to Elefant — Elefant
"Elefant (Efficient Learning, Large-scale Inference, and Optimisation Toolkit) is an open source library for machine learning licensed under the Mozilla Public License"
python  programming  open-source  software  code  tools  library  opensource  research  computer  math  ai  app  datamining  learning  gui  optimization  machinelearning  check  scipy  machine.learning  toolkit  ml  frameworks  c++  machine-learning  nicta  scipy2007 
november 2009 by mcroydon
LucidDB Home Page
"LucidDB is the first and only open-source RDBMS purpose-built entirely for data warehousing and business intelligence. It is based on architectural cornerstones such as column-store, bitmap indexing, hash join/aggregation, and page-level multiversioning."
programming  software  development  database  data  business  opensource  java  scalability  storage  distributed  cluster  databases  sql  db  datamining  olap  columndb  bi  datawarehouse  dbms  reporting  rdbms  luciddb  column  warehousing  column-store  data_warehouse  column-oriented  dwh 
october 2009 by mcroydon
Charu Aggarwal
Some pretty hefty thoughts on data mining and ML stuffs.
data  database  research  books  algorithms  management  db  datamining  academic  ibm  papers  mining  people  lists  data-mining  data_mining  privacy 
august 2009 by mcroydon
LIBLINEAR -- A Library for Large Linear Classification
"LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM."
programming  software  library  statistics  math  opensource  clustering  download  c  ai  classification  datamining  matlab  machinelearning  machine-learning  machine_learning  uni  svm  liblinear  ml 
december 2008 by mcroydon
Colophon « Dataninja
Worth checking out. I was hoping to find more dataviz with latex stuff, but that's okay.
programming  software  data  reference  blog  osx  tutorial  tools  science  research  statistics  latex  analysis  text  blogs  academic  analytics  tips  economy  r  sas  stata  datamining  methodology 
august 2008 by mcroydon

related tags

3d  aa  academic  ad  ai  algo  algorithm  algorithms  alife  amazon  analyser  analysis  analytics  and  animation  api  app  application  applications  architecture  article  artificial-intelligence  artificialintelligence  aws  balie  bayes  bayesian  bi  big  bigdata  bioinformatics  biology  bit  bitmap  bizgres  blog  blogs  book  bookmarking  bookmarks  books  browser  business  businessintelligence  c  c++  career  cartographie  cartography  census  challenge  chart  charts  check  class  classes  classification  classifier  click  cloud  cloudcomputing  cloudera  cloud_computing  cluster  clustering  code  coding  collaborative  collaborative_filtering  column  column-oriented  column-store  columndb  community  comparison  complexity  compress  compression  compsci  computer  computerscience  computer_science  computing  concurrency  content  contest  conversion  convert  corpus  course  courses  crawler  cs  data  data-mining  data-visualization  database  databases  dataflow  datamining  datascientist  dataset  datasets  datavisualization  datawarehouse  data_mining  data_warehouse  db  dbms  dbpedia  del.icio.us  delicious  demographics  design  dev  developers  development  diagram  discussion  dist  distance  distributed  distributedcomputing  dna  dot  download  draft  dwh  e-books  ebook  ebooks  ec2  economy  education  elasticmapreduce  emr  encyclopedia  engineering  english  entity  entity-extraction  eol  erp  evolution  examples  experiments  extract  extraction  fastbit  ferrett  filtering  flowingdata  forum  framework  frameworks  free  freebase  freelancing  freeware  friend  functional  future  ga  gate  geek  genealogy  genetic  geneticalgorithms  geneticprogramming  genetics  genetic_algorithm  genetic_algorithms  genome  genomics  gephi  gis  gov  government  graph  graphics  graphing  graphs  graphviz  greenplum  grid  gui  hackernews  hacks  hadoop  halvarian  harvard  hashing  hdfs  hive  howto  html  ibm  ie  index  indexing  inference  infographics  information  information-extraction  informationtheory  information_extraction  information_retrieval  infovis  infoviz  intelligence  intro  ir  java  jobs  journalism  kernel  kernel-machines  kernels  kmeans  knn  knowledge  language  languages  latex  learning  lecture  lectures  lesen  lib  liblinear  library  license:lgpl  license:PSF  lingpipe  linguistics  linked-data  linkeddata  linking  linux  list  lists  lod  log  logs  lucene  luciddb  machine  machine-learning  machine.learning  machinelearning  machine_learning  management  manning  map  map-reduce  mapreduce  maps  market  mashup  matching  math  math.optimization  mathematics  matlab  memory  methodology  metric  metrics  mining  mitchell  mix  ml  moa  module  mrjob  multidimensional  music  mysql  named  namedentityrecognition  natural  ner  netflix  network  networks  nicta  nlp  nltk  norvig  nosql  notes  numpy  olap  online  online_learning  ontology  open  open-source  opensource  opinion  optimization  oss  osx  owl  paper  papers  parse  parser  parsing  pattern_recognition  pdf  pdfminer  people  performance  perl  personalization  pig  population  postgres  postgresql  prediction  presentation  presentations  privacy  probability  processing  processing.org  programming  publicdata  pyml  python  r  rails  rdbms  rdf  realtime  recommendation  recommendations  recommender  reduce  reference  references  regression  reporting  research  resource  resources  review  reviews  ruby  rubyonrails  sas  scalability  scale  scaling  science  scientist  scipy  scipy2007  scraping  script  scripts  search  semantic  semantic-web  semanticweb  semantic_web  semweb  seo  sequence  server  sgd  sharding  silk  similar  similarity  skills  slides  slideshare  sna  social  socialgraph  socialmedia  socialnetwork  socialnetworks  software  solr  source  sparklines  sparql  spatial  species  sql  stanford  startup  stata  statistical  statistical_learning  statistics  stats  storage  stream  streaming  string  strings  structure  suggestions  svm  t  tagging  talk  taxonomy  technique  techniques  technology  text  text-mining  textmining  theory  time  tips  to-read  todo  tool  toolkit  tools  toread  towatch  tracking  transportation  trendingtopics  trends  tutorial  tutorials  twitter  uima  uni  useful  users  utilities  via:pskomoroch  video  vision  visualisation  visualization  warehouse  warehousing  web  web2.0  webdev  webservices  weka  wiki  wikipedia  wordnet  xml  yahoo  yelp 

Copy this bookmark:



description:


tags: