donturn + kdd   94

Apache Flume
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. Its main goal is to deliver data from applications to Apache Hadoop's HDFS. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.
opensource  apache  logs  datascience  kdd  data 
september 2011 by donturn
Needlebase
merge data, crawl web data, then chart and explore it
api  data  database  datascience  kdd  etl  stats  charts  map 
february 2011 by donturn
Wrangler
looks a lot like excel. but in the cloud? but with bigger data sets?
analysis  analytics  data  tools  visualization  viz  etl  kdd  excel 
february 2011 by donturn
cityofsound: The street as platform
The way the street feels may soon be defined by what cannot be seen with the naked eye.
mobile  design  kdd  data_mining  privacy  ubicomp  wireless  wifi  web  research  scifi  urbanism  networks  data  information 
march 2008 by donturn
Pollen Viewer
Amazing Java app to show plant migration and adaptation over time (and by time they mean centuries).
science  gis  gps  climate  flora  fauna  paleoclimatology  kdd 
february 2006 by donturn
« earlier      

related tags

academic  acm  algorithms  analysi  analysis  analytics  apache  api  apps  astronomy  austin  bayes  behavior  blocking  blog  blogs  book  brewster  business  cf  cfp  charts  cities  classification  climate  cloud  clustering  code  collaborative_filtering  community  conference  crawling  crm  cscw  data  database  datamining  datascience  data_mining  data_science  data_warehouse  dbms  design  dev  dns  donturn  eclipse  economics  email  empirical  enterprise  epidemics  etl  excel  fauna  filteirng  filtering  finance  flora  folksonomy  genres  gis  Google  gps  graphics  graphs  grid  gui  hadoop  hardware  hci  ia  ibm  indexing  information  information_architecture  information_retrieval  information_retrievaln  internet  intranet  ir  iseek  itunes  java  kdd  km  kms  language  links  link_analysis  location  log  logging  logs  mac  machinelearning  machine_learning  mail  map  mapreduce  markets  math  meme  metrics  metroia  microsoft  mobile  mpi  music  mysql  network  networks  nlp  olap  ontology  opensource  open_source  organization  osx  pagerank  paleoclimatology  personalization  php  pim  pkm  planet  portals  powerpoint  prediction  privacy  professor  professors  programming  python  quant  quantia  quantitative  rdf  readability  recommender  recommender_systems  recsys  regex  relevance  reports  research  rstats  science  scifi  scraper  screensaver  search  security  sem  semantic_web  social_computing  social_networks  social_software  software  sourceforge  spreadsheet  sql  startup  statistics  stats  statsitcs  stocks  strategy  syllabi  syllabus  tagging  tags  text  textbook  tools  ubicomp  ui  urbanism  utexas  ux  video  visualization  viz  web  webir  weka  wifi  wikipedia  windows  wired  wireless  www  xml  zipf 

Copy this bookmark:



description:


tags: