mcroydon + analysis   89

Apache Kafka
Design doc pointed out by Eric but interesting project as well.
analysis  distributed  logging  messaging 
november 2011 by mcroydon
Pattern | CLiPS
A Python NLP package with emphasis on retrieving and analyzing language found on the web.
analysis  data  datamining  nlp  python 
february 2011 by mcroydon
mnot’s blog: Digging Deeper with htracr
"There’s a lot of current activity on the binding between HTTP and TCP; from pipelining to SPDY, the frontier of Web performance lives between these layers."
analysis  capture  cool  development  howto  http  innovation  node  networking  network  net  monitoring  javascript  monitor  node.js  performance  nodejs  protocol  svg  sysadmin  tool  tcp  tracking  visualization  webdev  htracr  pcap  raphael  sniffer 
november 2010 by mcroydon
OpenTSDB - A Distributed, Scalable Monitoring System
OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable.
analysis  architecture  bigdata  cloud  data  database  db  java  lgpl  hbase  hadoop  development  graph  distributed  monitoring  nosql  opensource  operations  scalability  scale  time  sysadmin  software  storage  series  opentsdb  rrd  stumbleupon  time-series  timeseries 
november 2010 by mcroydon
New Startup Analyzes 100,000 Web Pages With a Snap of Your Fingers
These tools are getting a lot better but still require lots of human intervention to avoid false assertions.
analysis  extractiv  data  research  science  semantic  tool  startup  semanticweb  crawling 
november 2010 by mcroydon
Buzz Kill : LOL: The Life of Leo
"I was shouting into a vast echo chamber where no one could hear me because they were too busy shouting themselves."
analysis  attention  blogging  blog  facebook  blogs  social  news  microblogging  media  google  social_media  socialmedia  socialnetworking  twitter  web  leolaporte  googlebuzz  echochamber  criticism  blog  buzz  social.media 
august 2010 by mcroydon
Petit Log Analysis | Crunch Tools
An interesting set of tools for pondering over log files. Includes a little NLP and the ability to graph log files over time.
administration  analysis  logging 
july 2010 by mcroydon
Lean Analytics For Startups
Definitely worth clicking through and thinking about.
toread  analysis  startup  analytics  metrics  deck  startups  powerpoint  leanstartup 
january 2010 by mcroydon
Hw09 Counting And Clustering And Other Data Tricks
"Large scale computing is transformative for NYTimes.com."
hadoop  nytimes  data  analysis 
november 2009 by mcroydon
Nokia in Trouble? How Fast Can a Mobile Device Giant React? | MEOW! Blog
I believe that Nokia are already well ahead of this timeline but it's an interesting look at the potential challenges in front of them.
mobile  development  business  nokia  apple  blog  technology  iphone  management  analysis  culture  future  strategy  ux  economics  market  industry  mobility 
october 2009 by mcroydon
Training to Climb an Everest of Digital Data
Big data is big and almost always requires a completely different mindset than the one that is taught in computer science programs.
data  database  processing  google  news  toread  ibm  energy  datasets  mining  search  research  science  internet  algorithms  storage  scaling  education  hadoop  analysis  computer-science  datacuration 
october 2009 by mcroydon
9.520, Spring 2009
"Focuses on the problem of supervised and unsupervised learning from the perspective of modern statistical learning theory, starting with the theory of multivariate function approximation from sparse data."
research  statistics  algorithm  analysis  ai  learning  notes  mit  academic  machine-learning  machinelearning  lectures  ocr  courses  course  ml  class  classes  functional  lecture-notes  modeling  lecture_notes 
february 2009 by mcroydon
AnalyzersTokenizersTokenFilters - Solr Wiki
There are some really handy tokenizers and filters in here.
java  search  analysis  plugin  lucene  imported  searchengine  tokenizer  analyzers 
october 2008 by mcroydon
NetworkX
"NetworkX (NX) is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks."
python  programming  software  code  visualization  library  tools  science  computer  network  research  graphics  statistics  networking  math  algorithms  social  algorithm  framework  analysis  clustering  graph  opensource  graphs 
september 2008 by mcroydon
Colophon « Dataninja
Worth checking out. I was hoping to find more dataviz with latex stuff, but that's okay.
programming  software  data  reference  blog  osx  tutorial  tools  science  research  statistics  latex  analysis  text  blogs  academic  analytics  tips  economy  r  sas  stata  datamining  methodology 
august 2008 by mcroydon
The Economics of Online Backup - Inside Lightroom
Never underestimate the bandwidth of a station wagon full of backup tapes.
amazon  analysis  bandwidth  blogs  backup  business  internet  s3  storage  sysadmin 
february 2008 by mcroydon
timsort
I've skimmed this several times but need to read it front to back one of these days.
algorithm  algorithms  analysis  programming  python 
january 2008 by mcroydon
consensus
Open source collaborative filtering library in Python. According to PyPI it's PSF-licensed.
algorithm  analysis  collaboration  ideas  development  home  library  model  python  programming  research  resources  software  social  statistics  tools  toread  web  web.py  webdev  book:PCI 
december 2007 by mcroydon
Numpy Home Page
The bees knees for Python matrix operations and other mathwanking.
analysis  api  c  code  development  dev  computing  library  math  open-source  matlab  model  module  programming  random  reference  science  statistics  stats  tools 
december 2007 by mcroydon
LIBSVM -- A Library for Support Vector Machines
"LIBSVM is an integrated software for support vector classification, (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR) and distribution estimation (one-class SVM ). It supports multi-class classification."
ai  algorithm  algorithms  analysis  classification  clustering  code  cs  computer-science  open-source  visualization  text 
december 2007 by mcroydon
Record Breaking Number of People Visited Newspaper Web Sites in Q2
Yes your print revenue is shrinking. But they're going to your website.
news  journalism  media  industry  web  analysis 
july 2007 by mcroydon
« earlier      

related tags

2read  3d  academic  ad  admin  administration  adobe  advertising  advice  ai  airlines  airplane  airport  algorithm  algorithms  alife  amazing  amazon  analyser  analysis  analytics  analyzer  analyzers  animation  apache  api  apple  apps  arch  architecture  archive  article  artificialintelligence  attention  audio  aws  backup  bandwidth  bash  bashreduce  benchmark  bi  bigdata  billing  binary  binary-diff  binary_diff  bioinformatics  biz  bizmodel  bizmodels  blog  blogging  blogs  bloom  bloom-filter  bloomfilter  bloomfilters  blumenthalisms  book  book:PCI  books  broadcasting  bsd  bsdiff  business  businessmodel  businessmodels  buzz  c  c++  cap  capture  car  cars  cartographie  cartography  cassandra  categorization  chart  charts  class  classes  classification  click  clicktrack  cloc  cloud  cloud-computing  cloudcomputing  cluster  clustering  code  coding  collaboration  community  company  comparison  compiler  compilers  complexity  compression  computer  computer-science  computers  computerscience  computer_science  computing  contacts  context  conversion  convert  cool  count  course  courses  crawling  criticism  cs  css  ctan  culture  cuny  customer  data  data-mining  data-structures  database  databases  datacuration  dataflow  datamining  datasets  datavisualization  datawarehouse  data_mining  db  dbpedia  debian  debugging  deck  delta  design  detection  dev  development  diagram  diff  dist  distance  distributed  distribution  django  django-apps  dna  doc  document  drawing  e-books  ebook  ec2  echochamber  echonest  ecommerce  economics  economy  education  einstein  elastic  energy  engineering  english  entity-extraction  entrepreneurship  essay  ethics  example  examples  excel  extract  extraction  extractiv  facebook  fast  fastcompany  feeds  filter  filters  finance  flight  flights  flow  flowchart  flowingdata  font  fonts  framework  frameworks  free  freeware  friends  from:zedshaw  fun  functional  future  ga  game  games  gate  generator  genetic  geneticalgorithms  geneticprogramming  genetics  genetic_algorithm  genetic_algorithms  gephi  github  goo  google  googlebuzz  grammar  graph  graph-theory  graphics  graphs  graphviz  grep  grid  grids  guides  hack  hacking  hacks  hadoop  hal  hardware  hbase  hdfs  help  hip-hop  hivemind  home  howto  html  htracr  http  humor  humour  hyperlocal  ibm  ideas  illustrator  imported  index  industry  infographic  infographics  information  information-extraction  information-retrieval  information_extraction  information_retrieval  infovis  infoviz  innovation  interesting  internet  investing  ip  iphone  ipod  ir  jarvis  java  javascript  jaylinks  jeffjarvis  journalism  journalismus  kloc  kmeans  know  knowledge  language  languages  last.fm  latex  leadership  leanstartup  learning  lecture-notes  lectures  lecture_notes  leolaporte  lesen  lex  lexer  lexical  lexing  lgpl  lib  library  libs  license:bsd  license:gpl  license:gplv2  license:PSF  license:zpl  lines  lingpipe  linguistics  linux  list  loc  log  logging  logo  lsa  lucene  lyrics  machine  machine-learning  machine.learning  machinelearning  mahout  mainstreammedia  management  manager  map  mapping  mapreduce  maps  market  marketing  mashup  matching  math  math.optimization  mathematics  maths  matlab  measurement  media  medienwandel  merge  messaging  methodology  metric  metrics  microblogging  mining  mit  ml  moa  mobile  mobility  model  modeling  models  module  money  monitor  monitoring  multicore  multimedia  music  mysql  natural  natural-language-processing  natural_language_processing  net  network  networking  networks  networkx  newmedia  news  newspapers  newsroom  nlp  node  node.js  nodejs  nokia  nosql  notes  npr  numpy  nytimes  ocr  of  online  onlinejournalism  ontology  open  open-source  opensource  opentsdb  operations  optimization  osx  package  paper  papers  parallel  parse  parser  parsers  parsing  patch  pcap  pdf  pdfminer  performance  perl  php  planes  plugin  pos  postgres  postgresql  powerpoint  prediction  presentation  press  processing  profile  profiling  program  programming  protocol  psychology  public  python  quality  query  queue  r  radio  random  ranking  rap  raphael  rating  ratings  rdf  re  read  read-later  realtime  redesign  redis  reduce  refactoring  reference  regex  regression  remix  replication  reporting  research  resources  review  reviews  rrd  rss  rsync  ruby  s3  saas  sarcasm  sas  scalability  scale  scaling  scan  scanner  scanning  sci-fi  science  scipy  scm  scraping  script  scripting  search  searchengine  security  semantic  semantic-web  semanticweb  semantic_web  sentiment  seo  sequence  series  server  service  sharding  shell  similarity  simple  sloc  sna  sniffer  social  social.media  socialgraph  socialmedia  socialnetwork  socialnetworking  socialnetworks  socialsoftware  social_media  society  software  solr  song  source  sparql  speech  spoiler  sql  star-trek  startup  startups  stata  statistics  stats  stock  storage  strategy  stream  streaming  string  strings  stumbleupon  survey  svd  svg  sysadmin  system  tagger  tagging  taxonomy  tcp  technology  tempo  term  termextract  termextraction  tex  text  text-mining  textmining  theory  time  time-based  time-series  timeseries  tips  todo  token  tokenizer  tool  tools  topia  toread  track  tracking  trading  traffic  travel  trend  trends  tutorial  tutorials  tv  twitter  typesetting  typography  unix  useful  user  users  utilities  utility  ux  vector  via:chl  via:jacobian  via:jacobkm  visualisation  visualization  voting  warehouse  web  web-dev  web.py  web2.0  webdesign  webdev  webservice  website  weka  wiki  wikipedia  windows  worldofgoo  xml  zedshaw 

Copy this bookmark:



description:


tags: