Boundary — Home
february 2012 by mcroydon
Real-time analytics and visualization.
analytics
monitoring
visualization
network
february 2012 by mcroydon
Driving down the cost of Big-Data analytics - All Things Distributed
september 2011 by mcroydon
"The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud."
analytics
aws
bigdata
data
datamining
september 2011 by mcroydon
Rainbird: Realtime Analytics at Twitter (Strata 2011)
february 2011 by mcroydon
Counting really fast in a distributed fashion. Cassandra, Zookeeper, Scribe, Thrift, and Scala under the hood.
analytics
cassandra
twitter
february 2011 by mcroydon
Wrangler
february 2011 by mcroydon
Impressive data munger.
analysis
analytics
data
tools
visualization
via:jacobkm
february 2011 by mcroydon
Beyond Amazon: How to Make Recommendations Smarter | Fast Company
february 2011 by mcroydon
You put the chicken in the pot... bork bork bork.
amazon
analytics
cloud_computing
data
datamining
february 2011 by mcroydon
Kafka
december 2010 by mcroydon
"Kafka is a distributed publish/subscribe messaging system"
activity
asynchronous
backend
data
analytics
messaging
december 2010 by mcroydon
Six Metrics Every Business Should Track « The Metric System
october 2010 by mcroydon
Track things.
analysis
business
customer
cs
data
ecommerce
toread
metric
marketing
management
leadership
entrepreneurship
analytics
metrics
saas
seo
socialmedia
startup
statistics
stats
webdev
web
users
user
tracking
biz
measurement
track
october 2010 by mcroydon
chartbeat / FrontPage
september 2010 by mcroydon
"chartbeat is a real-time analytics service, which provides an interface for viewing what users on your site are doing right now, as well as providing historical context."
analytics
api
development
integration
internet
javascript
stats
statistics
service
programming
semantic
live
json
tracking
web
chartbeat
september 2010 by mcroydon
Mixpanel | Real-time Web Analytics, Funnel Analysis
july 2010 by mcroydon
Event-based analytics.
analytics
july 2010 by mcroydon
Piwik - Web analytics - Open source
may 2010 by mcroydon
Open source analytics built on PHP/MySQL.
analysis
analytics
chart
charts
development
free
software
php
opensource
open
mysql
monitoring
source
statistics
stats
webdev
tool
tools
tracking
traffic
visualization
website
webdesign
web2.0
web
may 2010 by mcroydon
NoSQL at Twitter (NoSQL EU 2010)
april 2010 by mcroydon
A pretty thorough look behind the curtain at Twitter.
analytics
architecture
cassandra
cloud
databases
database
db
grid
hbase
nosql
hadoop
presentation
pig
programming
read
twitter
slideshare
slides
scribe
scaling
scalability
flockdb
yam
april 2010 by mcroydon
5 great ways to waste money in Google Adwords « Successful Software
february 2010 by mcroydon
Do the opposite of this.
web
google
tools
business
toread
howto
article
technology
internet
startup
tips
marketing
optimization
entrepreneurship
analytics
ads
money
strategy
advertising
economics
cost
seo
no_tag
sem
adsense
adwords
february 2010 by mcroydon
Measuring Measures: Learning About Statistical Learning
january 2010 by mcroydon
Big list of book recommendations / study order (part 1).
programming
data
reference
toread
tutorial
research
book
books
statistics
math
education
ai
learning
list
cs
datamining
stats
machinelearning
analytics
probability
machine.learning
courses
machine_learning
ml
mathematics
machine-learning
statistical
statistical_learning
intro
suggestions
january 2010 by mcroydon
Lean Analytics For Startups
january 2010 by mcroydon
Definitely worth clicking through and thinking about.
toread
analysis
startup
analytics
metrics
deck
startups
powerpoint
leanstartup
january 2010 by mcroydon
Airtrout » Reverse Engineering Precinct Maps
january 2010 by mcroydon
A pretty fantastic solution to a pretty hard problem.
via:pskomoroch
data
google
mapping
maps
geo
gis
math
map
perl
politics
analytics
election
government
mathematics
elections
law
diagram
spatial
vote
voronoi
precinct
january 2010 by mcroydon
GIS on Hadoop - Nathan Kerr
december 2009 by mcroydon
WKT for the win.
data
gis
geo
hadoop
parallel
analytics
processing
census
spatial
via:pskomoroch
december 2009 by mcroydon
MongoDB - Using MongoDB for Real-time Analytics
december 2009 by mcroydon
Upserts look like they might be quite helpful with realtime analytics.
database
webdev
howto
statistics
stats
analytics
metrics
nosql
realtime
mongodb
december 2009 by mcroydon
Building a Data Intensive Web Application with Cloudera, Hadoop, Hive, Pig, and EC2 | Cloudera
november 2009 by mcroydon
A nice look at end-to-end data analysis of big datasets using things like Pig and Hive.
programming
web
data
database
tools
webdev
business
toread
howto
tutorial
amazon
dev
scalability
hadoop
architecture
aws
computing
cluster
ec2
pig
trends
hive
cloudera
cloudcomputing
analytics
datamining
mapreduce
cloud
application
november 2009 by mcroydon
SourceForge.net: pydoop
november 2009 by mcroydon
Python C++ wrappers for HDFS and MapReduce. It's probably quicker than Dumbo.
python
code
library
hadoop
c++
analytics
project
examples
hdfs
via:pskomoroch
november 2009 by mcroydon
Hadoop, Pig, and Twitter (NoSQL East 2009)
november 2009 by mcroydon
An awesome deck showing of Pig and how Twitter uses it.
programming
data
database
howto
server
dev
search
statistics
hadoop
presentation
mapreduce
datamining
twitter
analytics
slides
presentations
nosql
bigdata
socialnetworks
slideshare
hdfs
pig
big
functional
data_mining
november 2009 by mcroydon
An Overview of Data Mining Techniques
november 2009 by mcroydon
Data mining vs. statistics. Fight.
data
reference
database
toread
tutorial
research
books
algorithms
statistics
algorithm
education
ai
learning
clustering
datamining
paper
tutorials
analytics
mining
data-mining
datawarehouse
data_mining
bi
and
techniques
technique
november 2009 by mcroydon
tr.im your URLs
august 2009 by mcroydon
"There is no way for us to monetize URL shortening -- users won't pay for it -- and we just can't justify further development since Twitter has all but annointed bit.ly the market winner."
web
tools
internet
statistics
webdesign
tech
web2.0
online
tool
twitter
tracking
utility
blogging
webapp
service
analytics
services
links
url
urls
url-shortening
tinyurl
shorten
trim
short
shortener
shorturl
august 2009 by mcroydon
HadoopDB Project
july 2009 by mcroydon
Interesting approach, we'll see if it has legs.
programming
software
development
database
java
opensource
research
scalability
distributed
performance
scaling
hadoop
cluster
postgresql
databases
mysql
hadoopdb
map-reduce
hive
dbms
rdbms
2009
analytics
postgres
db
sql
mapreduce
yale
vldb
july 2009 by mcroydon
Infomaki
july 2009 by mcroydon
Pretty awesome lightweight usability testing. This would probably be a great tool for getting feedback from early mockups.
software
tools
ruby
opensource
free
rails
statistics
ui
webdesign
usability
architecture
open
source
testing
accessibility
analytics
test
ux
ia
user-experience
infomaki
feedback
ruby_on_rails
nypl
july 2009 by mcroydon
Project Voldemort Blog : Building a terabyte-scale data cycle at LinkedIn with Hadoop and Project Voldemort
july 2009 by mcroydon
More on what makes Voldemort tick.
design
development
data
database
toread
erlang
java
scalability
storage
architecture
distributed
performance
scaling
hadoop
cluster
grid
cloud
mapreduce
db
caching
analytics
arch
key-value
dht
keyvalue
scale
voldemort
batch
linkedin
datastore
july 2009 by mcroydon
Using Mongo for Real-Time Analytics
july 2009 by mcroydon
Potentially useful.
database
stats
analytics
reporting
realtime
cloud-computing
mongodb
july 2009 by mcroydon
Rise of the Data Scientist | FlowingData
june 2009 by mcroydon
Indeed.
web
design
data
journalism
business
toread
science
visualization
research
statistics
education
article
future
jobs
datamining
computerscience
visualisation
career
mathematics
data-mining
flowingdata
freelancing
halvarian
skills
scientist
bigdata
datascientist
analytics
june 2009 by mcroydon
Engineering @ Facebook's Notes | Facebook
june 2009 by mcroydon
Big big big data warehousing / data mining.
design
data
database
blog
java
map
scalability
storage
distributed
computing
scaling
article
hadoop
sql
mapreduce
db
reading
facebook
analytics
rdbms
arch
comment
hive
datawarehouse
warehouse
data-warehousing
hdfs
dw
june 2009 by mcroydon
Greenplum: the petabyte-scale database for data warehousing and business intelligence.
may 2009 by mcroydon
Petabyte data mining and data warehousing.
programming
software
development
data
database
business
technology
opensource
storage
scalability
performance
cluster
postgresql
startup
open
postgres
sql
mapreduce
datamining
db
analytics
oss
reporting
rdbms
intelligence
bi
businessintelligence
datawarehouse
greenplum
bizgres
may 2009 by mcroydon
How to Find the Right Chart Type for your Numeric Data
january 2009 by mcroydon
It's like a work-related choose your own adventure.
data
reference
howto
visualization
business
free
statistics
graphics
charts
analysis
graph
chart
graphs
article
tips
tutorials
excel
analytics
advice
visualisation
infographics
flowchart
guides
january 2009 by mcroydon
High-Performance Click Analysis with MySQL | MySQL Performance Blog
january 2009 by mcroydon
Good thoughts on write-heavy systems. A lot of it applies outside the MySQL ecosystem.
design
web
data
database
blog
statistics
storage
scalability
analysis
scaling
performance
stats
mysql
optimization
hadoop
sql
tracking
ad
article
analytics
datamining
english
reporting
db
sharding
seo
warehouse
click
lesen
realtime
datawarehouse
january 2009 by mcroydon
FeedBurner API - Google Code
october 2008 by mcroydon
Feedburner has an API.
programming
web
development
reference
google
blog
webdev
web2.0
api
mashup
rss
ideas
atom
entertainment
feed
syndication
analytics
feeds
feedburner
october 2008 by mcroydon
Colophon « Dataninja
august 2008 by mcroydon
Worth checking out. I was hoping to find more dataviz with latex stuff, but that's okay.
programming
software
data
reference
blog
osx
tutorial
tools
science
research
statistics
latex
analysis
text
blogs
academic
analytics
tips
economy
r
sas
stata
datamining
methodology
august 2008 by mcroydon
Analytics According to Captain Kirk
april 2008 by mcroydon
Best visualization today.
analysis
analytics
sci-fi
star-trek
statistics
visualization
april 2008 by mcroydon
pgFouine - a PostgreSQL log analyzer
february 2008 by mcroydon
This looks insanely useful. Hadn't seen it before.
analysis
analytics
apache
benchmark
database
databases
debian
development
monitor
optimization
performance
postgres
quality
postgresql
scalability
server
software
sysadmin
sql
system
tools
february 2008 by mcroydon
Text Analytics Wiki: Welcome to the Text Analytics Wiki
january 2008 by mcroydon
Worth checking out when I have some free time.
text
analysis
analytics
wiki
database
mining
data
january 2008 by mcroydon
related tags
academic ⊕ accessibility ⊕ activity ⊕ ad ⊕ admin ⊕ ads ⊕ adsense ⊕ advertising ⊕ advice ⊕ adwords ⊕ ai ⊕ ajax ⊕ algorithm ⊕ algorithms ⊕ amazon ⊕ analyser ⊕ analysis ⊕ analytics ⊖ analyzer ⊕ and ⊕ android ⊕ apache ⊕ api ⊕ apple ⊕ application ⊕ applications ⊕ apps ⊕ arch ⊕ architecture ⊕ art ⊕ article ⊕ asynchronous ⊕ atom ⊕ aws ⊕ backend ⊕ batch ⊕ benchmark ⊕ berkeley ⊕ bi ⊕ big ⊕ bigdata ⊕ billing ⊕ biz ⊕ bizgres ⊕ blog ⊕ blogging ⊕ blogs ⊕ bloom ⊕ bloom-filter ⊕ bloomfilter ⊕ bloomfilters ⊕ book ⊕ books ⊕ browser ⊕ browsers ⊕ bubble ⊕ business ⊕ businessintelligence ⊕ c++ ⊕ caching ⊕ career ⊕ cassandra ⊕ census ⊕ chart ⊕ chartbeat ⊕ charts ⊕ click ⊕ cloud ⊕ cloud-computing ⊕ cloudcomputing ⊕ cloudera ⊕ cloud_computing ⊕ cluster ⊕ clustering ⊕ code ⊕ comment ⊕ company ⊕ comparison ⊕ computer ⊕ computerscience ⊕ computing ⊕ conversion ⊕ convert ⊕ cool ⊕ cost ⊕ courses ⊕ cs ⊕ customer ⊕ data ⊕ data-mining ⊕ data-structures ⊕ data-warehousing ⊕ database ⊕ databases ⊕ datamining ⊕ dataprocessing ⊕ datascientist ⊕ datastore ⊕ datawarehouse ⊕ data_mining ⊕ db ⊕ dbms ⊕ debian ⊕ debugging ⊕ deck ⊕ deployment ⊕ design ⊕ dev ⊕ development ⊕ dht ⊕ diagram ⊕ distributed ⊕ django ⊕ django-apps ⊕ dw ⊕ ec2 ⊕ ecommerce ⊕ economics ⊕ economy ⊕ education ⊕ election ⊕ elections ⊕ english ⊕ entertainment ⊕ entrepreneurship ⊕ erlang ⊕ ethics ⊕ event ⊕ events ⊕ example ⊕ examples ⊕ excel ⊕ extract ⊕ extraction ⊕ facebook ⊕ fail ⊕ fast ⊕ feed ⊕ feedback ⊕ feedburner ⊕ feeds ⊕ filter ⊕ filters ⊕ finance ⊕ flash ⊕ flockdb ⊕ flow ⊕ flowchart ⊕ flowingdata ⊕ free ⊕ freelancing ⊕ functional ⊕ future ⊕ ga ⊕ gadgets ⊕ gem ⊕ gems ⊕ geo ⊕ gis ⊕ google ⊕ googleanalytics ⊕ government ⊕ graph ⊕ graphics ⊕ graphs ⊕ greenplum ⊕ grid ⊕ guide ⊕ guides ⊕ hacks ⊕ hadoop ⊕ hadoopdb ⊕ halvarian ⊕ hbase ⊕ hdfs ⊕ hive ⊕ hop ⊕ howto ⊕ html ⊕ ia ⊕ ideas ⊕ infographics ⊕ infomaki ⊕ infovis ⊕ integration ⊕ intelligence ⊕ interesting ⊕ internet ⊕ intro ⊕ iphone ⊕ java ⊕ javascript ⊕ jobs ⊕ journalism ⊕ json ⊕ key-value ⊕ keyvalue ⊕ latex ⊕ law ⊕ leadership ⊕ leanstartup ⊕ learning ⊕ lesen ⊕ library ⊕ linkedin ⊕ links ⊕ list ⊕ live ⊕ log ⊕ logging ⊕ machine-learning ⊕ machine.learning ⊕ machinelearning ⊕ machine_learning ⊕ management ⊕ manager ⊕ map ⊕ map-reduce ⊕ mapping ⊕ mapreduce ⊕ maps ⊕ market ⊕ marketing ⊕ mashup ⊕ math ⊕ mathematics ⊕ measurement ⊕ media ⊕ messaging ⊕ methodology ⊕ metric ⊕ metrics ⊕ mining ⊕ ml ⊕ mobile ⊕ money ⊕ mongodb ⊕ monitor ⊕ monitoring ⊕ mysql ⊕ network ⊕ news ⊕ nodebox ⊕ nosql ⊕ no_tag ⊕ nypl ⊕ nytimes ⊕ online ⊕ open ⊕ open-source ⊕ opensource ⊕ optimization ⊕ os ⊕ oss ⊕ osx ⊕ paper ⊕ parallel ⊕ parse ⊕ parser ⊕ parsing ⊕ pdf ⊕ pdfminer ⊕ performance ⊕ perl ⊕ php ⊕ pig ⊕ plugin ⊕ politics ⊕ postgres ⊕ postgresql ⊕ powerpoint ⊕ precinct ⊕ presentation ⊕ presentations ⊕ probability ⊕ processing ⊕ profile ⊕ profiling ⊕ programming ⊕ project ⊕ python ⊕ quality ⊕ query ⊕ queue ⊕ r ⊕ rails ⊕ rdbms ⊕ read ⊕ reading ⊕ realtime ⊕ redis ⊕ reference ⊕ reporting ⊕ research ⊕ rss ⊕ ruby ⊕ rubyonrails ⊕ ruby_on_rails ⊕ saas ⊕ sas ⊕ scala ⊕ scalability ⊕ scale ⊕ scaling ⊕ sci-fi ⊕ science ⊕ scientist ⊕ scraping ⊕ scribe ⊕ search ⊕ sem ⊕ semantic ⊕ seo ⊕ server ⊕ service ⊕ services ⊕ sharding ⊕ short ⊕ shorten ⊕ shortener ⊕ shorturl ⊕ skills ⊕ slides ⊕ slideshare ⊕ smartphones ⊕ socialmedia ⊕ socialnetworks ⊕ software ⊕ source ⊕ spatial ⊕ sql ⊕ star-trek ⊕ startup ⊕ startups ⊕ stata ⊕ statistical ⊕ statistical_learning ⊕ statistics ⊕ stats ⊕ storage ⊕ strategy ⊕ stream ⊕ streaming ⊕ suggestions ⊕ syndication ⊕ sysadmin ⊕ system ⊕ tech ⊕ technique ⊕ techniques ⊕ technology ⊕ test ⊕ testing ⊕ text ⊕ time ⊕ time-based ⊕ tinyurl ⊕ tips ⊕ tool ⊕ tools ⊕ toread ⊕ track ⊕ tracking ⊕ trading ⊕ traffic ⊕ trends ⊕ trim ⊕ tutorial ⊕ tutorials ⊕ twitter ⊕ ui ⊕ url ⊕ url-shortening ⊕ urls ⊕ usability ⊕ useful ⊕ user ⊕ user-experience ⊕ users ⊕ utilities ⊕ utility ⊕ ux ⊕ via:jacobkm ⊕ via:pskomoroch ⊕ visualisation ⊕ visualization ⊕ vldb ⊕ voldemort ⊕ voronoi ⊕ vote ⊕ warehouse ⊕ web ⊕ web-dev ⊕ web2.0 ⊕ webapp ⊕ webdesign ⊕ webdev ⊕ website ⊕ wiki ⊕ work ⊕ xml ⊕ yale ⊕ yam ⊕Copy this bookmark: