amy + datamining 127
Big data and APIs for PHP developers - SXSW 2011
march 2011 by amy
Presentation: Big Data & APIs for PHP Developers /via @lxt @EliW #BigData #SXSW #PHP
thinkup
php
analytics
APIs
datamining
SXSW
BigData
march 2011 by amy
Metamarkets Blog » Blog Archive » Why Generic Machine Learning Fails
march 2011 by amy
Fabulous piece by @josephreisinger about tradeoffs between data and algorithms in machine learning via @ginablaber
machine_learning
analysis
datamining
march 2011 by amy
About TeleHash
february 2011 by amy
"TeleHash is a new wire protocol for exchanging JSON in a real-time and fully decentralized manner, enabling applications to connect directly and participate as servers on the edge of the network. It is designed to efficiently route and distribute small bits of data in order for applications to discover each other directly or in relation to events around piece of shared content. The core benefits of TeleHash over other similar platforms and protocols is that it is both generic (not tied to any specific application or content structures) and is radically decentralized with no servers or points of central control."
p2p
datamining
web_services
json
february 2011 by amy
Creator of Instant Messaging Protocol to Launch App Platform for Your Life
february 2011 by amy
RT @jeremie Awesome! RT Creator of Instant Messaging Protocol to Launch App Platform for Your Life via @marshallk
"Jeremie Miller is a revered figure among developers, best known for building XMPP, the open source protocol that powers most of the Instant Messaging apps in the world. Now Miller has raised funds and is building a team that will develop software aimed directly at the future of the web.
Called The Locker Project, the open source service will capture what's called exhaust data from users' activities around the web and offline via sensors, put it firmly in their own possesion and then allow them to run local apps that are built to leverage their data."
p2p
social_media
storage
datamining
from twitter_favs
"Jeremie Miller is a revered figure among developers, best known for building XMPP, the open source protocol that powers most of the Instant Messaging apps in the world. Now Miller has raised funds and is building a team that will develop software aimed directly at the future of the web.
Called The Locker Project, the open source service will capture what's called exhaust data from users' activities around the web and offline via sensors, put it firmly in their own possesion and then allow them to run local apps that are built to leverage their data."
february 2011 by amy
drewconway/strata_bootcamp - GitHub
february 2011 by amy
All code, slides and notes in support of "Data Bootcamp" tutorial at O'Reilly's Strata Conference 2011.
presentation
reference
datamining
analytics
big_data
february 2011 by amy
Crossbow: Whole Genome Resequencing Analysis in the Clouds
february 2011 by amy
Genotyping from short reads using cloud computing
hadoop
genetics
analysis
analytics
datamining
february 2011 by amy
Beyond Amazon: How to Make Recommendations Smarter | Fast Company
january 2011 by amy
"…The people who are likely to produce breakthroughs--the really smart smarty-pants in the math departments of the world’s universities--don’t have access to large bodies of real-world data. And without real-world data, they can come up with as many hypotheses and new types of math as they like, but they’ll never really know if it actually works in the real world. It’s like trying to learn how to serve without tennis balls. You can swing as much as you like, but until you actually hit a real-live ball, you can never be sure if your swing would actually place a ball in the serve box.
For their part, the people who have real-world data--the Amazons and eBays of the world--can’t share it with the researchers for reasons of customer privacy. “Even if we anonymize it, we’re handcuffed because we can’t give out data that can be reasonably be used to reconstruct who someone really is,” the Chief Scientist, Darren Vengroff, tells Fast Company.
Vengroff, however, has come up with a novel solution: He’s created a “black box” of sorts with real-world data that researchers can use to run experiments on. Researchers won’t be able to look at the data, but they will be able to dump their algorithms in and have the box spit out results, which the researchers can then use to refine their hypotheses.…"
analytics
datamining
recommendation_systems
cloud_computing
data
For their part, the people who have real-world data--the Amazons and eBays of the world--can’t share it with the researchers for reasons of customer privacy. “Even if we anonymize it, we’re handcuffed because we can’t give out data that can be reasonably be used to reconstruct who someone really is,” the Chief Scientist, Darren Vengroff, tells Fast Company.
Vengroff, however, has come up with a novel solution: He’s created a “black box” of sorts with real-world data that researchers can use to run experiments on. Researchers won’t be able to look at the data, but they will be able to dump their algorithms in and have the box spit out results, which the researchers can then use to refine their hypotheses.…"
january 2011 by amy
Clustering - Introduction
january 2011 by amy
A Tutorial on Clustering Algorithms
algorithms
datamining
statistics
reference
thinkup
january 2011 by amy
Award for Math Model that Relates Biomarkers of Asthma with Clinical Outcomes
december 2010 by amy
Award for Math Model that Relates Biomarkers of Asthma with Clinical Outcomes: The Seeker is looking... #datamining
#datamining
datamining
from twitter_favs
december 2010 by amy
Online book: Introduction to data mining
november 2010 by amy
Online book: Introduction to data mining: this online book was created by the Data Mining group at t... #datamining
#datamining
datamining
from twitter_favs
november 2010 by amy
PredictiveDB is SQL-based predictive software (currently in alpha)
november 2010 by amy
PredictiveDB uses SQL for on-line learning or predicting - become an alpha tester #datamining
#datamining
datamining
from twitter_favs
november 2010 by amy
Capital One Made Me Different Loan Offers Depending On Which Browser I Used - The Consumerist
november 2010 by amy
Your choice of browser may affect the price you get on loan offers via @slashdot #datamining #strataconf
#strataconf
#datamining
strataconf
datamining
from twitter
november 2010 by amy
Fusion Tables - Google Labs
october 2010 by amy
Fusion Tables is a service for managing large collections of tabular data in the cloud. You can upload tables of up to 100MB and share them with collaborators, or make them public. You can apply filters and aggregation to your data, visualize it on maps and other charts, merge data from multiple tables, and export it to the Web or csv files. You can also conduct discussions about the data at several levels of granularity, such as rows, columns and individual cells.
analysis
datamining
google
visualizations
october 2010 by amy
IBM to buy more analytics companies, rivals worried
october 2010 by amy
IBM to buy more analytics companies, rivals worried: IBM plans to buy more companies within the m... #datamining
#datamining
datamining
from twitter_favs
october 2010 by amy
IBM to buy more analytics companies, rivals worried
october 2010 by amy
IBM to buy more analytics companies, rivals worried: IBM plans to buy more companies within the m... #datamining
#datamining
datamining
from twitter_favs
october 2010 by amy
Apache Mahout:: Scalable machine-learning and data-mining library
september 2010 by amy
"Apache Mahout's goal is to build scalable machine learning libraries. "
apache
hadoop
mapreduce
machine_learning
datamining
analytics
september 2010 by amy
A Chip That Digests Data and Calculates the Odds
august 2010 by amy
A Chip That Digests Data and Calculates the Odds: The company is redesigning processing circuits from ... #datamining
#datamining
datamining
from twitter_favs
august 2010 by amy
Pervasive DataRush Reaches 2 TB/Hour Processing Speed
august 2010 by amy
Pervasive DataRush Reaches 2 TB/Hour Processing Speed: Pervasive DataRush(tm) processes MalStone B-10 d... #datamining
#datamining
datamining
from twitter_favs
august 2010 by amy
Zynga Stresses Visual Access to Its Vast Data Volumes
august 2010 by amy
Zynga Stresses Visual Access to Its Vast Data Volumes: Zynga has over 230 million users monthly playing... #datamining
#datamining
datamining
from twitter_favs
august 2010 by amy
Privacy-Aware Knowledge Discovery: Novel Applications and New Techniques
july 2010 by amy
Privacy-Aware Knowledge Discovery: Novel Applications and New Techniques: edited by Bonchi & Ferrari. C... #datamining
#datamining
datamining
from twitter_favs
july 2010 by amy
Memphis Cuts Crime With Predictive Analytics
july 2010 by amy
Memphis Cuts Crime With Predictive Analytics: reports 31% drop in crime rate to knowing when and where ... #datamining
#datamining
datamining
from twitter_favs
july 2010 by amy
Introduction to Data Mining
july 2010 by amy
Provides both theoretical and practical coverage of all data mining topics.
Includes extensive number of integrated examples and figures.
Offers instructor resources including solutions for exercises and complete set of lecture slides.
Assumes only a modest statistics or mathematics background, and no database knowledge is needed.
Topics covered include; predictive modeling, association analysis, clustering, anomaly detection, visualization.
datamining
books
Includes extensive number of integrated examples and figures.
Offers instructor resources including solutions for exercises and complete set of lecture slides.
Assumes only a modest statistics or mathematics background, and no database knowledge is needed.
Topics covered include; predictive modeling, association analysis, clustering, anomaly detection, visualization.
july 2010 by amy
Data Mining and Rule Engine Software Download
june 2010 by amy
Data Mining and Rule Engine Software Download: CMSR is a new generation data mining and rule engine sof... #datamining
#datamining
datamining
from twitter_favs
june 2010 by amy
Gephi: The Open Graph Viz Platform
may 2010 by amy
Gephi is an interactive visualization and exploration platform for all kinds of networks and complex systems, dynamic and hierarchical graphs.
Runs on Windows, Linux and Mac OS X. Gephi is open-source and free
#datamining
datamining
visualizations
from twitter_favs
Runs on Windows, Linux and Mac OS X. Gephi is open-source and free
may 2010 by amy
Welcome | ScraperWiki
may 2010 by amy
ScraperWiki code libraries make scraping, parsing and geocoding simple
datamining
api
python
may 2010 by amy
Google Prediction API - Google Code
may 2010 by amy
The Prediction API enables access to Google's machine learning algorithms to analyze your historic data and predict likely future outcomes. Upload your data to Google Storage for Developers, then use the Prediction API to make real-time decisions in your applications. The Prediction API implements supervised learning algorithms as a RESTful web service to let you leverage patterns in your data, providing more relevant information to your users. Run your predictions on Google's infrastructure and scale effortlessly as your data grows in size and complexity.
machine_learning
datamining
statistics
google
may 2010 by amy
Language Log » Mapping the Demographics of American English with Twitter
may 2010 by amy
Mapping the Demographics of American English with Twitter - this will be a very rich area for academics #twitter
twitter
linguistics
language
datamining
#twitter
from twitter_favs
may 2010 by amy
Beautiful Data - O'Reilly Media
may 2010 by amy
RT @OReillyMedia #Ebook Deal of the Day: Beautiful Data - Only $9.99 use code DDBDA
books
ebooks
datamining
data
#Ebook
Ebook
visualizations
from twitter
may 2010 by amy
Data mining with WEKA, Part 1: Introduction and regression
april 2010 by amy
Data mining with WEKA, Part 1: Introduction and regression: an intro to data mining and to WEKA, free a... #datamining
#datamining
datamining
from twitter_favs
april 2010 by amy
Boeing gives $1.25 million to launch new tech institute
april 2010 by amy
Boeing gives $1.25 million to launch new tech institute: Vancouver Institute for Visual Analytics bring... #datamining
#datamining
datamining
from twitter_favs
april 2010 by amy
(500) http://www.dataminingblog.com/what-is-a-good-classification-accuracy-in-data-mining/
april 2010 by amy
Interesting post: What is a good classification accuracy in data mining? #datamining /RT @paulblaser
#datamining
datamining
from twitter_favs
april 2010 by amy
My Data Mining Weblog » Watch Online Data Mining Tutorial with RapidMiner 5.0
march 2010 by amy
Watch Online Data Mining Tutorial with RapidMiner 5.0
datamining
statistics
from twitter_favs
march 2010 by amy
PeteSearch: How to split up the US
february 2010 by amy
How to split up the US: what facebook tells us about the shape of the country. More: (via @brady)
analytics
datamining
statistics
culture
geography
february 2010 by amy
krimelabb.com
november 2009 by amy
krimelabb.com is ...
Austin city-wide crime data
free and no registration is required for basic service
a tool for viewing crime data from a variety of perspectives
updated daily
a mashup that leverages value from APD data, Google Maps and Streetview, news media and other resources
austin
mashups
datamining
Austin city-wide crime data
free and no registration is required for basic service
a tool for viewing crime data from a variety of perspectives
updated daily
a mashup that leverages value from APD data, Google Maps and Streetview, news media and other resources
november 2009 by amy
Flickr: Cool Data Visualization Techniques - Information Visualization
november 2009 by amy
Fantastic Flickr pool collecting cool data visualization techniques http://is.gd/4PbAh
twitter_fav
@brainpicker
datamining
analysis
images
visualizations
november 2009 by amy
New Genomic Zoo to Collect DNA of 10,000 Vertebrate Species | Popular Science
november 2009 by amy
New Genomic Zoo to Collect DNA of 10,000 Vertebrate Species http://bit.ly/2CjdFJ
twitter_fav
@PopSci
genetics
analysis
datamining
november 2009 by amy
CourseWiki - cs448b - Data Visualization
october 2009 by amy
Stanford course in data visualization
education
statistics
datamining
visualizations
from delicious
october 2009 by amy
Many Eyes
september 2009 by amy
Just realized that my old friend and information visualization wizard @wattenberg is on Twitter. Check this out: http://bit.ly/2MdMUH
datamining
discovery
twitter_fav
@alexiskold
statistics
tools
visualizations
september 2009 by amy
Project ‘Gaydar’: An MIT experiment raises new questions about online privacy - The Boston Globe
september 2009 by amy
Our digital bread crumbs can add up to something bigger. http://bit.ly/N8NEI ^VG
twitter_fav
@nytimesbits
datamining
privacy
september 2009 by amy
related tags
#datamining ⊕ #datapimp ⊕ #decisiontrees ⊕ #Ebook ⊕ #gov20 ⊕ #machinelearning ⊕ #stats ⊕ #strataconf ⊕ #twitter ⊕ @alexiskold ⊕ @brainpicker ⊕ @nytimesbits ⊕ @PopSci ⊕ academia ⊕ ack ⊕ aggregator ⊕ algorithms ⊕ analysis ⊕ analytics ⊕ apache ⊕ api ⊕ APIs ⊕ archives ⊕ art ⊕ austin ⊕ aws ⊕ BigData ⊕ big_data ⊕ bioinformatics ⊕ biology ⊕ blogging ⊕ books ⊕ cloud_computing ⊕ collaboration ⊕ collaborative_filtering ⊕ computer_languages ⊕ cool ⊕ copyright ⊕ culture ⊕ data ⊕ database ⊕ datamining ⊖ datapimp ⊕ datbases ⊕ decisiontrees ⊕ demographics ⊕ discovery ⊕ Ebook ⊕ ebooks ⊕ ec2 ⊕ economics ⊕ education ⊕ genetics ⊕ genomics ⊕ geography ⊕ google ⊕ gov20 ⊕ government ⊕ graphics ⊕ hadoop ⊕ hive ⊕ html5 ⊕ images ⊕ information_retrieval ⊕ iphone ⊕ javascript ⊕ journalism ⊕ json ⊕ language ⊕ law ⊕ library ⊕ linguistics ⊕ machinelearning ⊕ machine_learning ⊕ mapreduce ⊕ maps ⊕ mashups ⊕ math ⊕ news ⊕ nlp ⊕ open ⊕ open_source ⊕ p2p ⊕ php ⊕ politics ⊕ presentation ⊕ privacy ⊕ programming ⊕ psychology ⊕ python ⊕ rails ⊕ recommendation ⊕ recommendation_systems ⊕ reference ⊕ research ⊕ resources ⊕ ruby ⊕ s3 ⊕ scalability ⊕ science ⊕ search ⊕ sex ⊕ social_media ⊕ society ⊕ software/social ⊕ statistics ⊕ stats ⊕ storage ⊕ strataconf ⊕ svd ⊕ SXSW ⊕ tbr ⊕ technology ⊕ thinkup ⊕ tools ⊕ transparency ⊕ trends ⊕ tutorials ⊕ twitter ⊕ twitter_fav ⊕ usa ⊕ visualization ⊕ visualizations ⊕ web ⊕ web-analytics ⊕ web_app ⊕ web_services ⊕Copy this bookmark: