Metamarkets Blog » Blog Archive » Why Generic Machine Learning Fails
march 2011 by amy
Fabulous piece by @josephreisinger about tradeoffs between data and algorithms in machine learning via @ginablaber
machine_learning
analysis
datamining
Topicmarks - summarize your text documents in minutes
february 2011 by amy
"Topicmarks summarizes text documents for you electronically."
text
analysis
Data Wrangler
february 2011 by amy
"Wrangler is an interactive tool for data cleaning and transformation. Spend less time formatting and more time analyzing your data."
analysis
analytics
data
tools
visualization
Crossbow: Whole Genome Resequencing Analysis in the Clouds
february 2011 by amy
Genotyping from short reads using cloud computing
hadoop
genetics
analysis
analytics
datamining
Fusion Tables - Google Labs
october 2010 by amy
Fusion Tables is a service for managing large collections of tabular data in the cloud. You can upload tables of up to 100MB and share them with collaborators, or make them public. You can apply filters and aggregation to your data, visualize it on maps and other charts, merge data from multiple tables, and export it to the Web or csv files. You can also conduct discussions about the data at several levels of granularity, such as rows, columns and individual cells.
analysis
datamining
google
visualizations
Flickr: Cool Data Visualization Techniques - Information Visualization
november 2009 by amy
Fantastic Flickr pool collecting cool data visualization techniques http://is.gd/4PbAh
twitter_fav
@brainpicker
datamining
analysis
images
visualizations
New Genomic Zoo to Collect DNA of 10,000 Vertebrate Species | Popular Science
november 2009 by amy
New Genomic Zoo to Collect DNA of 10,000 Vertebrate Species http://bit.ly/2CjdFJ
twitter_fav
@PopSci
genetics
analysis
datamining
Trendsmap - Real-time local Twitter trends
october 2009 by amy
Wow, http://trendsmap.com/ is very impressive. [via @charlesarthur]
twitter_fav
@girlonetrack
twitter
analysis
visualizations
Welcome to Hive!
september 2009 by amy
Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, ad hoc querying and analysis of large datasets stored in Hadoop files. It provides a mechanism to put structure on this data and it also provides a simple query language called Hive QL which is based on SQL and which enables users familiar with SQL to query this data. At the same time, this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers to do more sophisticated analysis which may not be supported by the built-in capabilities of the language.
hadoop
mapreduce
apache
query_language
open_source
analysis
from delicious
Networks / Pajek
september 2009 by amy
analysis and visualization of large networks, runs on Windows
analysis
visualizations
Scientific Computing World
september 2009 by amy
"Analysis is the mother of invention" - good article on models of invention and the role of statistical analysis http://bit.ly/4mVIpY
twitter_fav
@Werner
statistics
analysis
innovation
Memeorandum Colors: Visualizing Political Bias with Greasemonkey - Waxy.org
october 2008 by amy
using SVD to cluster blogs by their political linking patterns
politics
usa
blogging
statistics
machine_learning
cool
datamining
analysis
visualizations
Google to Host Terabytes of Open-Source Science Data | Wired Science from Wired.com
february 2008 by amy
provide a home for terabytes of open-source scientific datasets. The storage will be free to scientists and access to the data will be free for all.
google
research
datamining
collaboration
search
storage
open_source
analysis
visualizations