tsuomela + data   236

International land deals: who is investing and where | Claire Provost | Global development | guardian.co.uk
The Land Matrix database is now the most comprehensive public source for information on international land deals – but it is not perfect. Some countries, for example, may be over-represented in the data simply because they are more transparent and routinely publish contract information. In other cases, the data may be skewed towards investors or countries that have featured in more media reports.
data  land-use  market  economics  exchange  international  from delicious
26 days ago by tsuomela
The Miso Project
Miso is an open source toolkit designed to expedite the creation of high-quality interactive storytelling and data visualisation content.
data  data-exploration  visualization  library  programming  javascript  toolkit  from delicious
5 weeks ago by tsuomela
AmericanScience: A Team Blog: 23andMe: Genetic Testing or Bioprospecting?
"On the one hand, Wojcicki highlighted her desire to empower consumer-patients by circumventing the medical establishment and making data available
technology  silicon-valley  consumerism  data  ownership  information  asymmetrical  sts  from delicious
6 weeks ago by tsuomela
Research Data Curation Bibliography
The Research Data Curation Bibliography includes selected English-language articles and technical reports that are useful in understanding the curation of digital research data in academic and other research institutions. For broader coverage of the digital curation literature, see the author's Digital Curation and Preservation Bibliography 2010.
data  curation  research  bibliography  data-curation  information-science  from delicious
6 weeks ago by tsuomela
Data for the Public Good - O'Reilly Media
As we move into an era of unprecedented volumes of data and computing power, the benefits aren't for business alone. Data can help citizens access government, hold it accountable and build new services to help themselves.

Simply making data available is not sufficient. The use of data for the public good is being driven by a distributed community of media, nonprofits, academics and civic advocates.

This report from O'Reilly Radar highlights the principles of data in the public good, and surveys areas where data is already being used to great effect, covering:

Consumer finance
Transit data
Government transparency
Data journalism
Aid and development
Crisis and emergency response
Healthcare
data  public  benefits  citizenship  citizen  from delicious
7 weeks ago by tsuomela
Wind Map
"This map shows you the delicate tracery of wind flowing over the US right now."
data  visualization  weather  meteorology  climate  wind-power  real-time  from delicious
9 weeks ago by tsuomela
36-350, Statistical Computing, Fall 2011
"Computational data analysis is an essential part of modern statistics. Competent statisticians must not just be able to run existing programs, but to understand the principles on which they work. They must also be able to read, modify and write code, so that they can assemble the computational tools needed to solve their data-analysis problems, rather than distorting problems to fit tools provided by others. This class is an introduction to programming, targeted at statistics majors with minimal programming knowledge, which will give them the skills to grasp how statistical software works, tweak it to suit their needs, recombine existing pieces of code, and when needed create their own programs.

Students will learn the core of ideas of programming — functions, objects, data structures, flow control, input and output, debugging, logical design and abstraction — through writing code to assist in numerical and graphical statistical analyses. Students will in particular learn how to write maintainable code, and to test code for correctness. They will then learn how to set up stochastic simulations, how to parallelize data analyses, how to employ numerical optimization algorithms and diagnose their limitations, and how to work with and filter large data sets. Since code is also an important form of communication among scientists, students will learn how to comment and organize code. "
syllabi  statistics  computing  data  analysis  from delicious
february 2012 by tsuomela
Open State Project: freeing state legislative information
"We're gathering legislative data directly from the states and making it available in a common format through a RESTful API and regular bulk downloads. "
open-access  open-government  data  collecting  state  government  from delicious
october 2011 by tsuomela
The most practical, innovative, and easy-to-read data management and BI books
"Technics Publications publishes the most practical, innovative, and easy-to-read data management and business intelligence books with our objective to improve the effectiveness of information technology in the workplace. "
book  publisher  data  management  information-science 
september 2011 by tsuomela
State of the Birds - 2011 Report — Public Lands and Waters
"This year’s report provides the nation’s first assessment of the distribution of birds on public lands and helps public agencies identify which species have significant potential for conservation in each habitat. The state of our birds is a measurable indicator of how well we are doing as stewards of our environment. The signal is clear. Greater conservation efforts on public lands and waters are needed to realize the vision of a nation sustained economically and spiritually by abundant natural resources and spectacular wildlife."
citizen-science  data  collaboration  ornithology  birds  environment  conservation  biology  science 
september 2011 by tsuomela
DAMA DMBOK - DAMA International
"A team of DAMA members has developed a new book titled "The DAMA Guide to the Data Management Body of Knowledge" (DAMA-DMBOK Guide), under the guidance of a new DAMA-DMBOK Editorial Board. This publication is available from April 5th, 2009.

The “body of knowledge” about data management is quite large and constantly growing. the DAMA-DMBOK Guide provides a “definitive introduction” to data management. It defines a standard industry view of data management functions, terminology and best practices, without detailing specific methods and techniques. The DAMA-DMBOK is not a complete authority on any specific topic, but will point readers to widely recognized publications, articles and websites for further reading."
data  management  best-practices  professional-association  library  science 
september 2011 by tsuomela
Edward Tufte’s “Slopegraphs”
"In Tufte’s June 1st post, he sums up the use of slopegraphs well: “Slopegraphs compare changes over time for a list of nouns located on an ordinal or interval scale.”

Basically: Any time you’d use a line chart to show a progression of univariate data among multiple actors over time, you might have a good candidate for a slopegraph."
information  visualization  design  data 
july 2011 by tsuomela
[1106.6062] The Life and Death of Unwanted Bits: Towards Proactive Waste Data Management in Digital Ecosystems
"Our everyday data processing activities create massive amounts of data. Like physical waste and trash, unwanted and unused data also pollutes the digital environment by degrading the performance and capacity of storage systems and requiring costly disposal. In this paper, we propose using the lessons from real life waste management in handling waste data. We show the impact of waste data on the performance and operational costs of our computing systems. To allow better waste data management, we define a waste hierarchy for digital objects and provide insights into how to identify and categorize waste data. Finally, we introduce novel ways of reusing, reducing, and recycling data and software to minimize the impact of data wastage "
data  computer-science  management 
july 2011 by tsuomela
Give Me My Data | David Bollier
A way to get data out of Facebook and an alternative to Twitter.
privacy  social-media  data  online  facebook  twitter 
july 2011 by tsuomela
Our data, ourselves - The Boston Globe
"What if privacy is keeping us from reaping the real benefits of the infosphere?"
data  privacy  lifeblog  health  tracking 
june 2011 by tsuomela
MediaShift . The Necessity of Data Journalism in the New Digital Community | PBS
In the information age, journalism needs to go further. Information bombards us. What is scarce is insight, understanding and knowledge.

The news industry is built on the assumption that if you give a reporter a notebook and a few days to ramp up, he can write authoritatively on any subject. That's not enough anymore. In today's information-rich world, reporters need to bring more to the table. To provide readers with truly insightful experiences, they need to have the kind of expertise that will allow them to see the story behind the story, to see what's really going on.
journalism  media  data  statistics  media-studies  big-data 
june 2011 by tsuomela
d8taplex
"d8taplex is a resource for finding and manipulating data from the web."
data-collection  data  visualization  time-series 
april 2011 by tsuomela
Home - CKAN - the Data Hub
"CKAN is the Comprehensive Knowledge Archive Network, a registry of open knowledge packages and projects (and a few closed ones).

CKAN makes it easy to find, share and reuse open content and data, especially in ways that are machine automatable."
science  scholarly-communication  data-curation  sharing  data  open-science  publishing  communication 
april 2011 by tsuomela
FigShare
"Scientific publishing as it stands is an inefficient way to do science on a global scale. A lot of time and money is being wasted by groups around the world duplicating research that has already been carried out. FigShare allows you to share all of your data, negative results and unpublished figures. In doing this, other researchers will not duplicate the work, but instead may publish with your previously wasted figures, or offer collaboration opportunities and feedback on preprint figures."
science  scholarly-communication  data-curation  sharing  data  open-science  publishing  communication 
april 2011 by tsuomela
Data.Rescue@Home
"Data rescue at home is an internet-based attempt to digitize historical weather data from all over the globe and make the digitised data available to everybody. Two projects are currently online: German radiosonde data form the Second World War and meteorological station data from Tulagi (Solomon Islands) for the first half of the 20th century. "
citizen-science  crowdsourcing  meteorology  climate  data  history  digitization 
march 2011 by tsuomela
Digging Into Data > Home
"What is the "challenge" we speak of? The idea behind the Digging into Data Challenge is to address how "big data" changes the research landscape for the humanities and social sciences. Now that we have massive databases of materials used by scholars in the humanities and social sciences -- ranging from digitized books, newspapers, and music to transactional data like web searches, sensor data or cell phone records -- what new, computationally-based research methods might we apply? As the world becomes increasingly digital, new techniques will be needed to search, analyze, and understand these everyday materials. Digging into Data challenges the research community to help create the new research infrastructure for 21st century scholarship. "
funding  grants  academic  data  big-data  social-science  humanities  digital 
march 2011 by tsuomela
Supplemental or detrimental? - The Scientist - Magazine of the Life Sciences
"The decision highlights a tension between the need for rigorous peer review of scientific research and the desire to provide as much data as possible to the scientific community.

"More data, in and of itself, is always a good thing -- if there aren't adverse effects," said Maunsell, who is also a neuroscientist at Harvard University. But peer review was becoming less effective because many reviewers failed to evaluate the supplemental materials, which the journal wasn't even required to provide, he explained. "We were taking a hit on peer review for something that wasn't formally our responsibility." "
science  peer-review  scholarly-communication  data  data-curation  publishing 
march 2011 by tsuomela
earthsensors.txt
" We are now interconnected enough for amateur scientists and hobbyists to help professional scientists with sensors scattered over the earth collecting data. "
citizen-science  distributed  data-collection  data  earth-science 
march 2011 by tsuomela
Wrangler
Wrangler allows interactive transformation of messy, real-world data into the data tables analysis tools expect.
data  tool  data-processing  cleanup  research 
february 2011 by tsuomela
Dryad data file: Dryad Home
Dryad is an international repository of data underlying peer-reviewed articles in the basic and applied biosciences. Dryad enables scientists to validate published findings, explore new analysis methodologies, repurpose data for research questions unanticipated by the original authors, and perform synthetic studies. Dryad is governed by a consortium of journals that collaboratively promote data archiving and ensure the sustainability of the repository.
science  data  sharing  open-science  repository  biology  bioscience 
january 2011 by tsuomela
What's stats got to do with it? - Expression Patterns Blog | Nature Publishing Group
"I realized I wasn't scared of stats: just bored and annoyed and wondering, indeed, what they had to do with various things. Keeping track of lots of data makes for pretty graphs and useful trends. Those kinds of stats are cool. But statistical analysis of data doesn't always make sense to the people using it. Not just because it's complicated, but because it's not always informative of what they're looking at. It has to make sense in context. You have to be able to actually answer the question "what's stats go to do with it?", and not just use it rhetorically like I did in most of this post. "
statistics  mathematics  graphs  data  understanding  significance 
january 2011 by tsuomela
Researchers launch hunt for endangered data : Nature News
Around the world, key scientific data are at risk of being lost, either because they are held on fragile or obsolete media or because they may be destroyed by researchers who are unaware of their value. Now a team of scientists is planning to scour museums and research institutes to draw up a global inventory of threatened data. Launched on 29 October, shortly after the biennial conference of the Committee on Data for Science and Technology in Stellenbosch, South Africa, the project aims to publish the inventory online in 2012.
data  preservation  science  research  curation  scholarly-communication  scientific  archive  history  historical 
december 2010 by tsuomela
The 70 Online Databases that Define Our Planet - Technology Review
So in the interests of stimulating this debate, I'm reproducing here Helbing's list of websites that are potential sources of data for an Earth Simulator. It makes for fascinating, if unnerving, reading:
data  data-collection  earth  simulation 
december 2010 by tsuomela
Obsidian Wings: The culture of conspiracy, the conspiracy of culture
In other words, Assange (and I presume Wikileaks as a whole) are publishing bulk-leaked documents because:
Authoritarian organizations (including most present-day national governments and large corporations) are naturally unjust, secretive, and conspiratorial.
The networks of information and influence inside such organizations are less stable to leaking than the corresponding networks inside open, just, and non-authoritarian organizations. They will either become hardened and (even more) inefficient, or they will become more open, less authoritarian, and more just. Either result is a win.
wikileaks  politics  secrecy  data  governance  government  foreign-policy  authoritarian 
december 2010 by tsuomela
Victorian Literature, Statistically Analyzed With New Process - NYTimes.com
Victorians were enamored of the new science of statistics, so it seems fitting that these pioneering data hounds are now the subject of an unusual experiment in statistical analysis. The titles of every British book published in English in and around the 19th century — 1,681,161, to be exact — are being electronically scoured for key words and phrases that might offer fresh insight into the minds of the Victorians.
data  literature  criticism  statistics  analysis  19c  english 
december 2010 by tsuomela
PeteSearch: Data is snake oil
It's because data is powerful but fickle. A lot of theoretically promising approaches don't work because there's so many barriers between spotting a possible relationship and turning it into something useful and actionable.
big-data  data  barriers 
december 2010 by tsuomela
Interactive: Which Banks Got Emergency Loans from the Fed During the Financial Meltdown?
Wednesday the Federal Reserve released data on more than 21,000 loans and other deals it made through a dozen emergency programs created during the financial crisis. The Fed used trillions of dollars to stabilize the economy when the housing bubble burst and credit markets froze.

We combined the Fed’s three programs that loaned directly to banks and other financial firms with the goal of getting them to start lending again. We hope to post on the Fed’s other programs soon.
data  media  federal-reserve  crisis  2008  business  banking  loans  journalism 
december 2010 by tsuomela
Changing software, hardware a nightmare for tracking scientific data
We've gone into all the problems involved with preserving and sharing scientific data in some detail, but the challenges don't end there. Typically, data doesn't speak for itself; it has to be analyzed and interpreted. And, these days, that analysis generally involves computer tools. Even basic images of cells can end up being processed to look for things like signal intensity and total area of signal. The results of that analysis may end up plugged into a spreadsheet and subjected to a further analysis. This general approach—a pipeline of software tools—makes documenting and reproducing exactly what happened to generate a final result.
science  data  data-management  communication  technology  computer 
november 2010 by tsuomela
« earlier      

related tags

3d  19c  academia  academic  academic-center  access  activism  aesthetics  agent-based-model  ajax  algorithms  amazon  america  american  analysis  analytics  anonymity  api  apollo-program  aquatic  architecture  archive  art  artificial-intelligence  astronomy  astrophysics  asymmetrical  atmosphere  augmented-reality  authoritarian  banking  barriers  behavior  benefits  best-practices  bibliography  big-data  big-science  biodiversity  biology  bioscience  birds  book  books  budget  business  bwca  calendar  canada  career  case-study  catalog  census  charts  citation  cities  citizen  citizen-journalism  citizen-science  citizenship  classification  cleanup  climate  cloud  co-science  coaching  collaboration  collecting  collection  commoditization  communication  community  comparative  comparison  complexity  computational-science  computer  computer-science  computing  conference  conservation  consortium  consumerism  consumption  contest  country(Vietnam)  craft  creativity  crisis  criticism  crowdsourcing  culture  curation  cyberinfrastructure  cyberscience  data  data-collection  data-curation  data-exploration  data-management  data-mining  data-processing  data-sharing  data-sources  database  debt  declassification  deforestation  degree  demography  design  development  digital  digital-humanities  digitization  discovery  dissertation  distributed  document  documentary  e-research  e-science  earth  earth-science  economics  edemocracy  education  elections  email  energy  england  english  environment  epistemology  ethics  europe  events  exchange  exoplanets  experiments  exploration  facebook  fads  fear  federal  federal-reserve  film  flood  flow  forecast  foreign-policy  freedom  funding  future  genetics  geocode  geography  geolocation  geology  geospatial  gis  global  global-warming  google  google-earth  governance  government  graduate-school  grants  graphics  graphs  grid  grid-computing  groups  health  hint  historical  history  housing  howto  humanities  hydrology  ibm  ideas  identity  ideology  incentives  income  income-distribution  industry  infographics  information  information-literacy  information-overload  information-science  infrastructure  inspiration  intellectual-property  interactive  international  internet  introduction  invasives  ipad  java  javascript  journal  journalism  kml  land-use  large  lecture  libraries  library  licensing  lifeblog  lifecycle  lifespan  list  literacy  literature  loans  machine-learning  magazine  mail  management  map  mapping  maps  market  markets  mashup  math  mathematics  measurement  media  media-reform  media-studies  medieval  metadata  meteorology  methodology  methods  metrics  metropolitan-area  military  minneapolis  minnesota  mobile  model  modeling  money  moon  mortality  nasa  netflix  network  network-analysis  networking  networks  news  norms  number  observation  online  open  open-access  open-data  open-government  open-information  open-science  open-source  organization  ornithology  ownership  particle-physics  pedagogy  peer-review  people  perception  personal  phd  philosophy  physical  physics  planets  policy  politcal-science  political-science  politics  polls  poltical-science  population  practice  precipitation  prediction  preservation  privacy  process  professional  professional-association  profile  programming  progressive  project(DataOne)  project(Utenn)  public  public-data  public-opinion  publisher  publishing  python  quantified-self  quantitative  queries  quora  r  radar  RAND  rdf  real-time  recession  recommendation-systems  record  records  reference  reform  regional  regions  religion  remix  remote-sensing  remotesensing  reports  repository  research  reuse  review  rewards  rich  risk  rss  satellite  scale  scholarly-communication  school(MIT)  school(UTenn)  science  scientific  scraping  search  searchengine  secrecy  semantic-web  semanticweb  sharing  si679  significance  silicon-valley  simulation  size  skewt  sla  snow  social  social-media  social-networks  social-science  sociology  software  sounding  space  species  standards  startup  state  state-government  statistics  storage  sts  supercomputing  surveillance  survey  syllabi  taxes  teaching  technology  technology-effects  text-analysis  time  time-series  tips  tool  toolkit  tools  tracking  training  transparency  treasury-department  trend  trends  twitter  ubicomp  understanding  united-nations  university  unix  urbanism  ux  via:vaguery  video  visual  visualization  volunteer  voting  war  water  wealth  weather  web  web-programming  web2.0  weblog  weblog-group  weblog-individual  weblog-recommendations  wiki  wikileaks  wind-power  windows  winter  workload  world  xml 

Copy this bookmark:



description:


tags: