The Registry! :: FRBR
2 days ago by rybesh
This page provides quick links for the Registered FRBR Element Sets and Value Vocabularies.
frbr
metadata
linkeddata
library
vocabulary
2 days ago by rybesh
The Metadata is the Interface: Better Description for Better Discovery of Archives and Special Collections, Synthesized from User Studies
15 days ago by rybesh
This essay—part of a series of OCLC Research projects to mobilize unique materials synthesizes evidence of what descriptive information people say they need for research.
userresearch
metadata
interface
search
specialcollections
archives
15 days ago by rybesh
DCMI Abstract Model
16 days ago by rybesh
This document specifies an abstract model for Dublin Core metadata. The primary purpose of this document is to specify the components and constructs used in Dublin Core metadata. It defines the nature of the components used and describes how those components are combined to create information structures. It provides an information model which is independent of any particular encoding syntax. Such an information model allows us to gain a better understanding of the kinds of descriptions that we are encoding and facilitates the development of better mappings and cross-syntax translations.
metadata
model
inls520
16 days ago by rybesh
Visualizing Oral History? | Visualizing the Past
6 weeks ago by rybesh
...we can see the outlines for how we might curate oral history that offers a richer perspective on sound. These points of agreement might include some of the following: a) recognition that oral history is fundamentally an aural experience and not just a text; b) oral history should be evaluated for meaning at a clip or segment level, not just at the level of the 60-minute or 90-minute interview; c) clips and segments should be connectable across interview or even collection; d) our metadata schemes, as well as our work in representing oral history to public audiences, have to account such rich metadata schemes; e) collecting oral history is one thing, making it open and accessible is another goal that the oral history community should embrace; f) efforts at linked open data have to account for segment and clip-level metadata; g) and, it is vital that we involve communities in processing and connecting to oral history.
oralhistory
archives
metadata
6 weeks ago by rybesh
organizing early modern texts
7 weeks ago by rybesh
We don’t live in the binary producer/consumer world anymore. Even if we did, there is simply too much data to deal with. Its stewards simply do not have all necessary expertise or resources to organize it most effectively and flexibly. Without doubt, this involves plenty of technical challenges (standards, interfaces, infrastructure). But these are trivial in comparison to the real challenge: shifting community expectations that erroneous metadata can and should be edited by researchers themselves. And while we’re at it, we might broaden our view of metadata to include not only the usual fields (author, date, etc), but additional description as well (abstracts, section headings, keywords, etc) that makes the texts more findable.
metadata
digitalhumanities
inls520
7 weeks ago by rybesh
Pop-Up Archive | Difference between Element Sets and Item Types in Omeka
7 weeks ago by rybesh
The biggest difference here is that element sets are available for all items, while item type elements are available only to those items that belong to that item type. So, if you need a standardized set of elements that is available across all item types, an element set is the way to go.
description
metadata
inls520
standards
7 weeks ago by rybesh
Coyle's InFormation: If not RDF, then what?
7 weeks ago by rybesh
It also seems to me that we have everything to gain by beginning our work on a data format with no particular serialization in mind. We could go from RDA to RDA-as-data and then on to RDA-as-RDF. I see some dangers in skipping the middle step, mainly that we could end up making some decisions that fit RDA into RDF but that are problematic for other serializations.
inls520
webinfo
rda
cataloging
metadata
standards
models
7 weeks ago by rybesh
An organization ontology
8 weeks ago by rybesh
This document describes a core ontology for organizational structures, aimed at supporting linked-data publishing of organizational information across a number of domains. It is designed to allow domain-specific extensions to add classification of organzations and roles, as well as extensions to support neighbouring information such as organizational activities.
metadata
standard
data
description
inls520
8 weeks ago by rybesh
Terms for describing people
8 weeks ago by rybesh
This document defines a set of terms for describing people. It defines how to describe people's characteristics such as names or addresses and how to relate people to other things, for example to organizations or projects. For each term, guidance on the usage within a running example is provided. This document also defines mappings to widely used vocabularies to enable interoperability.
metadata
standard
description
inls520
8 weeks ago by rybesh
The RDF Data Cube Vocabulary
8 weeks ago by rybesh
There are many situations where it would be useful to be able to publish multi-dimensional data, such as statistics, on the web in such a way that it can be linked to related data sets and concepts. The Data Cube vocabulary provides a means to do this using the W3C RDF (Resource Description Framework) standard. The model underpinning the Data Cube vocabulary is compatible with the cube model that underlies SDMX (Statistical Data and Metadata eXchange), an ISO standard for exchanging and sharing statistical data and metadata among organizations. The Data Cube vocabulary is a core foundation which supports extension vocabularies to enable publication of other aspects of statistical data flows.
metadata
standard
data
description
inls520
webinfo
statistics
science
8 weeks ago by rybesh
Data Catalog Vocabulary (DCAT)
8 weeks ago by rybesh
DCAT is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. This document defines the schema and provides examples for its use.
By using DCAT to describe datasets in data catalogs, publishers increase discoverability and enable applications easily to consume metadata from multiple catalogs. It further enables decentralized publishing of catalogs and facilitates federated dataset search across sites. Aggregated DCAT metadata can serve as a manifest file to facilitate digital preservation.
metadata
standard
data
description
inls520
webinfo
By using DCAT to describe datasets in data catalogs, publishers increase discoverability and enable applications easily to consume metadata from multiple catalogs. It further enables decentralized publishing of catalogs and facilitates federated dataset search across sites. Aggregated DCAT metadata can serve as a manifest file to facilitate digital preservation.
8 weeks ago by rybesh
Crowd Sourcing Metadata
9 weeks ago by rybesh
The New York Public Library recently launched its first foray into crowd sourcing metadata by exposing 40,000 image pages of turn of the century restaurant and cruise ship menus: “What’s On the Menu?” The goal of the project was to widely distribute the transcription of the menu items into a structured and reusable form. The site was exceedingly popular in its first few months.
Recent activity has flattened somewhat, raising issues regarding the public’s appetite for these projects. More importantly, the menus project raised hard questions about the quality of the crowd sourced content, the longevity of the data, and the disposition of the data (e.g. What is it? Is it good enough for our purposes? Should we keep it? If yes, where does it belong?).
This presentation will discuss these issues and propose some alternative views on metadata, user-generated content, and the intersection of the two.
metadata
crowdsourcing
Recent activity has flattened somewhat, raising issues regarding the public’s appetite for these projects. More importantly, the menus project raised hard questions about the quality of the crowd sourced content, the longevity of the data, and the disposition of the data (e.g. What is it? Is it good enough for our purposes? Should we keep it? If yes, where does it belong?).
This presentation will discuss these issues and propose some alternative views on metadata, user-generated content, and the intersection of the two.
9 weeks ago by rybesh
[whatwg] RDFa
9 weeks ago by rybesh
"In controlled environments, e.g. on a single site, or in a single person's
media library, or within a small coherent community where all the
participants have compatible goals, it is possible to get enough
discipline that metadata is both reliable and useful. And for such
communities we have a raft of extension mechanisms, and clashes can be
avoided easily by simply using names that nobody in the community is
already using.
"But as soon as this kind of thing is applied to people outside the
tightnit community, the metadata becomes an utter mess, misused, wrong,
missing, syntactically incorrect, semantically incorrect, unusable. We
have shown time and time again that when metadata mechanisms face the
wider Web community, they fail. Ignoring this doesn't make it go away."
rdfa
web
metadata
media library, or within a small coherent community where all the
participants have compatible goals, it is possible to get enough
discipline that metadata is both reliable and useful. And for such
communities we have a raft of extension mechanisms, and clashes can be
avoided easily by simply using names that nobody in the community is
already using.
"But as soon as this kind of thing is applied to people outside the
tightnit community, the metadata becomes an utter mess, misused, wrong,
missing, syntactically incorrect, semantically incorrect, unusable. We
have shown time and time again that when metadata mechanisms face the
wider Web community, they fail. Ignoring this doesn't make it go away."
9 weeks ago by rybesh
ResourceAbout - RDFa Working Group Wiki
9 weeks ago by rybesh
This is a suggestion to consider @resource as preferable over @about. The purpose is to promote a slightly simpler and more uniform practice of RDFa use, especially in the RDFa Lite subset.
rdfa
metadata
semweb
9 weeks ago by rybesh
HTML Data Guide
12 weeks ago by rybesh
Microformats, RDFa and microdata all enable consumers to extract data from HTML pages. This data may be embedded within enhanced search engine results, exposed to users through browser extensions, aggregated across websites or used by scripts running within those HTML pages.
This guide aims to help publishers and consumers of HTML data use it well. With several syntaxes and vocabularies to choose from, it provides guidance about how to decide which meets the publisher's or consumer's needs. It discusses when it is necessary to mix syntaxes and vocabularies and how to publish and consume data that uses multiple formats. It describes how to create vocabularies that can be used in multiple syntaxes and general best practices about the publication and consumption of HTML data.
microdata
microformats
rdfa
html
standards
metadata
semweb
webinfo
This guide aims to help publishers and consumers of HTML data use it well. With several syntaxes and vocabularies to choose from, it provides guidance about how to decide which meets the publisher's or consumer's needs. It discusses when it is necessary to mix syntaxes and vocabularies and how to publish and consume data that uses multiple formats. It describes how to create vocabularies that can be used in multiple syntaxes and general best practices about the publication and consumption of HTML data.
12 weeks ago by rybesh
Stanford Topic Modeling Toolbox
12 weeks ago by rybesh
Includes an implementation of PLDA.
Partially Labeled Dirchlet Allocation (PLDA) [paper] is a topic model that extends and generalizes both LDA and Labeled LDA. The model is analogous to Labeled LDA except that it allows more than one latent topic per label and a set of background labels. Learning and inference in the model is much like the example above for Labeled LDA, but you must additionally specify the number of topics associated with each label.
lda
plda
metadata
topicmodels
nlp
socialscience
scala
Partially Labeled Dirchlet Allocation (PLDA) [paper] is a topic model that extends and generalizes both LDA and Labeled LDA. The model is analogous to Labeled LDA except that it allows more than one latent topic per label and a set of background labels. Learning and inference in the model is much like the example above for Labeled LDA, but you must additionally specify the number of topics associated with each label.
12 weeks ago by rybesh
Partially Labeled Topic Models for Interpretable Text Mining
12 weeks ago by rybesh
Much of the world's electronic text is annotated with human-interpretable labels, such as tags on web pages and subject codes on academic publications. Effective text mining in this setting requires models that can exibly account for the textual patterns that underlie the observed labels while still discovering unlabeled topics. Neither supervised classification, with its focus on label prediction, nor purely unsupervised learning, which does not model the labels explicitly, is appropriate. In this paper, we present two new partially supervised generative models of labeled text, Partially Labeled Dirichlet Allocation (PLDA) and the Partially Labeled Dirichlet Process (PLDP). These models make use of the unsupervised learning machinery of topic models to discover the hidden topics within each label, as well as unlabeled, corpus-wide latent topics. We explore applications with qualitative case studies of tagged web pages from del.icio.us and PhD dissertation abstracts, demonstrating improved model interpretability over traditional topic models. We use the many tags present in our del.icio.us dataset to quantitatively demonstrate the new models' higher correlation with human relatedness scores over several strong baselines.
topicmodels
metadata
12 weeks ago by rybesh
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression
12 weeks ago by rybesh
Although fully generative models have been successfully used to model the contents of text documents, they are often awkward to apply to combinations of text data and document metadata. In this paper we propose a Dirichlet-multinomial regression (DMR) topic model that includes a log-linear prior on document-topic distributions that is a function of observed features of the document, such as author, publication venue, references, and dates. We show that by selecting appropriate features, DMR topic models can meet or exceed the performance of several previously published topic models designed for specific data.
topicmodels
metadata
12 weeks ago by rybesh
DMRLoader (Mallet 2 API)
12 weeks ago by rybesh
This class loads data into the format for the MALLET Dirichlet-multinomial regression (DMR). DMR topic models learn topic assignments conditioned on observed features.
topicmodels
metadata
12 weeks ago by rybesh
rNews is here. And this is what it means. - NYTimes.com
february 2012 by rybesh
All you have to do is view source on any nytimes.com article published on or after January, 23 2012. In the HTML you will see new attributes like ‘itemtype’, ‘itemprop’ and ‘itemid’. If you paste an article URL into the Google Rich Snippets tool, you can see a parse of the structured data now embedded into every nytimes.com article,
linkeddata
microdata
news
metadata
webinfo
inls520
february 2012 by rybesh
Implications of MARC Tag Usage on Library Metadata Practices
february 2012 by rybesh
Only a small subset of MARC 21 fields are used in WorldCat. Even when considering the MARC fields that are heavily used in non-book formats, there are
only 21 to 30 tags that occur in 10% or more records.
libraries
cataloging
metadata
inls520
only 21 to 30 tags that occur in 10% or more records.
february 2012 by rybesh
Conditional Random Fields
february 2012 by rybesh
Conditional random fields (CRFs) are a probabilistic framework for labeling and segmenting structured data, such as sequences, trees and lattices. The underlying idea is that of defining a conditional probability distribution over label sequences given a particular observation sequence, rather than a joint distribution over both label and observation sequences. The primary advantage of CRFs over hidden Markov models is their conditional nature, resulting in the relaxation of the independence assumptions required by HMMs in order to ensure tractable inference. Additionally, CRFs avoid the label bias problem, a weakness exhibited by maximum entropy Markov models (MEMMs) and other conditional Markov models based on directed graphical models. CRFs outperform both MEMMs and HMMs on a number of real-world tasks in many fields, including bioinformatics, computational linguistics and speech recognition.
machinelearning
nlp
crf
textmining
metadata
february 2012 by rybesh
Web Data Commons
february 2012 by rybesh
Web Data Commons will extract all Microformat, Microdata and RDFa data that is contained in the Common Crawl corpus and will provide the extracted data for free download in the form of RDF-quads as well as CSV-tables for common entity types (e.g. product, organization, location, ...).
semweb
rdfa
web
metadata
webinfo
microdata
microformats
database
february 2012 by rybesh
The Problem of the Yellow Milkmaid: A Business Model Perspective on Open Metadata
january 2012 by rybesh
"The Milkmaid," one of Johannes Vermeer's most famous pieces, depicts a scene of a woman quietly pouring milk into a bowl. During a survey the Rijksmuseum discovered that there were over 10,000 copies of the image on the internet—mostly poor, yellowish reproductions. As a result of all of these low-quality copies on the web, according to the Rijksmuseum, "people simply didn't believe the postcards in our museum shop were showing the original painting. This was the trigger for us to put high-resolution images of the original work with open metadata on the web ourselves. Opening up our data is our best defence against the 'yellow Milkmaid.'"
metadata
business
art
museum
january 2012 by rybesh
The Doubly Correlated Nonparametric Topic Model
december 2011 by rybesh
Topic models are learned via a statistical model of variation within document collections, but designed to extract meaningful semantic structure. Desirable traits include the ability to incorporate annotations or metadata associated with documents; the discovery of correlated patterns of topic usage; and the avoidance of parametric assumptions, such as manual specification of the number of topics. We propose a doubly correlated nonparametric topic (DCNT) model, the first model to simultaneously capture all three of these properties. The DCNT models metadata via a flexible, Gaussian regression on arbitrary input features; correlations via a scalable square-root covariance representation; and nonparametric selection from an unbounded series of potential topics via a stick-breaking construction. We validate the semantic structure and predictive performance of the DCNT using a corpus of NIPS documents annotated by various metadata.
topicmodels
metadata
december 2011 by rybesh
Tutorial - Facebook Developers
october 2011 by rybesh
This tutorial will guide you through the key steps to build, test, and publish your first Open Graph app. We will build a sample app that allows users on your website to publish stories about cooking recipes.
semweb
metadata
webinfo
ontology
linkeddata
howto
october 2011 by rybesh
An Uber-comparison of RDFa, Microdata and Microformats | The Beautiful, Tormented Machine
october 2011 by rybesh
This post attempts to clarify those technical issues by providing an easy-to-read list of similarities and differences between RDFa, Microdata and Microformats. A simple table summarizing all features across each structured data syntax is listed below. Each feature is linked to a brief explanation of the feature toward the bottom of the page.
linkeddata
microdata
microformats
rdfa
metadata
webinfo
october 2011 by rybesh
Case Study: Contextual Search for Volkswagen and the Automotive Industry
october 2011 by rybesh
In summary the key benefits of using Semantic Web technology for Volkswagen were as follows:
A standardised interface to data and content, accessible to developers with different skillsets, using different technologies within and without the organisation.
Separation of concerns between information and application, both logically and physically.
Increases value, reusability and accessibility of data.
Very powerful federation features.
Adoption and use didn't necessitate process or change management. It could be leveraged at any stage within the product lifecycle painlessly and gracefully, both internally and externally.
semweb
linkeddata
search
inls520
metadata
A standardised interface to data and content, accessible to developers with different skillsets, using different technologies within and without the organisation.
Separation of concerns between information and application, both logically and physically.
Increases value, reusability and accessibility of data.
Very powerful federation features.
Adoption and use didn't necessitate process or change management. It could be leveraged at any stage within the product lifecycle painlessly and gracefully, both internally and externally.
october 2011 by rybesh
Definition of hashtags, the easy hashtag directory
august 2011 by rybesh
Discover what hashtags really mean, and add your own definitions in seconds.
twitter
hashtags
social
metadata
folksonomy
tagging
inls520
august 2011 by rybesh
ReMix: Linked Data and the Semantic Web
august 2011 by rybesh
From a Stanford Libraries newsletter: "Linked Data could provide the antidote to the chaos and complexity of the current overabundant array of too simple search mechanisms with too little precision and too short recall of relevant results."
inls520
metadata
rdf
semweb
linkeddata
august 2011 by rybesh
Archivematica
august 2011 by rybesh
Archivematica is a comprehensive digital preservation system. Archivematica uses a micro-services design pattern to provide an integrated suite of free and open-source tools that allows users to process digital objects from ingest to access in compliance with the ISO-OAIS functional model.
archives
metadata
preservation
august 2011 by rybesh
One Small Step for Metadata Management in HathiTrust: California Digital Library
june 2011 by rybesh
@BerkeleyISchool grad Stephanie Collett is the technical lead for #HathiTrust's #metadata management system:
HathiTrust
metadata
from twitter
june 2011 by rybesh
INF 385T Description and Metadata, Spring 2011
june 2011 by rybesh
This course comprises a critical, comparative examination of the concept of description and its institutionalization, in the form of metadata standards, rules, and formats, in three primary contexts: libraries, archives, and museums. We will investigate the conceptual foundations and goals of description in each context and the structures (guidelines, technologies) that have been developed to facilitate institutional goals. We will also explore emerging challenges to traditional models of description, including new forms of complex, dynamic entities (for example, continually updated documents, such as blogs), new technologies for searching, browsing, and describing (such as user-generated descriptive tags), and new ways of viewing the descriptive enterprise (via critical theory, for example).
The course will not emphasize the practice of creating descriptive metadata in any current environment (that is, you will not learn how to create library catalog records or archival finding aids according to existing content and technical standards). Instead, the course will focus on understanding and interrogating the conceptual foundations of existing standards and guidelines for such descriptions. We will emphasize rigorous and spirited analysis of these descriptive paradigms and their continued utility in a rapidly changing information landscape.
metadata
description
INLS520
The course will not emphasize the practice of creating descriptive metadata in any current environment (that is, you will not learn how to create library catalog records or archival finding aids according to existing content and technical standards). Instead, the course will focus on understanding and interrogating the conceptual foundations of existing standards and guidelines for such descriptions. We will emphasize rigorous and spirited analysis of these descriptive paradigms and their continued utility in a rapidly changing information landscape.
june 2011 by rybesh
Weekly’s Spotlight | Agricultural Information Management Standards (AIMS)
june 2011 by rybesh
AIMS supports the implementation of structured and linked information and knowledge by fostering a community of practice centered on the themes of interoperability, reusability and cooperation. It shares vocabularies, methodologies, tools and services in order to promote coherence in agricultural information.
agriculture
metadata
linkeddata
vocabulary
june 2011 by rybesh
Vocabulary Mapping Framework (VMF) project
may 2011 by rybesh
The VMF matrix is a tool which can be used to automatically compute the "best fit" mappings between terms in controlled vocabularies in different metadata schemes and messages of interest to the educational, bibliographic and content publishing sectors.
metadata
vocabulary
mapping
may 2011 by rybesh
existing rel values · Microformats Wiki
may 2011 by rybesh
This page contains tables of known HTML rel values from specifications, formats, proposals, brainstorms, and non-trivial POSH usage in the wild. In addition, dropped and rejected values are listed at the end for comprehensiveness
html
hypertext
links
metadata
may 2011 by rybesh
Welcome to Acoustid! | Acoustid
may 2011 by rybesh
Acoustid is an open source project that aims to create a free database of audio fingerprints with mapping to the MusicBrainz metadata database and provide a web service for audio file identification using this database.
audio
database
metadata
identity
may 2011 by rybesh
Describing Linked Datasets with the VoID Vocabulary
march 2011 by rybesh
VoID is an RDF Schema vocabulary for expressing metadata about RDF datasets. It is intended as a bridge between the publishers and users of RDF data, with applications ranging from data discovery to cataloging and archiving of datasets. This document is a detailed guide to the VoID vocabulary. It describes how VoID can be used to express general metadata based on Dublin Core, access metadata, structural metadata, and links between datasets. It also provides deployment advice and discusses the discovery of VoID descriptions.
linkeddata
metadata
march 2011 by rybesh
DSPL: Dataset Publishing Language - Google Code
february 2011 by rybesh
DSPL is the Dataset Publishing Language, a representation language for the data and metadata of datasets. Datasets described in this format can be processed by Google and visualized in the Google Public Data Explorer.
data
metadata
google
standards
february 2011 by rybesh
MADS/RDF Documentation
november 2010 by rybesh
The MADS/RDF (Metadata Authority Description Schema in RDF) vocabulary, a data model for authority and vocabulary data used within the library and information science (LIS) community, which is inclusive of museums, archives, and other cultural institutions.
rdf
semweb
metadata
standards
authority
linkeddata
november 2010 by rybesh
Extend Swift | SiLCC
august 2010 by rybesh
SiLCC is a cloud based service for parsing text and extracting relevant keywords.
nlp
tools
tagging
metadata
api
august 2010 by rybesh
RDFa API
june 2010 by rybesh
It must be simple for Web developers to extract and utilize structured information from a Web document. This document details such a mechanism; an RDFa Document Object Model Application Programming Interface (RDFa DOM API) that allows simple extraction and usage of structured information from a Web document.
rdfa
linkeddata
web
metadata
api
standards
june 2010 by rybesh
Provenance Vocabulary Core Ontology Specification
may 2010 by rybesh
This vocabulary enables providers of Web data to publish provenance-related metadata about their data.
semweb
vocabulary
provenance
metadata
may 2010 by rybesh
Graph API - Facebook Developers
april 2010 by rybesh
The new Graph API attempts to drastically simplify the way developers read and write data to Facebook. It presents a simple, consistent view of the Facebook social graph, uniformly representing objects in the graph (e.g., people, photos, events, and fan pages) and the connections between them (e.g., friend relationships, shared content, and photo tags).
social
data
api
rest
webservices
facebook
metadata
april 2010 by rybesh
Ontology for Media Resource 1.0
october 2009 by rybesh
The ontology is supposed to foster the interoperability among various kinds of metadata formats currently used to describe media resources on the Web.
media
metadata
multimedia
annotation
ontology
vocabulary
october 2009 by rybesh
The GeoJSON Format Specification
july 2009 by rybesh
GeoJSON is a geospatial data interchange format based on JavaScript Object Notation (JSON).
json
javascript
gis
metadata
locative
data
july 2009 by rybesh
W3C Media Fragments Working Group
june 2009 by rybesh
The mission of the Media Fragments Working Group, part of the Video in the Web Activity, is to address temporal and spatial media fragments in the Web using Uniform Resource Identifiers (URI).
multimedia
annotation
web
metadata
standards
visualweb
june 2009 by rybesh
Interlinking Multimedia: How to Apply Linked Data Principles to Multimedia Fragments
june 2009 by rybesh
Talk given at the 2nd Linked Data on the Web workshop (LDOW) co located with WWW 2009, Madrid.
multimedia
metadata
annotation
linkeddata
semweb
standards
visualweb
june 2009 by rybesh
FBK - IT - TeV - semantic image labelling: vem
june 2009 by rybesh
Our goal is to understand what is being seen through the lens of a camera. Through the integration of diverse skills from many disciplines, ranging from cartographic projection, advanced 3D computer graphics and cutting-edge machine vision algorithms we hope to automatically align, generate and visualize geo-referenced meta-data.
semantics
image
computervision
locative
geocoding
metadata
photography
research
contentanalysis
3d
graphics
visualweb
june 2009 by rybesh
Foresite ORE Validator and Transformer
june 2009 by rybesh
This service will attempt to download a given ORE Resource Map, either from the aggregation URI (if it resolves correctly to a resource map) or from the resource map's URI.
bibliography
metadata
standards
tools
june 2009 by rybesh
Structured data (rich snippets) - Webmasters/Site owners Help
may 2009 by rybesh
What Google does with embedded metadata.
google
microformats
rdfa
markup
search
semweb
metadata
may 2009 by rybesh
LC discovers infinity
march 2009 by rybesh
If you were at ALA Midwinter in Denver (January, 2009) you may have been in one of the meetings where the Library of Congress announced its intention to atone for the lcsh.info fiasco. In case you missed that, Ed Summers of LC created an online version of the Library of Congress Subject Heading authority records, re-organized as a SKOS vocabulary and available for linking on the open Web. After being available for about six months (beginning in May of 2008), Ed was asked by his employer to take down the site on December 18, 2008. This was in spite of the fact that the data had been out there long enough to have a number of users, and that the removal broke existing systems that had developed around the data.[Note, lcsh.info has been re-born as http://lcsubjects.org/, hosted by Talis.]The outcry in the community was strong, including a reply to Ed's lcsh.info blog post by Sir Web himself, Tim Berners-Lee. Library of Congress must have been suitably embarassed.Thus the announcement at Midwinter that LC not only understands the value of linked open access to LCSH, but that all of the vocabularies managed by LC -- from the name authorities to the lists of document types, languages, locations, etc., -- need to be openly available in a format suitable for inclusion in Web services. LC has created a web site to host these vocabularies: id.loc.gov. On that site they say:Initially, within 6 to 8 weeks, the Library of Congress will release its first offering: the Library of Congress Subject Headings. This will be an almost verbatim re-release of the system and content once found at the popular prototype lcsh.info service.They also say:We aim to make resources available on this site within 6-8 weeks. Check this site regularly for more updates as we continue to develop this service!The page is dated 1/22/09. My calculations show that 9 weeks have passed. OK, that's only one week over their stated deadline. But nothing on the page has changed. No resources have been made available. An "almost verbatim" release of lcsh.info should not be too hard given that Ed had code written that he has made publicly available.But even today, the promised service is 6-8 weeks away. It may stay that way for a long time. Maybe even forever.Why does this matter? It matters because the availability of these vocabularies is essential for the library world to move forward. Some of us have been asking LC to put the vocabularies online in a machine-actionable format for a very long time. The Dublin Core community worked with LC to create a machine-actionable and URI-identified version of the MARC role terms as early as 2005. You can't find this linked from any of the MARC documentation. Some of us brought up the topic ad nauseum at MARBI meetings, but to no avail. Now LC seems to have "gotten it" conceptually but they have yet to show us that they can deliver.I may seem to be undeservedly impatient on this score, but it's not that we have been waiting for this for 9 weeks: we've been waiting for years. And quite honestly, this is not rocket science, nor does LC have no guidance for how to manage this data. In fact, they could use the NSDL Metadata Registry, or, if they insist on hosting this themselves, the Registry's source code is available. Quite frankly, if LC does not prove to us soon that it can perform this necessary function, I feel that we are quite justified in going forward without them, registering the vocabularies where they can be used and managed by anyone who needs them, and going forward with a transformation of library data that will meet 21st century needs.
vocabularies
metadata
RDF
from google
march 2009 by rybesh
voiD Guide - Using the Vocabulary of Interlinked Datasets | rdfs.org – Your Ontologies Are Here
february 2009 by rybesh
voiD is a vocabulary and a set of instructions that enables the discovery and usage of linked datasets.
semweb
metadata
database
rdf
sparql
vocabulary
linkeddata
howto
february 2009 by rybesh
Digital documentry editions assistance for scholars and publishers
january 2009 by rybesh
Documents Compass provides non-profit assistance to those who are engaged in or planning documentary editing projects in the electronic age.
metadata
tools
editing
webservices
digitalhumanities
documentary
january 2009 by rybesh
SRU/CQL Standardization in OASIS
january 2009 by rybesh
The premise behind dynamic bindings is that any search engine, even one that existed prior to development of the standard, need only to provide a dynamic binding - a self-description. It need make no other changes in order to be accessible. A client will be able to access any search engine that provides a description, if only it implements the capability to read and interpret the description and use it to formulate a request (including a query) and interpret the response.
metadata
search
standards
webservices
IR
january 2009 by rybesh
Search Web Services - The OASIS SWS Technical Committee Work: The Abstract Protocol Definition, OpenSearch Binding, and SRU/CQL 2.0
january 2009 by rybesh
The OASIS Search Web Services Technical Committee is developing search and retrieval web services, integrating various approaches under a unifying model, an Abstract Protocol Definition.
metadata
search
standards
webservices
IR
january 2009 by rybesh
[whatwg] Trying to work out the problems solved by RDFa
january 2009 by rybesh
It would seem important that the Web easily enable small-time users of data to efficiently communicate with one another, without the need to have one of the giants as an intermediary.
opinion
semweb
rdfa
metadata
architecture
search
web
webinfo
january 2009 by rybesh
Buzzword.org.uk Draft: RDF Extracted Attributes from Styled Elements
january 2009 by rybesh
CSS is an external file that specifies how your document should look; RDF-EASE is an external file that specifies what your document means.
semweb
rdfa
metadata
ideas
microformats
january 2009 by rybesh
OpenSim goes Hyper - The Hypergrid | VintFalken.com
december 2008 by rybesh
We might need a few more buttons on objects we allow for sale by when this becomes ‘common fashion’, don’t you think? Suggestions: ‘This may (not) be transferred on the hypergrid.’ ‘This is ‘all ages’ / ‘mature’. (And then hypergrid enabled sims could block incoming ‘mature’ items if they wished.
metadata
hypermedia
drm
virtualworlds
december 2008 by rybesh
SLurl: Location-Based Linking in Second Life
december 2008 by rybesh
SLurl provides direct teleport links to locations in Second Life. If you have Second Life installed, clicking on the map link will automatically teleport you to that location in-world.
reference
3d
metadata
architecture
web
place
secondlife
virtualworlds
identification
december 2008 by rybesh
JeromeDL - e-Library with Semantics - Home
september 2008 by rybesh
With JeromeDL's social and semantic services every library user can bookmark interesting books, articles or other materials in semantically annotated directories.
digital
library
tools
semweb
metadata
research
september 2008 by rybesh
GettingStartedDeployment - argot-hub - Google Code - How to deploy your metadata.
august 2008 by rybesh
Whilst there are many ways to publish metadata, we're going to focus on the use of RDFa, and how to either embed the information in web-pages, or provide a separate XML feed.
rdfa
datarss
howto
webservices
metadata
august 2008 by rybesh
Edition Production & Presentation Technology
august 2008 by rybesh
Package for image-based editing, including overlapping two images and encoding their differences; encoding script; encoding variant manuscripts; collating different manuscripts; gathering statistical data; searching all structural and non-structural data.
image
editing
digitization
tei
xml
metadata
digitalhumanities
august 2008 by rybesh
Visual Search Lab - Idée Inc.
july 2008 by rybesh
Narrow by image similarity or tags.
image
search
interface
contentanalysis
metadata
visualweb
july 2008 by rybesh
the map is here for you to use (tecznotes)
june 2008 by rybesh
Bottom-up mapping, driven by locals, who decide what's important enough to be mapped. Constrast it with the centralized, top-down, privacy-invading Google Maps/Streetview approach.
locative
maps
collaboration
authoring
metadata
local
june 2008 by rybesh
foresite-toolkit - Google Code
june 2008 by rybesh
Libraries for constructing, parsing, manipulating and serializing OAI-ORE Resource Maps.
opensource
library
api
python
rdf
atom
metadata
june 2008 by rybesh
DCMI Kernel Metadata Community: Kernel Metadata and Electronic Resource Citations (ERCs)
june 2008 by rybesh
The Kernel vocabulary, based on a subset of the Dublin Core (DC) metadata element set, aims to describe objects of any form or category, but its reach is limited to a small number of fundamental questions such as who, what, when, and where.
library
metadata
standards
4w
w4
dc
june 2008 by rybesh
Finding the Celtic Home
june 2008 by rybesh
Finding the Celtic is an experiment to create an online digital humanities collaboratory for Celtic Studies funded by the National Endowment for the Humanities.
humanities
ireland
celtic
history
culture
archaeology
metadata
neh2007
june 2008 by rybesh
Bibliographic Ontology Specification
june 2008 by rybesh
The Bibliographic Ontology Specification provides main concepts and properties for describing citations and bibliographic references (e.g. quotes, books, articles, etc.).
bibliography
library
metadata
ontology
semweb
june 2008 by rybesh
WorldCat Web service: xISBN [OCLC - WorldCat Affiliate tools]: xOCLCNUM
may 2008 by rybesh
Submit an OCLC number to this service, and it returns a list of related OCLC numbers and selected metadata.
library
cataloging
metadata
bibliography
webservices
identity
may 2008 by rybesh
A Semantic Multimedia Web: Create, Annotate, Present and Share your Media
april 2008 by rybesh
We consider the use of Semantic Web technologies for improving the multimedia user experience on the Web.
multimedia
semweb
annotation
metadata
search
editing
research
CWI
april 2008 by rybesh
hkit - Google Code
march 2008 by rybesh
hkit is a simple PHP5 tool for extracting common microformats from a page.
microformats
metadata
webservices
xml
json
php
march 2008 by rybesh
related tags
3d ⊕ 4w ⊕ academia ⊕ acm ⊕ advertising ⊕ agriculture ⊕ ai ⊕ ajax ⊕ analysis ⊕ annotation ⊕ AP170 ⊕ api ⊕ archaeology ⊕ architecture ⊕ archives ⊕ art ⊕ atom ⊕ audio ⊕ authoring ⊕ authority ⊕ automatic ⊕ berkeley ⊕ bias ⊕ bibliography ⊕ biography ⊕ blog ⊕ books ⊕ brands ⊕ business ⊕ campaign ⊕ cataloging ⊕ catalogs ⊕ categorization ⊕ celtic ⊕ cinema ⊕ citation ⊕ classification ⊕ code ⊕ collaboration ⊕ collectiveaction ⊕ color ⊕ comics ⊕ commercial ⊕ commons ⊕ communication ⊕ community ⊕ computervision ⊕ conference ⊕ consumer ⊕ contentanalysis ⊕ convergence ⊕ cooking ⊕ courses ⊕ crf ⊕ crowdsourcing ⊕ culturalheritage ⊕ culture ⊕ CWI ⊕ dam ⊕ data ⊕ database ⊕ datamining ⊕ datarss ⊕ dc ⊕ delivery ⊕ description ⊕ design ⊕ digital ⊕ digitalhumanities ⊕ digitization ⊕ discussion ⊕ documentary ⊕ documentation ⊕ documents ⊕ doi ⊕ drm ⊕ drupal ⊕ economics ⊕ editing ⊕ EIND ⊕ election ⊕ emacs ⊕ emotion ⊕ entertainment ⊕ facebook ⊕ fall2005 ⊕ fans ⊕ fiction ⊕ filesystem ⊕ film ⊕ filters ⊕ flash ⊕ folksonomy ⊕ food ⊕ frbr ⊕ future ⊕ games ⊕ geocoding ⊕ gis ⊕ google ⊕ government ⊕ graphics ⊕ grid ⊕ hashtags ⊕ HathiTrust ⊕ helsinki ⊕ history ⊕ howto ⊕ html ⊕ humanities ⊕ humor ⊕ hypermedia ⊕ hypertext ⊕ ideas ⊕ identification ⊕ identity ⊕ ideology ⊕ image ⊕ information ⊕ infoviz ⊕ inls520 ⊕ interface ⊕ intermediation ⊕ internet ⊕ IR ⊕ ireland ⊕ ischool ⊕ japan ⊕ java ⊕ javascript ⊕ journalism ⊕ json ⊕ korea ⊕ kos ⊕ kr ⊕ labs ⊕ lda ⊕ libraries ⊕ library ⊕ linguistics ⊕ linkeddata ⊕ links ⊕ literature ⊕ local ⊕ locative ⊕ lucene ⊕ lyrics ⊕ machinelearning ⊕ management ⊕ manifesto ⊕ mapping ⊕ maps ⊕ markup ⊕ media ⊕ memory ⊕ metadata ⊖ microdata ⊕ microformats ⊕ mobile ⊕ model ⊕ models ⊕ mp3 ⊕ mpeg-7 ⊕ mpeg-21 ⊕ msmdx ⊕ multimedia ⊕ multimodal ⊕ museum ⊕ music ⊕ names ⊕ narrative ⊕ neh2007 ⊕ networking ⊕ newmedia ⊕ news ⊕ nlp ⊕ nyc ⊕ ontology ⊕ opendata ⊕ opensource ⊕ opinion ⊕ oralhistory ⊕ organization ⊕ p2p ⊕ pdf ⊕ people ⊕ perl ⊕ personalization ⊕ philips ⊕ photography ⊕ php ⊕ place ⊕ platform ⊕ playlist ⊕ plda ⊕ plugin ⊕ politics ⊕ pornography ⊕ preservation ⊕ propaganda ⊕ provenance ⊕ python ⊕ qualitative ⊕ quicktime ⊕ quote ⊕ radio ⊕ rda ⊕ rdf ⊕ rdfa ⊕ recognition ⊕ recommendation ⊕ reference ⊕ reggae ⊕ relaxng ⊕ remix ⊕ research ⊕ rest ⊕ reviews ⊕ rhetoric ⊕ ruby ⊕ scala ⊕ science ⊕ search ⊕ secondlife ⊕ semantics ⊕ semiotics ⊕ semweb ⊕ sharing ⊕ shopping ⊕ skos ⊕ sms ⊕ social ⊕ socialscience ⊕ sociology ⊕ sociomaterial ⊕ SoI ⊕ spam ⊕ sparql ⊕ specialcollections ⊕ sports ⊕ spring2006 ⊕ SSMS2006 ⊕ standard ⊕ standards ⊕ statistics ⊕ storage ⊕ strategy ⊕ streaming ⊕ subtitle ⊕ svg ⊕ syndication ⊕ tagging ⊕ technology ⊕ tei ⊕ textmining ⊕ theory ⊕ thesaurus ⊕ timetags ⊕ tools ⊕ topicmodels ⊕ trust ⊕ tv ⊕ twitter ⊕ ubicomp ⊕ uk ⊕ unix ⊕ unmediated ⊕ urban ⊕ userresearch ⊕ video ⊕ virtualworlds ⊕ visualweb ⊕ vocabularies ⊕ vocabulary ⊕ voice ⊕ w4 ⊕ wales ⊕ web ⊕ web2.0 ⊕ webinfo ⊕ webservices ⊕ wiki ⊕ windows ⊕ wishlist ⊕ workshop ⊕ xml ⊕ xquery ⊕ xslt ⊕ yahoo ⊕ YRB ⊕ zonetag ⊕ zonetags ⊕Copy this bookmark: