jschneider + digitization 411
Where are the Novels?
25 days ago by jschneider
"The lion’s share of the private/for-profit scans are from the Corvey Collection which the publisher Gale appears to control. ""Based on the sample, we may guess that about 58%—somewhere between 47% and 68%—of the 2,903 novels have publicly accessible scans.1 For any given novel, however, the chance of finding a scan seems to depend on two things: (1) the novel’s year of publication and (2) the novel having subsequent editions or printings (see Figure 2).""Two results stand out. First, the 19th century British novel is a phenomenally well-preserved part of cultural history. Copies of nearly ever novel published during the period survive. Second, the proportion of novels scanned between 1800 and 1820 is low—likely around 33% based on this sample. This raises concerns about any claim of representativeness made on behalf of existing corpora covering those years. As libraries and private collections continue to be digitized and, I hope, be made publicly accessible, such concerns should diminish."
digitization
openaccess
publicdomain
corpora
25 days ago by jschneider
ePubs and quality
november 2011 by jschneider
"Unlike the other two, the NZETC version has accents, bold and italics in the right place. It' the only one with a workable and useful table of contents. It is also edition which has been extensively revised and expanded."
epub
ebooks
digitization
TEI
november 2011 by jschneider
Mediterranean Ceramics: Blogging my Digital Humanities 2011 Talk
july 2011 by jschneider
"The point of this post is not to go into great depth about what the pairing of these objects tells us about the role of materiality in the Late Roman/Byzantine Egypt. Instead, I want to stress that the opportunity to think about that issue with such relative ease arises from acts of independent self-digitization that exist within wider contexts of topically related efforts also engaging in self-digitization. That leads to an environment in which intellectual risk taking is rewarded."
materiality
digitalhumanities
digitization
july 2011 by jschneider
Digital rent-seeking – The Aporetic
may 2011 by jschneider
"“Pars International,” the outfit that handles permission for the NYT, wants to charge me $460 to use this image in my book. They call this an “image preparation fee.” Of course, if I found a copy of the NYT from 1910, I could do whatever I wanted with it, including reprint it, because it’s out of copyright."
digitization
nytimes
enclosure-of-the-commons
digital-heritage
pay-per-view
permission
copyright
may 2011 by jschneider
Book Saver
march 2011 by jschneider
via http://twitter.com/#!/jenelfarrell/status/52360893828300800
ebooks
digitization
scanners
hardware
march 2011 by jschneider
Journal liberation: A community enterprise « Everybody's Libraries
november 2010 by jschneider
"Consider the New England Journal of Medicine (NEJM), for example. It’s one of the most prominent journals in the world, valued both for its reports on groundbreaking new research, and for its documentation, in its back issues, of nearly 200 years of American medical history. Many other journals with lesser value still cannot be read without paying for a subscription, or visiting a research library that has paid for a subscription. But you can find and read most of NEJM’s content freely online, both past and present. Several groups of people made this possible. Here are some of them.
The journal’s publisher has for a number of years provided open access to all research articles more than 6 months old, from 1993 onward. (Articles less than 6 months old are also freely available to readers in certain developing countries, and in some cases for readers elsewhere as well.) A registration requirement was dropped in 2007.
Funders of medical research, such as the National Institutes of Health, the Wellcome Trust, and the Howard Hughes Medical Institute, have encouraged publishers in the medical field to maintain or adopt such open access policies, by requiring their grantees (who publish many of the articles in journals like the NEJM) to make their articles openly accessible within months of publication. Some of these funders also maintain their own repositories of scholarly articles that have appeared in NEJM and similar journals.
Google Books has digitized most of the back run of the NEJM and its predecessor publications as part of its Google Books database. Many of these volumes are freely accessible to the public. This is not the only digital archive of this material; there’s also one on NEJM’s own website, but access there requires either a subscription or a $15 payment per article. Google’s scans, unlike the ones on the NEJM website, include the advertisements that appeared along with the articles. These ads document important aspects of medical history that are not as easily seen in the articles, on subjects ranging from the evolving requirements and curricula of 19th-century medical schools to the early 20th-century marketing of heroin for patients as young as 3 years old.
It’s one thing to scan journal volumes, though; it’s another to make them easy to find and use– which is why NEJM’s for-pay archive got a fair bit of publicity when it was released this summer, while Google’s scans went largely unnoticed. As I’ve noted before, it can be extremely difficult to find all of the volumes of a multi-volume work in Google Books; and it’s even more difficult in the case of NEJM, since issues prior to 1928 were published under different journal titles. Fortunately, many of the libraries that supplied volumes for Google’s scanners have also organized links to the scanned volumes, making it easier to track down specific volumes. The Harvard Libraries, for instance, have a chronologically ordered list of links to most of the volumes of the journal from 1828 to 1922, a period when it was known as the Boston Medical and Surgical Journal."
openaccess
NEJM
digitization
The journal’s publisher has for a number of years provided open access to all research articles more than 6 months old, from 1993 onward. (Articles less than 6 months old are also freely available to readers in certain developing countries, and in some cases for readers elsewhere as well.) A registration requirement was dropped in 2007.
Funders of medical research, such as the National Institutes of Health, the Wellcome Trust, and the Howard Hughes Medical Institute, have encouraged publishers in the medical field to maintain or adopt such open access policies, by requiring their grantees (who publish many of the articles in journals like the NEJM) to make their articles openly accessible within months of publication. Some of these funders also maintain their own repositories of scholarly articles that have appeared in NEJM and similar journals.
Google Books has digitized most of the back run of the NEJM and its predecessor publications as part of its Google Books database. Many of these volumes are freely accessible to the public. This is not the only digital archive of this material; there’s also one on NEJM’s own website, but access there requires either a subscription or a $15 payment per article. Google’s scans, unlike the ones on the NEJM website, include the advertisements that appeared along with the articles. These ads document important aspects of medical history that are not as easily seen in the articles, on subjects ranging from the evolving requirements and curricula of 19th-century medical schools to the early 20th-century marketing of heroin for patients as young as 3 years old.
It’s one thing to scan journal volumes, though; it’s another to make them easy to find and use– which is why NEJM’s for-pay archive got a fair bit of publicity when it was released this summer, while Google’s scans went largely unnoticed. As I’ve noted before, it can be extremely difficult to find all of the volumes of a multi-volume work in Google Books; and it’s even more difficult in the case of NEJM, since issues prior to 1928 were published under different journal titles. Fortunately, many of the libraries that supplied volumes for Google’s scanners have also organized links to the scanned volumes, making it easier to track down specific volumes. The Harvard Libraries, for instance, have a chronologically ordered list of links to most of the volumes of the journal from 1828 to 1922, a period when it was known as the Boston Medical and Surgical Journal."
november 2010 by jschneider
Go To Hellman: Internet Archive Sets Fair-Use Bait With Open Library Lending
august 2010 by jschneider
"The fact that at least one author was asked for permission suggests that the Archive is being very careful about what it chooses to make available through the lending program. A look at the 187 items in the lending library supports this view. There are
Works by well-known copyright reform advocates such as Brand and Lawrence Lessig.
Obsolete computer books. Example: Microsoft Windows 98 at a glance
Older books likely to be orphan works: example- Alice James: her brothers~her journal, the posthumously published diary of Alice James.
An in-copyright version of an out-of-copyright work: Brace Lineage, a genealogical work apparently based on an older version.
Other genealogic works which might be considered to be data compilations. Example: Twelve generations in America
Spanish language works published in Guatemala. Example: Regimenes agrarios
A collection of stories by anonymous pregnant teens: You look too young to be a mom: teen moms speak out on love, learning, and success
In short, if you wanted to take legal action to stop the digital lending library, each of the books included in the lending library would pose some sort of problem for you.
"
archive.org
IP
digital-lending
digitization
copyright
fair-use
orphanworks
Works by well-known copyright reform advocates such as Brand and Lawrence Lessig.
Obsolete computer books. Example: Microsoft Windows 98 at a glance
Older books likely to be orphan works: example- Alice James: her brothers~her journal, the posthumously published diary of Alice James.
An in-copyright version of an out-of-copyright work: Brace Lineage, a genealogical work apparently based on an older version.
Other genealogic works which might be considered to be data compilations. Example: Twelve generations in America
Spanish language works published in Guatemala. Example: Regimenes agrarios
A collection of stories by anonymous pregnant teens: You look too young to be a mom: teen moms speak out on love, learning, and success
In short, if you wanted to take legal action to stop the digital lending library, each of the books included in the lending library would pose some sort of problem for you.
"
august 2010 by jschneider
We're not going anywhere .... OK, we lied ... - Lorcan Dempsey's Weblog
june 2010 by jschneider
"Most of my library time, though, went to wandering the long, narrow corridors of the stacks, Despite being surrounded by tens of thousands of books, I don't remember feeling the anxiety that's symptomatic of what we today call "information overload." There was something calming in the reticence of all those books, their willingness to wait years, decades even, for the right reader to come along and pull them from their appointed slots. Take your time the books whispered to me in their dusty voices. We're not going anywhere. [The Shallows, p.12]
It seems they lied ;-) While many of the books will remain on the shelves for some years to come, it is clear that the exodus has begun. The books are leaving the library."
Lorcan
Dempsey
digitization
off-site-storage
It seems they lied ;-) While many of the books will remain on the shelves for some years to come, it is clear that the exodus has begun. The books are leaving the library."
june 2010 by jschneider
Aura - Lorcan Dempsey's Weblog
may 2010 by jschneider
"Afterwards, I was reminded of Walter Benjamin's celebrated - if exasperating - essay on The Work of Art in an Age of Mechanical Reproduction in which he tried to figure out what happens to authenticity in an age of perfect copying. "Even the most perfect reproduction of a work of art," he wrote, "is lacking in one element: its presence in time and space, its unique existence at the place where it happens to be." This "aura" that emanates from original works - which is, Benjamin argued, stripped away by reproduction - is hard to pin down (and Benjamin doesn't really nail it, in my opinion) but, believe me, you know it when you feel it. And not even Google can digitise it. [For a moment, I was in the presence of Newton's genius]"
digitization
reproduction
perfect-copies
risks-of-perfect-copies
may 2010 by jschneider
The Shifted Librarian » Broken Boxes
may 2010 by jschneider
"Ted pointed me to in my own account, the page that listed every highlight and every note that I had taken on my Kindle version of John Seely Brown’s new book Pull, I could only think two words:
Game. Changer.
All of a sudden, by reading the book electronically as opposed to in print, I now have:
* all of the most relevant, thought-provoking passages from the book listed on one web page, as in my own condensed version of just the best pieces
* all of my notes and reflections attached to those individual notes
* the ability to copy and paste all of those notes and highlights into Evernote which makes them searchable, editable, organizable, connectable and remixable
* the ability to access my book notes and highlights from anywhere I have an Internet connection.""One of Lanier’s concerns is how decisions made in the design of our digital tools lock us in to behaviors that reduce — and even remove — our humanity. For the ebook context, an alternative title for his book could have been “You Are Not an App.”"
annotation
kindle
commonplace-books
freedom
lock-in
digitization
Game. Changer.
All of a sudden, by reading the book electronically as opposed to in print, I now have:
* all of the most relevant, thought-provoking passages from the book listed on one web page, as in my own condensed version of just the best pieces
* all of my notes and reflections attached to those individual notes
* the ability to copy and paste all of those notes and highlights into Evernote which makes them searchable, editable, organizable, connectable and remixable
* the ability to access my book notes and highlights from anywhere I have an Internet connection.""One of Lanier’s concerns is how decisions made in the design of our digital tools lock us in to behaviors that reduce — and even remove — our humanity. For the ebook context, an alternative title for his book could have been “You Are Not an App.”"
may 2010 by jschneider
Scriptio Continua: Object Artefact Script
january 2010 by jschneider
"Digital representations may improve (or even make possible) the reading of difficult texts, such as the Vindolanda tablets or the Archimedes Palimpsest, so for purposes of interpretation, they may be superior to the physical reality. They can combine data, metadata, and other contextual information in ways that help a reader to work with documents. But they cannot satisfactorily replicate the physicality of the document, and it may be a bit dishonest to try." See also http://www.nesc.ac.uk/action/esi/contribution.cfm?Title=1014 http://www.nesc.ac.uk/esi/events/1014/ and http://giacometti.tumblr.com/post/213059488/object-artefact-script
hieroglyphics
digitization
tablets
codicology
from delicious
january 2010 by jschneider
The Book Mechanic - The Chronicle Review - The Chronicle of Higher Education
december 2009 by jschneider
"What is too often forgotten is that the book itself is a remarkable technology, easily one of the most socially significant in human history. Amazingly, books are still among our most reliable tools for preserving information. They never give you an error message or fail to open because of incompatibility with newer technologies. If you want your descendants to read your memoirs, you're smarter to leave them a book than a disk. Both the euphoric embrace of digital technologies and the haze of nostalgia for the book have obscured its technological nature, propagating the notion that the meeting between the book and the computer pits the pretechnological against the technological. Any triumph for one is viewed as a loss for the other. Without denying that digitization poses serious new challenges for the book, one can argue that that perspective is deeply flawed."
Chronicle
Terry
Belanger
rare-book-school
bookhistory/bookfutures
digitization
december 2009 by jschneider
words :: Funny thing…
november 2009 by jschneider
"No one in this cycle—not the researchers in the fiefdom, not the institution, not the grant agency, no one—takes responsibility for the post-grant existence of anything the fiefdom produces."
sustainability
digital-projects
digitization
november 2009 by jschneider
The Other Online Learning « Machine Learning (Theory)
november 2009 by jschneider
"For classes commonly taught through high school, it’s difficult to imagine how good a learning experience could be after millions of hours spent refining to create the perfect approach. Imagine repeating a lecture over-and-over, testing the resulting student understanding a {day, week, month, year, decade} later to such an extent that every slide, every sentence, and every exercise is optimized for excellent learning. We could even imagine adapting the lecture to the learning style of each student.
The process of converting to the second approach has been slow, but it seems to be picking up."
online-learning
machine-learning
e-learning
digitization
The process of converting to the second approach has been slow, but it seems to be picking up."
november 2009 by jschneider
Promoting access to the best literature of the past « Everybody’s Libraries
november 2009 by jschneider
"Current scholarship is not spontaneously generated from the brain or lab of the writer. Useful scholarship must understand and interpret past work, to be effective in the present. In many fields, and not just the classical humanities, the relevant past work may stretch back hundreds or even thousands of years. Current scholarship and study will be more effective if its source material is also made openly accessible, and if proper attention is drawn to the most useful sources. And now is an especially opportune time for scholars of all sorts, professional and amateur, to get involved in the process.""Google’s PageRank algorithm may take advantage of implicit curation of web pages (through the choices of authors’ page links), but Google and other aggregators have had a much harder time drawing attention to the most useful books, scholarly articles, or other works created without built-in hyperlinks."
openaccess
digitization
curation
metadata
november 2009 by jschneider
Open Libraries – Evolution not Revolution
october 2009 by jschneider
"In A History of Online Information Sources, 1963-1974, Charles P. Bourne and Trudi Bellardo Hahn assert, “The serious application of computers to document reference retrieval began in the late 1950s, with slow serial searches of small files of bibliographic records on magnetic tapes. A precursor of effective, large-scale online information retrieval (IR) systems was an experiment in searching bibliographic records on an IBM disk memory system called RAMAC (Random Access Method of Accounting and Control). …The genesis of online retrieval systems can be traced to the first half of the 1960s.” What we see as nearly comprehensive coverage of journals online is a product of over forty years of experimentation, from metadata to full-text delivery."""Anarchy is not necessarily chaotic and dangerous. It is organization through disorganization—anarchist tactics generally involve uncoordinated actions toward a coordinated goal.""books ...transcends...$"
libraries
history
digitization
recommended
october 2009 by jschneider
Go To Hellman: Rehashing the Copyright Salami
october 2009 by jschneider
"I recognize however, that there are lawyers willing to argue that any index is inherently a derivative work and that rights holders should be able even to control the indexing of their work. It's clear from looking at cases such as The Harry Potter Lexicon case and Seinfeld Aptitude Test case that judges use a variety of tests to determine if a work is indeed a collection of facts. In the latter case, infringement was established because the "facts" were fictional. Because judges look at a variety of factors, we can't remove copyright law from the picture just because there's no copying. Still, the judge in the Harry Potter Lexicon case was pretty clear that an index per se is a transformative and thus allowed use of the work""
copyright
digitization
bacon
october 2009 by jschneider
The Secret Of Google's Book Scanning Machine Revealed - As A Matter Of Fact Blog : NPR
october 2009 by jschneider
"Google created some seriously nifty infrared camera technology that detects the three-dimensional shape and angle of book pages when the book is placed in the scanner. This information is transmitted to the OCR software, which adjusts for the distortions and allows the OCR software to read text more accurately."
NPR
googlebooks
digitization
october 2009 by jschneider
Blown to Bits » Blog Archive » Do It Yourself Book Scanning
october 2009 by jschneider
"Or make them publicly available. I said “can,” not “may” or “should.” But the existence of the device has the potential to raise lots of the same kinds of questions those other duplicating technologies raised. It empowers individuals, and enough empowered individuals could produce a Wikipedian digital library, collectively assembled, imperfect and incomplete, but growing and expanding."
digitization
DIY-scanner
googlebooks
october 2009 by jschneider
The Millions: Bringing Book Scanning Home
october 2009 by jschneider
"dan reetz & his diy book scanner
It’s pure 21st-century ingenuity. Reetz designed his first book scanner because, as a grad student at North Dakota State, he was appalled by textbook prices. Then he built it, in two days, from old digital cameras, cardboard, and scrap parts; a friend wrote the page-processing software.""I think the Reetz-Clancy continuum augurs good things for the future of books. On one end, the recognition that books have to live online now, and that publishing has to operate at internet scale. On the other, the passion for (obsession with?) independence and the cottage-industry craftiness that’s been the best part of book publishing for so long already."
digitization
DIY-scanner
googlebooks
It’s pure 21st-century ingenuity. Reetz designed his first book scanner because, as a grad student at North Dakota State, he was appalled by textbook prices. Then he built it, in two days, from old digital cameras, cardboard, and scrap parts; a friend wrote the page-processing software.""I think the Reetz-Clancy continuum augurs good things for the future of books. On one end, the recognition that books have to live online now, and that publishing has to operate at internet scale. On the other, the passion for (obsession with?) independence and the cottage-industry craftiness that’s been the best part of book publishing for so long already."
october 2009 by jschneider
Go To Hellman: The Revolution Will Be Digitized (By Cheap Book Scanners)
october 2009 by jschneider
"Reetz is a tinkerer and a liberator of information. He spent some time in Russia and became accustomed to the conveniences of digital books in a society that doesn't pay much attention to copyright laws...went dumpster diving for materials, then posted instructions for how to make the scanner online.""let's assume that an effective book digitizer can be built and deployed for $500...Then the cost of putting a book scanner in 20,000 libraries would be $10,000,000. If these libraries digitized an average of even one book per day, they could digitize 10,000,000 books in two years. Since 10 books per day should be well within the capabilities of an inexpensive digitizer, the libraries should have no technical difficulties with digitizing 4 million books per month.
If libraries acquired the capability of digitizing millions of books per month, then Google's erstwhile monopoly on digitized out-of-print books could evaporate quickly in an appropriate legal environment."
Eric
Hellman
digitization
googlebooks
opensource
accessibility
recommended
If libraries acquired the capability of digitizing millions of books per month, then Google's erstwhile monopoly on digitized out-of-print books could evaporate quickly in an appropriate legal environment."
october 2009 by jschneider
Digital Search I: Google Poisons the Well « Easily Distracted
october 2009 by jschneider
"The fear was always that Google would try to grab hold of the “orphan works” in large research libraries once they were digitized and sell those back to research institutions on an exclusive basis, to become the king vendor atop the mountain of digital databases. Well, once the settlement took on concrete shape, that turned out to be exactly where the company was heading.
I was initially welcoming to Google’s initiative because I believe that digitization is crucial for the improved dissemination of knowledge. ...Brin disingenuously suggests that out-of-print work is available now only to those who can afford to hop on a plane and fly to a library which holds such work. There’s a very small class of materials about which this is true: rare books, archival holdings and the like. Which are not the materials being digitized at the moment. Otherwise, ...inter-library loan or in more regional equivalents. The books may have to fly on a plane, but not the researchers.
google
digitization
orphan-works
I was initially welcoming to Google’s initiative because I believe that digitization is crucial for the improved dissemination of knowledge. ...Brin disingenuously suggests that out-of-print work is available now only to those who can afford to hop on a plane and fly to a library which holds such work. There’s a very small class of materials about which this is true: rare books, archival holdings and the like. Which are not the materials being digitized at the moment. Otherwise, ...inter-library loan or in more regional equivalents. The books may have to fly on a plane, but not the researchers.
october 2009 by jschneider
Journal Print-E mix: one, the other, both, or neither? : Christina's LIS Rant
september 2009 by jschneider
"Turns out that the entire point of the article was to show two color graphics on the second page. Well this journal is widely held, but we weren't sure who could actually do a color scan. You see, AGU scanned these articles from that time period in black and white."
" * black and white when the original is color
* missing articles or pages out of articles or even issues
* fuzzy or illegible scans
* omitted front matter, ads, editorials, etc.
Some of this is planning and reflects the evolution of the whole process. Older scans were just of the research articles. Most publishers have doubled back and now go from cover to cover. I always find it amazing that publishers don't have complete runs of their publications. They seriously don't....On the other hand, one of the aggregators does everything in really crappy color so even articles originally in black and white aren't very readable."
print
journals
PoD
digitization
reformatting
disaster-planning
digital-archiving
Christina
" * black and white when the original is color
* missing articles or pages out of articles or even issues
* fuzzy or illegible scans
* omitted front matter, ads, editorials, etc.
Some of this is planning and reflects the evolution of the whole process. Older scans were just of the research articles. Most publishers have doubled back and now go from cover to cover. I always find it amazing that publishers don't have complete runs of their publications. They seriously don't....On the other hand, one of the aggregators does everything in really crappy color so even articles originally in black and white aren't very readable."
september 2009 by jschneider
Google Book Search: why it matters - Times Online
september 2009 by jschneider
"Several European nations, including France and Germany, have expressed concern that the proposed settlement gives Google a monopoly in content. Since the settlement was the result of a class action against Google, it applies only to Google. Other companies would not be free to digitise books under the same terms. "
googlebooks
copyright
EU
digitization
orphaned-works
film
september 2009 by jschneider
Language Log » Google Books: A Metadata Train Wreck
august 2009 by jschneider
"Most of the misdatings are pretty obviously the result of an effort to automate the extraction of pub dates from the OCR'd text.""The BISAC scheme is well suited to organizing the shelves of a modern 35,000 foot chain bookstore or a small public library where ordinary consumers or patrons are browsing for books on the shelves. But it's not particularly helpful you're flying blind in a library with several million titles, including scholarly works, foreign works, and vast quantities of books from earlier periods."
googleBooks
metadata
Digitization
recommended
OCR
automation
BISAC
august 2009 by jschneider
Papers Past
june 2009 by jschneider
"Papers Past contains more than one million pages of digitised New Zealand newspapers and periodicals. The collection covers the years 1839 to 1932 and includes 52 publications from all regions of New Zealand. "
Digitization
NewZealand
newspapers
code4lib
june 2009 by jschneider
NASA Wants Your Ideas for Digitizing Rocket Scientist’s Notes | Wired Science | Wired.com
june 2009 by jschneider
"NASA is taking the rare step of reaching out to the public for help. The space agency is looking for the best way to analyze and electronically catalog a precious collection of notes that chronicle the early history of the human space flight program."
NASA
code4lib
Digitization
Wired
june 2009 by jschneider
Stomping on Yeti: Incentivizing the Kindle
june 2009 by jschneider
"If I could buy a Kindle and it would come with free/low-cost (.$25-$1.00) electronic copies off all of the books I've bought off Amazon in the past 6 months/1 year/since Kindle go-live, I would definitely reconsider. Then you are getting about as close as you can get to an eBook iPod, short of including CDs inside every cover."
kindle
business-models
digitization
june 2009 by jschneider
Web exclusive: 'Madame Bovary goes interactive' by Brigid Grauman | Prospect Magazine May 2009 issue 158
june 2009 by jschneider
A digitization success story. "www.bovary.fr and the more scholarly flaubert.univ-rouen.fr/are fascinating for the insights they offer into the craft of fiction. Flaubert was a compulsive but felicitous rewriter, sometimes producing 20 versions of the same page. He condensed, cut transitions, removed metaphors ...hunted down word repetition, and was weary of direct speech because he found it overly theatrical. He went on changing things until the final typesetter’s proof. He wrote 52 versions of one of the novel’s most famous episodes, the passage in which Emma is heady with the excitement of her new love affair with country aristocrat Rodolphe.""Transcribed by professionals as well as amateurs, safeguards were set up to minimise the risk of error. Texts were read through several times by experts like Sibires, and corrections are still being introduced to this day. Gerard jots down things she spots, and goes on line later to fix them. “At this stage, it’s mostly comas and accents,”"
drafts
archives
digitization
june 2009 by jschneider
Decapod — Showcase
may 2009 by jschneider
"Decapod provides the facilities to tackle material such as small journals, original source materials in the humanities and social sciences, non-journal periodicals, and more." See also http://www.decapodproject.org
digitization
may 2009 by jschneider
Coyle's InFormation: Walt Crawford should read the document
may 2009 by jschneider
"there is general agreement that Google gets a monopoly... at least on out-of-print books, which is the vast majority of books in libraries. (Not on public domain books, which is what the OCA digitizes, but anyone can digitize public domain books.) So although the libraries and their contents will still be there, and can be used in hard copy as they are today, no one but Google can digitize the in-copyright works without incurring liability. So "monopoly on online version of their contents" is a factual statement, if you understand that public domain is public domain."
Walt
Crawford
google
digitization
Karen
Coyle
fairuse
may 2009 by jschneider
Next Age of Discovery - WSJ.com
may 2009 by jschneider
"Improved technology is allowing researchers to scan ancient texts that were once unreadable -- blackened in fires or by chemical erosion, painted over or simply too fragile to unroll. Now, scholars are studying these works with X-ray fluorescence, multispectral imaging used by NASA to photograph Mars and CAT scans used by medical technicians.""By taking high-resolution digital images in 14 different light wavelengths, ranging from infrared to ultraviolet, Oxford scholars are reading bits of papyrus that were discovered in 1898 in an ancient garbage dump in central Egypt.""While conservationists are quick to stress that pixels and bytes can never replace priceless physical artifacts, many see digitization as a vital tool for increasing public access to rare items, while at the same time creating a disaster-proof record and perhaps unearthing new information.""... decode ancient manuscripts with multispectral imaging will begin at the UMichigan, Berkeley,Columbia...400 papyrus pieces."
WSJ
digitization
papyrus
multi-spectral-imaging
may 2009 by jschneider
Bedford-style Book of Hours
april 2009 by jschneider
"This is a 15th century parchment Book of Hours manuscript produced in France in the style of the workshop/scriptorium of the Master of the Duke of Bedford. The manuscript (Huntington HM 1100) can be viewed in two webpages of thumbnails, hosted by the Digital Scriptorium. "" ¶There's an interesting politico-cultural thread for exegesis in comparing the relatively free and open digitisation attitude in the States -vs- the seemingly grudging, less-is-best and user pays mentality of UK institutions (yes, a great big generalisation, I know). You would think, on first blush, given their respective capitalism -vs- socialism tendencies (and that's another one), that it would be the other way around."
manuscripts
hours
bibliodyssey
digitization
april 2009 by jschneider
Luggage home
april 2009 by jschneider
" I COLLECT LOST LUGGAGE, PHOTOGRAPH IT, AND THEN TRY TO FIND THE OWNERS. IT’S A LITTLE ODD BUT NOT AS ODD AS STAMP COLLECTING, JUST A LITTLE HARDER TO FIND STORAGE SPACE. "
art
digitization
luggage
culture
april 2009 by jschneider
DOE documents available for adoption » Federated Search Blog
april 2009 by jschneider
"How much does it cost to sponsor digitization and broad availability of a technical report? $60.00 – approximately the same cost as ordering a hard copy. This is payable by check or Visa, MasterCard, Discover, Bravo or Private Issue credit cards."
federated-search
OSTI
digitization
cool
april 2009 by jschneider
Blind Spots - ChronicleReview.com
april 2009 by jschneider
"Libraries have long been characterized as the heart of the research university, the center for scholarship. No one disputes that the aggregated sum of information and material available online, even primary material, has shifted the library's identity. Scholars no longer only visit archives in person; they access library material with immediacy (and increased mediation) online.""The design of new environments for performing scholarly work cannot be left to the technical staff and to library professionals.
Chronicle
digitization
scholarship
lotf
libraries
scholarly-communication
Stanford
digitahumanities
Johanna
drucker
april 2009 by jschneider
Google, Robert Darnton, and the Digital Republic of Letters | Au Courant
march 2009 by jschneider
"over the last decade and more, public policy has been consistently worse than useless in helping to make most of the works of the 20th century searchable and usable in digital form. This is the alternative against which we should evaluate Google Book Search and Google’s settlement with publishers and authors."
Paul
Courant
LC
copyright
digitization
mass-digitization
publicpolicy
DMCA
march 2009 by jschneider
The Google Books settlement: A symposium, and a call for library action « Everybody’s Libraries
march 2009 by jschneider
"Like Peter Hirtle, I didn’t hear many people saying directly that the settlement should not go forward (though I did hear that from a few people who were there but not on the stage). I did hear repeated concerns over how the deal came about, and how it made substantial changes to the contours of copyright through class action litigation rather than legislation; and I also heard substantial concerns over the monopolies given to Google and the Book Rights Registry, and how those monopolies might stifle innovation and competition, drive up prices for online access to books, and infringe on rights of readers. (Despite its name, the Book Rights Registry is not simply a passive store of information; it also actively negotiates terms and prices of online book uses on behalf of rightsholders, and manages the distribution of revenues from these uses.)"
googlebooks
privacy
settlement
preservation
orphanworks
digitization
march 2009 by jschneider
Inside Google Book Search: The Bodleian's treasures, available to all
march 2009 by jschneider
"In 2004, Google began a partnership with Oxford University Library to scan mostly 19th century public domain books from its Bodleian library. Five years on, we're delighted to announce the end of this phase of our scanning with Oxford, our first European partner."
publicdomain
digitization
googlebooksearch
march 2009 by jschneider
DX coding on film cassettes
march 2009 by jschneider
"One interesting question that I get asked from time to time is how to manually encode DX information on film cansisters."
film
history
digitization
standards
march 2009 by jschneider
related tags
"1984" ⊕ "bibliocaust" ⊕ "data-services" ⊕ "robotic-scribe" ⊕ "social-life-of-information" ⊕ #esbook ⊕ *** ⊕ **** ⊕ ***** ⊕ ****** ⊕ 2D ⊕ 3D-scanners ⊕ 19th-century ⊕ 20th-century ⊕ AAP ⊕ AAUP ⊕ abstract-models ⊕ academia ⊕ academic ⊕ access ⊕ access-vs.-preservation ⊕ access2007 ⊕ accessibility ⊕ accuracy ⊕ Acharya ⊕ Acrobat ⊕ adactio ⊕ Adams ⊕ Adobe ⊕ advertising ⊕ advice ⊕ aerospace ⊕ affordances ⊕ afghanistan ⊕ age ⊕ air-traffic-control ⊕ ALA ⊕ ala2008 ⊕ Alberta ⊕ Aldous ⊕ alphabetization ⊕ amazon ⊕ Amazoogle ⊕ amerind ⊕ Amherst ⊕ analog-to-digital ⊕ analogies ⊕ Anarcticia ⊕ anatomy ⊕ Andrew ⊕ annotation ⊕ annotations ⊕ ants ⊕ Anurag ⊕ api ⊕ APIs ⊕ application-profiles ⊕ applications ⊕ archive ⊕ archive.org ⊕ archives ⊕ archiveteam ⊕ archiving ⊕ archivists ⊕ archon ⊕ ARL ⊕ art ⊕ arthistory ⊕ artifacts ⊕ audio ⊕ Australia ⊕ authorities ⊕ authority ⊕ authors ⊕ automation ⊕ bacon ⊕ Bade ⊕ barcodes ⊕ Belanger ⊕ best-practices ⊕ bestpractices ⊕ bibliodyssey ⊕ bibliographers ⊕ bibliographic-control ⊕ bibliographic-data ⊕ bibliographies ⊕ biodiversity ⊕ BISAC ⊕ Bisson ⊕ blacklight ⊕ Blake ⊕ blogging ⊕ blogs ⊕ BMJ ⊕ BNF ⊕ BoingBoing ⊕ book ⊕ book-collecting ⊕ book-digitization-projects ⊕ bookhistory/bookfutures ⊕ books ⊕ books-as-artifacts ⊕ books-toread ⊕ booksearch ⊕ bookstoread ⊕ bookstores ⊕ Booksurge ⊕ BookSurge ⊕ Boston ⊕ botany ⊕ bpl ⊕ Brantley ⊕ Brantly ⊕ brewster ⊕ Brian ⊕ britannica-vs-wikipedia ⊕ BritishLibrary ⊕ broadsides ⊕ browsing ⊕ Bryn ⊕ burining ⊕ Burroughs ⊕ business ⊕ business-models ⊕ c4l09 ⊕ Cadbury ⊕ california ⊕ cameras ⊕ canada ⊕ CAPTCHA ⊕ captchas ⊕ card-catalogs ⊕ card-catalogues ⊕ Casey ⊕ catalog ⊕ cataloging ⊕ catalogs ⊕ ccs ⊕ CDI ⊕ CDL ⊕ CDROM ⊕ centers ⊕ centralization ⊕ chained-libraries ⊕ change ⊕ CHE ⊕ chemistry ⊕ children's ⊕ chocolate ⊕ Christina ⊕ chronicle ⊕ Chudnov ⊕ church-of-reading ⊕ CIC ⊕ CIPAC ⊕ citations ⊕ classification ⊕ Cliff ⊕ Clifford ⊕ climate ⊕ CLIR ⊕ CLOCKSS ⊕ closed-stacks ⊕ CNI ⊕ code4lib ⊕ codicology ⊕ Cohen ⊕ collaboration ⊕ collaboration" ⊕ collation ⊕ collection ⊕ collection-development ⊕ collections ⊕ colleges ⊕ Colorado ⊕ commentary ⊕ commercialization ⊕ commonplace-books ⊕ communication ⊕ communities ⊕ companies ⊕ compression ⊕ concerns ⊕ conference ⊕ conferences ⊕ consortia ⊕ content ⊕ contracts ⊕ controlled-vocabulary ⊕ cool ⊕ cooperation ⊕ cooperative-cataloging ⊕ cooperative-collection-development ⊕ Copernicus ⊕ copy&paste ⊕ copyright ⊕ Cornell ⊕ corpora ⊕ corrections ⊕ COUNTER ⊕ Courant ⊕ coyle ⊕ Crane ⊕ Crawford ⊕ criticism ⊕ cropping ⊕ crosswalking ⊕ crowdsourcing ⊕ ct ⊕ culture ⊕ curation ⊕ curl-up-factor ⊕ cyberinfrastructure ⊕ DailyShow ⊕ Daly ⊕ Dan ⊕ Darnton ⊕ data ⊕ databases ⊕ datamining ⊕ David ⊕ daVinci ⊕ DC ⊕ DCMI ⊕ DDI ⊕ democracy ⊕ Dempsey ⊕ deskewing ⊕ development ⊕ DevonThink ⊕ Dewey ⊕ Diane ⊕ digitahumanities ⊕ digital ⊕ digital-archiving ⊕ digital-collections ⊕ digital-divide ⊕ digital-heritage ⊕ digital-historian ⊕ digital-humanities ⊕ digital-lending ⊕ digital-libraries ⊕ digital-preservation ⊕ digital-primary-sources ⊕ digital-projects ⊕ digitalhistory ⊕ digitalhumanities ⊕ digitalimages ⊕ digitallibraries ⊕ digitalpreservation ⊕ digital_scholarship ⊕ digitization ⊖ digitzation ⊕ disaster-planning ⊕ discovery ⊕ dissemination ⊕ DIY ⊕ DIY-scanner ⊕ DLF ⊕ DLIB ⊕ DMCA ⊕ document-delivery ⊕ documents ⊕ domain ⊕ downloading ⊕ drafts ⊕ DRM ⊕ drucker ⊕ DSpace ⊕ Duke ⊕ DUke ⊕ e-books ⊕ e-learning ⊕ EAD ⊕ ebooks ⊕ economics ⊕ Edmonton ⊕ EDUCAUSE ⊕ electronic ⊕ electronic-publishing ⊕ electronic-resources ⊕ Emory ⊕ enclosure-of-the-commons ⊕ engine ⊕ England ⊕ English ⊕ entertainment ⊕ ephemera ⊕ epub ⊕ ERES ⊕ Eric ⊕ error ⊕ errros ⊕ Espresso ⊕ etymology ⊕ EU ⊕ Europe ⊕ examples ⊕ Excel ⊕ facebook ⊕ facets ⊕ fair-use ⊕ fairuse ⊕ family ⊕ federated-search ⊕ fileformats ⊕ film ⊕ FirstMonday ⊕ Flash ⊕ flash-in-the-pan ⊕ flickr ⊕ Florida ⊕ folger ⊕ fonts ⊕ football ⊕ format ⊕ formats ⊕ frameworks ⊕ France ⊕ FRBR ⊕ free-kittens ⊕ freedom ⊕ French ⊕ full-text ⊕ funded ⊕ funding ⊕ future ⊕ future-of-the-book ⊕ future-of-the-library ⊕ fuutre ⊕ games-with-a-purpose ⊕ genealogy ⊕ geneaology ⊕ geneology ⊕ German ⊕ Germany ⊕ GIS ⊕ gladwell ⊕ global ⊕ goggle ⊕ google ⊕ google-vs-libraries ⊕ googlebooks ⊕ googlebooksearch ⊕ Gorman ⊕ grants ⊕ graphs ⊕ Greek ⊕ green ⊕ Gregory ⊕ Guardian ⊕ Gutenberg ⊕ hardware ⊕ Harvard ⊕ Haverford ⊕ HCI ⊕ health ⊕ hegemony ⊕ Hellman ⊕ Henry ⊕ hieroglyphics ⊕ highwire ⊕ Hillmann ⊕ history ⊕ history-of-digitization ⊕ HMD ⊕ Homer ⊕ hours ⊕ humanities ⊕ humor ⊕ Huxley ⊕ IBM ⊕ ICDL ⊕ ideas ⊕ identifiers ⊕ ideology ⊕ IDPF ⊕ if:book ⊕ IFLA ⊕ Iliad ⊕ ILL ⊕ ILS ⊕ IM ⊕ images ⊕ imaging ⊕ IMLS ⊕ impact-factor ⊕ incunabula ⊕ indexing ⊕ India ⊕ Indiana ⊕ Indians ⊕ information ⊕ information-economy ⊕ information-literacy ⊕ information-quality ⊕ infrastructure ⊕ innovation ⊕ institutional ⊕ institutional-repositories ⊕ intellectual-property ⊕ intellectual-publishing ⊕ interdisciplinary ⊕ interesting ⊕ interface ⊕ interfaces ⊕ internet ⊕ interoperability ⊕ IP ⊕ iPhone ⊕ IR ⊕ ISBN ⊕ Italy ⊕ JAL ⊕ Japan ⊕ jargon ⊕ Jason ⊕ JBIG2 ⊕ jbrinley ⊕ JCDL ⊕ JHU ⊕ JISC ⊕ Johanna ⊕ john ⊕ jonathan ⊕ journalism ⊕ journals ⊕ jpeg2000 ⊕ K12 ⊕ kahle ⊕ karen ⊕ Karen-Coyle ⊕ kindle ⊕ Kirtas ⊕ knowledge ⊕ knowledge-worker ⊕ language ⊕ languages ⊕ last-copies ⊕ Lavoie ⊕ law ⊕ LC ⊕ LCSH ⊕ leadership ⊕ LEEP ⊕ legal ⊕ Leggott ⊕ letters ⊕ Levy ⊕ libraires ⊕ librarian ⊕ librarians ⊕ librarianship ⊕ libraries ⊕ libraries-and-TV ⊕ libraries-vs-Google ⊕ library ⊕ library-vendor-history ⊕ library-vs-Google ⊕ licenses ⊕ licensing ⊕ lifehacker ⊕ lifelogging ⊕ light ⊕ Linnaeus ⊕ lis ⊕ listservs ⊕ LITA ⊕ livebooks ⊕ Liza ⊕ LJ ⊕ LMS ⊕ LoC ⊕ lock-in ⊕ LOCKSS ⊕ logistics ⊕ Lorcan ⊕ lotf ⊕ Luddites ⊕ luggage ⊕ Lynch ⊕ machine-learning ⊕ malcolm ⊕ management ⊕ Mann ⊕ manuscripts ⊕ maps ⊕ Marc ⊕ Mark ⊕ marketing ⊕ Markey ⊕ Markov ⊕ maryland ⊕ mass ⊕ mass-communication ⊕ mass-digitization ⊕ massdigitization ⊕ materiality ⊕ Mawr ⊕ Max-Planck ⊕ mbooks ⊕ medals ⊕ medicine ⊕ medieval ⊕ Mellon ⊕ memory ⊕ meta-read ⊕ metadata ⊕ metadata-consistency ⊕ metadata-quality ⊕ metaphors ⊕ METS ⊕ MHC ⊕ Michael ⊕ michigan ⊕ microfilm ⊕ microform ⊕ microsoft ⊕ Midwinter ⊕ migration ⊕ mindcontrol ⊕ misinformation ⊕ MODS ⊕ money ⊕ monopolies ⊕ monopoly ⊕ Montana ⊕ multi-spectral-imaging ⊕ multimedia ⊕ Murray ⊕ museums ⊕ music ⊕ mylifebits ⊕ names ⊕ NARA ⊕ NASA ⊕ NASIG ⊕ NationalAirAndSpace ⊕ Native-American ⊕ NDA ⊕ NDIPP ⊕ NEDCC ⊕ negotiation ⊕ NEH ⊕ NEJM ⊕ network ⊕ newmedia ⊕ newspaper ⊕ newspapers ⊕ NewYorker ⊕ NewZealand ⊕ next-generation ⊕ NFL ⊕ ngc4lib ⊕ Ngram ⊕ NIH ⊕ NISO ⊕ NIST ⊕ NLM ⊕ Nobel ⊕ NPR ⊕ ny ⊕ nypl ⊕ nyreviewofbooks ⊕ nytimes ⊕ OAI-ORE ⊕ OAI-PMH ⊕ OAI_ORE ⊕ obsolescence ⊕ oca ⊕ OCLC ⊕ OCLR ⊕ OCR ⊕ OCRopus ⊕ odd ⊕ off-site-storage ⊕ offices ⊕ Ohio ⊕ OhioLINK ⊕ olifactory ⊕ online-learning ⊕ online-reference ⊕ OOD ⊕ OP ⊕ OPAC ⊕ openaccess ⊕ openbookalliance ⊕ openlibrary ⊕ openness ⊕ opensource ⊕ optics ⊕ oral ⊕ oral-history ⊕ oralhistory ⊕ ORE ⊕ oreilly ⊕ orphan-works ⊕ orphaned-works ⊕ orphanworks ⊕ OSTI ⊕ outsourcing ⊕ ownership ⊕ Oxford ⊕ Pace ⊕ pages ⊕ paper ⊕ papyrus ⊕ Paul ⊕ pay-per-view ⊕ PDF ⊕ PDFs ⊕ PennTags ⊕ people ⊕ perfect-copies ⊕ permission ⊕ permissions ⊕ personal ⊕ Peter ⊕ Peter-Brantley ⊕ Philadelphia ⊕ photos ⊕ Photoshop ⊕ picasa ⊕ plaster ⊕ POD ⊕ PoD ⊕ podcasts ⊕ poe ⊕ policy ⊕ popculture ⊕ population ⊕ Portico ⊕ postage ⊕ postcards ⊕ posters ⊕ power ⊕ Prelinger ⊕ present ⊕ presentations ⊕ preservation ⊕ press ⊕ primary-sources ⊕ princeton ⊕ print ⊕ printing ⊕ privacy ⊕ problems ⊕ project-management ⊕ propaganda ⊕ ProQuest ⊕ public ⊕ public-domain ⊕ public-libraries ⊕ publicdomain ⊕ publicly ⊕ publicpolicy ⊕ publishers ⊕ publishing ⊕ Purdue ⊕ Pynchon ⊕ python ⊕ quality ⊕ quality-control ⊕ quantity ⊕ quantity-not-quality ⊕ quotes ⊕ rare ⊕ rare-book-school ⊕ rare-books ⊕ rations ⊕ RDF ⊕ reading ⊕ recaptcha ⊕ recommendations ⊕ recommended ⊕ record-management ⊕ records ⊕ reference ⊕ reference-librarians ⊕ references ⊕ reflection ⊕ reformatting ⊕ regional ⊕ relevance ⊕ remote-storage ⊕ reports ⊕ repositories ⊕ reproducibility ⊕ reproduction ⊕ reputation ⊕ research ⊕ resource ⊕ responsibility ⊕ REST ⊕ reuse ⊕ reviews ⊕ rights ⊕ rightsmanagement ⊕ risks-of-perfect-copies ⊕ RLG ⊕ Rob ⊕ Robert ⊕ rochkind ⊕ Ross ⊕ Roy ⊕ RSS ⊕ SAA ⊕ SAA2008 ⊕ SALT ⊕ san-serif ⊕ scanner ⊕ scanners ⊕ scanning ⊕ Schneider ⊕ scholarly-communication ⊕ scholarship ⊕ schoogle ⊕ science ⊕ scientific-communication ⊕ scores ⊕ scott ⊕ scrapbooks ⊕ scratch-n-sniff ⊕ screens ⊕ scribe ⊕ scribes ⊕ search ⊕ Searcher ⊕ searching ⊕ Seattle ⊕ secondlife ⊕ secrecy ⊕ self-archiving ⊕ semantic ⊕ sentiments ⊕ serials ⊕ series-authority ⊕ serif ⊕ serious-games ⊕ services ⊕ settlement ⊕ sf ⊕ shakespeare ⊕ sharing ⊕ shelves ⊕ silos ⊕ Singer ⊕ situational-awareness ⊕ Siva ⊕ small-press ⊕ smell ⊕ Smithsonian ⊕ snarky ⊕ SOA ⊕ sociology-of-knowledge ⊕ software ⊕ solr ⊕ sound ⊕ spaulding ⊕ special-collections ⊕ specialcollections ⊕ spoken ⊕ squeeze-generation ⊕ standards ⊕ stanford ⊕ statistics ⊕ Steinbeck ⊕ stereotypes ⊕ Steven ⊕ stickers ⊕ storage ⊕ stories ⊕ strategy ⊕ structured ⊕ Styles ⊕ subscription-required ⊕ summarization ⊕ summary ⊕ surrogates ⊕ sustainability ⊕ Swarthmore ⊕ Switzerland ⊕ synchronised-data ⊕ tables ⊕ tablets ⊕ tagging ⊕ technology ⊕ TEI ⊕ Tennant ⊕ Terry ⊕ text-mining ⊕ textMD ⊕ texts ⊕ Thomas ⊕ thoughts ⊕ tim ⊕ timelines ⊕ TOC ⊕ todo ⊕ tolisten ⊕ tolook ⊕ tools ⊕ tools-of-change ⊕ toread ⊕ toreread ⊕ torered ⊕ Tornoto ⊕ Toronto ⊕ ToS ⊕ towatch ⊕ track-changes ⊕ training ⊕ transparency ⊕ treeware ⊕ trends ⊕ trust ⊕ tutorial ⊕ typesetting ⊕ typos ⊕ U.S. ⊕ UCSD ⊕ udell ⊕ UIUC ⊕ UK ⊕ unbound ⊕ UNESCO ⊕ union-catalogs ⊕ unique ⊕ unique-copies ⊕ university ⊕ University-presses ⊕ uplift ⊕ usage ⊕ user-focus ⊕ Utah ⊕ vendors ⊕ video ⊕ videogames ⊕ Vinge ⊕ Virginia ⊕ virtual ⊕ visual ⊕ Wales ⊕ Walt ⊕ WaltDisney ⊕ war ⊕ WashingtonPost ⊕ wax-cylinders ⊕ weather ⊕ web ⊕ web-as-living-organism ⊕ web-services ⊕ web2.0 ⊕ webservices ⊕ weeding ⊕ wikipedia ⊕ Wired ⊕ wireless ⊕ wishlist ⊕ word ⊕ words ⊕ work ⊕ workflow ⊕ Writers ⊕ writing ⊕ WSJ ⊕ WWI ⊕ WWII ⊕ Xerox ⊕ XML ⊕ yearbooks ⊕ youtube ⊕ Zigtag_Imported_Bookmarks ⊕ zotero ⊕Copy this bookmark: