Save the Libraries – With Open Source
November 27th, 2008 by Glyn Moody
For some in the world of free software, libraries are things that you call, rather than visit. But the places where books are stored – especially those that make them freely available to the public – are important repositories of the world's knowledge, of relevance to all. So coders too should care about them alongside the other kind, and should be concerned that there is a threat to their ability to provide ready access to knowledge they have created themselves. The good news is that open source can save them.
The story begins even before RMS had his idea about the benefits of hackers sharing code, back in 1967, when the Online Computer Library Center (OCLC) was created:
OCLC was founded in 1967 by Fred Kilgour, a pioneering Ohio librarian, with a simple idea: Instead of having every library in the country separately catalog a book -- laboriously entering its title, author, and subjects in just the right format -- why not have one person enter the cataloging information, upload it to a central computer, and then let everyone else download a copy from there?
The kinship with free software and many collaborative content projects like Wikipedia is evident. But the OCLC's WorldCat has not followed the same development path as Stallman's GNU project:
Today [WorldCat] has around 50 million book records. But OCLC, the group that owns and operates it, has been a different story. It started small -- a little office in Ohio, a set of membership dues to share the cost of running the servers. But OCLC's control passed from librarians and academics to business people (its senior executive comes from consulting firm Deloitte & Touche). They realized they had a monopoly on their hands and as costs for running servers have gone down, their prices have gone up. They charge you once to get your records added to WorldCat and charge you again to get them back out and charge you a third time for a whole series of additional fees and services.
And these prices are high. A friend who runs a small public library with around 5000 cardholders was asked to pay $5400 to contribute his records and $700 to get records out, plus a whole series of "User Support" and "New Member Implementation" fees -- all far more than he could afford.
Clearly, the original kinship with GNU has long gone: now, users are expected to cough up not just to use WorldCat's records, but even to contribute. In other words, WorldCat has moved to the Microsoft model, where you have to pay for the program, and also for support in order to file bug reports to improve the program.
As the rest of the post quoted above explains, the situation looks like it is going to get even worse. In particular, it seems that it is going to get harder to export bibliographic data without constraints to other catalogues, including free alternatives such as Open Library.
The open source community has been here before, when the communally-created CDDB database was bought and the terms governing its use were modified, essentially making it hard to export data to other, free alternatives – just as is happening in the world of libraries today:
Things changed dramatically when the open CDDB.com server was bought by a company that wanted to make money from the contributions that users had made. The index file created by the Internet community could no longer be copied. Patents were obtained and granted. A large public outcry resulted, and led to the start of several projects to create an Open Source competitor for the commercial CDDB.com (now Gracenote).
In other words, the open source community simply routed around the damage by not submitting data to the closed CDDB, and by supporting instead free alternatives like Freedb and MusicBrainz, both of which are thriving.
Against this background, an obvious solution to the libraries' dependence on OCLC's central repository of bibliographic information is to start up an open equivalent along the lines of Freedb and MusicBrainz: basic hosting costs are now small, and altruists willing to run mirrors would doubtless start popping up given that the benefit to the community as a whole is so great. Users will rapidly switch their contributions to a truly free database that all can access, and this will soon surpass any legacy system, which will, in any case, find itself cut off from the community it has hitherto depended on (and taken for granted).
Unfettered access won't be the only way that users benefit. As has become evident in recent years, releasing large quantities of primary data for free allows all kinds of innovative secondary uses to be developed, some of which can be exploited commercially – but without needing to close off the primary materials, or charge for them. As a rich and diverse ecosystem arises, users will gain new tools and capabilities that simply were not possible with a locked-down database, and companies will gain a host of new business opportunities.
What's striking about the current discussions swirling around the OCLC saga is that they are being conducted in something of a vacuum, despite the fact that open source has a rich store of relevant experience that librarians could usefully refer to. The only problem is that little of it is to be found in books.
Glyn Moody writes about openness at opendotdotdot.
__________________________
Special Magazine Offer -- Free Gift with Subscription
Receive a free digital copy of Linux Journal's System Administration Special Edition as well as instant online access to current and past issues. CLICK HERE for offer
Linux Journal: delivering readers the advice and inspiration they need to get the most out of their Linux systems since 1994.
Subscribe now!
The Latest
Newsletter
Tech Tip Videos
- Nov-04-09
- Oct-29-09
- Oct-26-09
Recently Popular
From the Magazine
December 2009, #188
If last month's Infrastrucuture issue was too "big" for you then try on this month's Embedded issue. Find out how to use Player for programming mobile robots, build a humidity controller for your root cellar, find out how to reduce the boot time of your embedded system, and if you're new to embedded systems find out the basics that go into one. You can also read about the Beagle Board, the Mesh Potato and a spate of other interestingly named items. And along with our regular columns don't miss our new monthly column: Economy Size Geek.
Delicious
Digg
StumbleUpon
Reddit
Facebook








Google Books
On December 2nd, 2008 Anonymous (not verified) says:
You can use Google Books for free to access Worldcat without having to pay the fee...
How?
On December 2nd, 2008 Glyn Moody says:
Do you mean it actually connects you to Worldcat, or that it provides the same kind of service?
When you do a search in
On December 4th, 2008 Anonymous (not verified) says:
When you do a search in Google Books, and click on a searcch result MOST of the books will have a link on the right side of the screen under the buying links that says "Borrow this Book" and underneath that "Find this book in a library" which links to/searches WorldCat records for free.
Forgot
On December 4th, 2008 Anonymous (not verified) says:
Forgot to mention, still doesn't solve the problem of the creation of records, but as far as retrieving them, it can be done for free.
Thanks...
On December 4th, 2008 Glyn Moody says:
...for the clarification. But I think this is really about ensuring the metadata is freely available for anyone to use in any way, rather than being able to find and access stuff.
thanks
On November 30th, 2008 izzy (not verified) says:
I am currently halfway through my library media science degree and really enjoyed this article. I'm looking into Biblios right now, I had never heard about it.
Library and Archives Canada Offers Free and Open Access
On November 28th, 2008 Barbara I. Irwin (not verified) says:
What a sad state of affairs with OCLC. When I was in the public library business I remember OCLC, WLN (Washington Library Network), and UTLAS (University of Toronto Library Automation System) -- they truly subscribed to the free and open model of sharing cataloguing records.
There is a bright spot, however. Library and Archives Canada through their Amicus Database, allows free access to their database of 30 million+ records from 1,300 Canadian libraries including the National Library of Canada and Archives Canada. Using the Z39.50, an information retrieval standard, libraries can download (or upload) bibliographic records without charge. Free registration is required.
See: http://www.lac-bac.gc.ca/amicus/ for more information.
Good news
On November 29th, 2008 Glyn Moody says:
Thanks for the linnk.
Great
On November 27th, 2008 Glyn Moody says:
So why doesn't everyone stop moaning about the OCLC's move and just move to Biblios?
Because it's far easier to
On November 27th, 2008 Chris Cormack (not verified) says:
Because it's far easier to moan about something, than do something about it?
Beta?
On November 28th, 2008 goblin says:
... and because it's still Beta?
True...
On November 28th, 2008 Glyn Moody says:
...but so is Gmail.
Also complaining
On December 28th, 2008 goblin says:
People also complain about Gmail - so there :-)
ah yes...
On November 27th, 2008 Glyn Moody says:
...silly me.
Its on the way already
On November 27th, 2008 Chris Cormack (not verified) says:
Check out http://biblios.net/ (the beta is currently running)
Post new comment