Save the Libraries – With Open Source

For some in the world of free software, libraries are things that you call, rather than visit. But the places where books are stored – especially those that make them freely available to the public – are important repositories of the world's knowledge, of relevance to all. So coders too should care about them alongside the other kind, and should be concerned that there is a threat to their ability to provide ready access to knowledge they have created themselves. The good news is that open source can save them.

The story begins even before RMS had his idea about the benefits of hackers sharing code, back in 1967, when the Online Computer Library Center (OCLC) was created:

OCLC was founded in 1967 by Fred Kilgour, a pioneering Ohio librarian, with a simple idea: Instead of having every library in the country separately catalog a book -- laboriously entering its title, author, and subjects in just the right format -- why not have one person enter the cataloging information, upload it to a central computer, and then let everyone else download a copy from there?

The kinship with free software and many collaborative content projects like Wikipedia is evident. But the OCLC's WorldCat has not followed the same development path as Stallman's GNU project:

Today [WorldCat] has around 50 million book records. But OCLC, the group that owns and operates it, has been a different story. It started small -- a little office in Ohio, a set of membership dues to share the cost of running the servers. But OCLC's control passed from librarians and academics to business people (its senior executive comes from consulting firm Deloitte & Touche). They realized they had a monopoly on their hands and as costs for running servers have gone down, their prices have gone up. They charge you once to get your records added to WorldCat and charge you again to get them back out and charge you a third time for a whole series of additional fees and services.

And these prices are high. A friend who runs a small public library with around 5000 cardholders was asked to pay $5400 to contribute his records and $700 to get records out, plus a whole series of "User Support" and "New Member Implementation" fees -- all far more than he could afford.

Clearly, the original kinship with GNU has long gone: now, users are expected to cough up not just to use WorldCat's records, but even to contribute. In other words, WorldCat has moved to the Microsoft model, where you have to pay for the program, and also for support in order to file bug reports to improve the program.

As the rest of the post quoted above explains, the situation looks like it is going to get even worse. In particular, it seems that it is going to get harder to export bibliographic data without constraints to other catalogues, including free alternatives such as Open Library.

The open source community has been here before, when the communally-created CDDB database was bought and the terms governing its use were modified, essentially making it hard to export data to other, free alternatives – just as is happening in the world of libraries today:

Things changed dramatically when the open CDDB.com server was bought by a company that wanted to make money from the contributions that users had made. The index file created by the Internet community could no longer be copied. Patents were obtained and granted. A large public outcry resulted, and led to the start of several projects to create an Open Source competitor for the commercial CDDB.com (now Gracenote).

In other words, the open source community simply routed around the damage by not submitting data to the closed CDDB, and by supporting instead free alternatives like Freedb and MusicBrainz, both of which are thriving.

Against this background, an obvious solution to the libraries' dependence on OCLC's central repository of bibliographic information is to start up an open equivalent along the lines of Freedb and MusicBrainz: basic hosting costs are now small, and altruists willing to run mirrors would doubtless start popping up given that the benefit to the community as a whole is so great. Users will rapidly switch their contributions to a truly free database that all can access, and this will soon surpass any legacy system, which will, in any case, find itself cut off from the community it has hitherto depended on (and taken for granted).

Unfettered access won't be the only way that users benefit. As has become evident in recent years, releasing large quantities of primary data for free allows all kinds of innovative secondary uses to be developed, some of which can be exploited commercially – but without needing to close off the primary materials, or charge for them. As a rich and diverse ecosystem arises, users will gain new tools and capabilities that simply were not possible with a locked-down database, and companies will gain a host of new business opportunities.

What's striking about the current discussions swirling around the OCLC saga is that they are being conducted in something of a vacuum, despite the fact that open source has a rich store of relevant experience that librarians could usefully refer to. The only problem is that little of it is to be found in books.

Glyn Moody writes about openness at opendotdotdot.

Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Google Books

Anonymous's picture

You can use Google Books for free to access Worldcat without having to pay the fee...

How?

Glyn Moody's picture

Do you mean it actually connects you to Worldcat, or that it provides the same kind of service?

When you do a search in

Anonymous's picture

When you do a search in Google Books, and click on a searcch result MOST of the books will have a link on the right side of the screen under the buying links that says "Borrow this Book" and underneath that "Find this book in a library" which links to/searches WorldCat records for free.

Forgot

Anonymous's picture

Forgot to mention, still doesn't solve the problem of the creation of records, but as far as retrieving them, it can be done for free.

Thanks...

Glyn Moody's picture

...for the clarification. But I think this is really about ensuring the metadata is freely available for anyone to use in any way, rather than being able to find and access stuff.

thanks

izzy's picture

I am currently halfway through my library media science degree and really enjoyed this article. I'm looking into Biblios right now, I had never heard about it.

Library and Archives Canada Offers Free and Open Access

Barbara I. Irwin's picture

What a sad state of affairs with OCLC. When I was in the public library business I remember OCLC, WLN (Washington Library Network), and UTLAS (University of Toronto Library Automation System) -- they truly subscribed to the free and open model of sharing cataloguing records.

There is a bright spot, however. Library and Archives Canada through their Amicus Database, allows free access to their database of 30 million+ records from 1,300 Canadian libraries including the National Library of Canada and Archives Canada. Using the Z39.50, an information retrieval standard, libraries can download (or upload) bibliographic records without charge. Free registration is required.

See: http://www.lac-bac.gc.ca/amicus/ for more information.

Good news

Glyn Moody's picture

Thanks for the linnk.

Great

Glyn Moody's picture

So why doesn't everyone stop moaning about the OCLC's move and just move to Biblios?

Because it's far easier to

Chris Cormack's picture

Because it's far easier to moan about something, than do something about it?

Beta?

goblin's picture

... and because it's still Beta?

True...

Glyn Moody's picture

...but so is Gmail.

Also complaining

goblin's picture

People also complain about Gmail - so there :-)

ah yes...

Glyn Moody's picture

...silly me.

Its on the way already

Chris Cormack's picture

Check out http://biblios.net/ (the beta is currently running)

White Paper
Linux Management with Red Hat Satellite: Measuring Business Impact and ROI

Linux has become a key foundation for supporting today's rapidly growing IT environments. Linux is being used to deploy business applications and databases, trading on its reputation as a low-cost operating environment. For many IT organizations, Linux is a mainstay for deploying Web servers and has evolved from handling basic file, print, and utility workloads to running mission-critical applications and databases, physically, virtually, and in the cloud. As Linux grows in importance in terms of value to the business, managing Linux environments to high standards of service quality — availability, security, and performance — becomes an essential requirement for business success.

Learn More

Sponsored by Red Hat

White Paper
Private PaaS for the Agile Enterprise

If you already use virtualized infrastructure, you are well on your way to leveraging the power of the cloud. Virtualization offers the promise of limitless resources, but how do you manage that scalability when your DevOps team doesn’t scale? In today’s hypercompetitive markets, fast results can make a difference between leading the pack vs. obsolescence. Organizations need more benefits from cloud computing than just raw resources. They need agility, flexibility, convenience, ROI, and control.

Stackato private Platform-as-a-Service technology from ActiveState extends your private cloud infrastructure by creating a private PaaS to provide on-demand availability, flexibility, control, and ultimately, faster time-to-market for your enterprise.

Learn More

Sponsored by ActiveState