musings related to metadata, cataloging, and the “great big” world of librarianship (plus some other stuff…)
Wednesday, June 27, 2007

New! Journal of Library Metadata

The information below comes directly from the Haworth Press website. Be on the lookout for the first call for papers (coming soon)!


Journal of Library Metadata

The journal with the unique focus on metadata applications in libraries

Editor

Jeffrey Beall, MSLS, MA, BA
Metadata Librarian/Assistant Professor
Auraria Library
University of Colorado at Denver and Health Sciences Center

About the Journal

Get the latest research, innovations, news, and expert views about metadata applications and the role of metadata in information retrieval

Libraries are a unique realm for metadata users, with distinct needs and design specifications for metadata applications. The Journal of Library Metadata is the exclusive forum for the latest research, innovations, news, and expert views about all aspects of metadata applications in libraries and about the role of metadata in information retrieval. This focused journal comprehensively discusses practical, applicable information that libraries can effectively use in their own information discovery environments. Specialized knowledge, the latest technology, and top research are presented pertaining specifically to evolving metadata use in libraries.

The Journal of Library Metadata provides three basic categories of articles. The journal will provide top-quality traditional, peer-reviewed articles of importance to metadata users. The second category includes shorter, scholarly, non-peer reviewed articles providing descriptions of innovative new applications or developments, brief research on a narrow topic, and/or descriptions of problems and solutions pertaining to specific or general metadata applications. The third category are viewpoint articles providing upbeat, critical analyses on topics related to library metadata applications which point out the strengths and weaknesses of particular metadata applications, content standards, etc., designating a preferred direction for improved applications of metadata technology.

Topics for articles covered by the Journal of Library Metadata include:

  • application profiles
  • best practices
  • controlled vocabularies
  • crosswalking of metadata and interoperability
  • digital libraries and metadata
  • display of search results
  • federated repositories
  • federated searching
  • folksonomies
  • individual metadata schemes
  • institutional repository metadata
  • metadata content standards
  • metadata harvesting
  • ontologies
  • preservation metadata
  • resource description framework
  • resource discovery and metadata
  • search engines and metadata
  • SKOS
  • stochastic vs. deterministic searching
  • tagging and tag clouds
  • topic maps
  • visual image and moving image metadata

The Journal of Library Metadata is essential reading for metadata librarians, catalog librarians, librarians working on institutional repositories or digital libraries, information scientists, researchers in library science and organization of information, researchers in and designers of information retrieval systems, preservation librarians who work with preservation metadata, and researchers in and developers of information standards.

Product Details:
SKU: J517v8

Current Volume:
Volume 8, No. 1—Spring—2008.
Volume 9, No. 1—Spring—2009. Quarterly (4 issues per year)

Wednesday, June 27, 2007

I am a hyphen

I scored 38% Sociability and 41% Sophistication!


I am comfortable around others. While I don’t have to go out every night, I take pride in being easy to get along with. This should not, however, be misconstrued as believing (as many do) that I am without subtlety. In fact, I have the power to inform the anal retentive that, indeed, they are discussing an anal-retentive issue. Who else can do that? Quotation marks intimidate me a little bit.*


Link: The Which Punctuation Mark Are You Test written by Gazda on OkCupid, home of the The Dating Persona Test

*I’m not sure why, though…

Monday, June 25, 2007

ALA Annual 2007: ALCTS President’s Program

Ambient Findability – Librarians, Libraries and the Internet of Things
Information that’s hard to find will remain information that’s hardly found.

Presented by Peter Morville, president of Semantic Studios, author of Ambient Findability, and adjunct faculty at University of Michigan.

The ballroom at the Renaissance Hotel was PACKED, even though Morville’s presentation was competing with Julie Andrews’s book-signing.

Morville’s presentation dealt with the role of findability in defining the user experience. “Ambient findability” is a phrased he coined so that he’d have a place to “stuff anything into.” Findability is his “personal passion.”

On the definition of “information architecture”:

  • in the “polar bear” book (1st ed.) on information architecture there was no concise definition
  • in the 2nd-4th eds. there were four definitions!

It is important to provide multiple paths to the same information. See Stanford University’s website. Many times, global navigation schemes are challenging – there’s too much pressure on one or two words. Web designers should bundle up subcategories, like ConsumerReports.org has done. Users need visual clues.

Information architecture is an element of the user experience. While many web developers view it as a software development challenge, others see it as developing online collections of information, such as online encyclopedias and digital libraries.

The Elements of the User Experience (book by Jesse James Garrett)

Morville’s “User Experience Honeycomb” illustrates the facets of the user experience.

user experience honeycomb

Morville contends that even though the term “usability” has grown synonymous with valuable, usability really encompasses much more.

Useful

  • we need to ask ourselves “is it useful?” and “could it be more useful?”
  • we need to keep up with human-computer interaction research in terms of usability

Desirable

  • attractive things work better; image, identity, brand, and marketing are all important

Findable

  • we need to know if 1) our users can find our websites, 2) our users and find their way around our websites, and 3) they can find products and information despite our websites

Accessible

  • can people with disabilities access our information? What about people with mobile devices?

Credible

  • what are the elements of design that influence whether people believe what’s on our websites?

All of these qualities relate to and affect each other. We need to keep them in mind along with OUR business goals and missions.

Morville says that users trust the things that come at the top of Google search results and that credibility and findability are becoming increasingly connected. He gave an example from his work with the National Cancer Institute’s cancer website. He pointed out that while the single most common search term used for finding the NCI site is “cancer,” there are also searches on specific cancer types, and these constitute more than 70% of searches. Absearch on the term “melanoma” doesn’t make doesn’t make NCI’s site come up in the top 10 sites in Google. This is an example of designers needing to engage in search engine optimization, etc.

“Good things happen when you focus on findability.”

There are still huge barriers to access, so Morville asks “how do we break down the walls?” He gives the example of Thomson Gale’s AccessMyLibrary. This site aims to “break down walls” by exposing more information from articles to the open web so that search engines can find them. The service then provides users with ways to find the articles in a local library. BUT, Morville showed that articles can’t be found via Google unless the search query includes “access my library.” We need to be awary of Google’s supplementary index, which he calls “Google Hell.” Thomson Gale has a good strategy, but there are still problems.

Another example Morville showed is University of Michigan’s MLibrary 2.0. They are applying web 2.0 concepts in order to come up with new and exciting ways to serve customers (see Jessamyn West’s presentation slides and notes from June 8, 2007 here: http://www.librarian.net/talks/mlibrary/.

So, what about the future? Morville suggests that we have one foot in the past and one in the future. We need to be designing for the future.

Where are things going LONGER TERM?

Findability

  • object (physical/digital) – what are all the different ways that somebody could find this object? how can we make it more findable?
  • wayfinding, navigation and retrieval

Ambient

  • refers to surroundings

“the ability to find anyone or anything from anywhere at anytime”

BUT perfect findability is unattainable. Not everyone lives in information age; lots of folks are starving for information. Morville showed us John Wood’s website about his book, Leaving Microsoft to Change the World.

“World change starts with educated children.”

According to Morville, we live in the world of information anxiety - there is an absurd amount of information.

“A wealth of information creates a poverty of attention” — Herbert Simon.

“Information is gushing toward your brain like a firehose aimed at a teacup” — Dogbert.

In an age when we can pick and choose our news, etc., what does this mean? Morville says it means that “things are only going to get weirder.” As an example of this, he first showed us Ambient Orb, “a [$150.00] glass lamp that uses color to show weather forecasts, trends in the market, or the traffic on your homeward commute.” According to the ThinkGeek website,

The Ambient Orb is a device that slowly transitions between thousands of colors to show changes in the weather, the health of your stock portfolio, or if your boss or friend is on instant messenger. It is a simple wireless object that unobtrusively presents information. Imagine if you had to go to your computer and type in your zip code whenever you wanted to check what time it was. Your important information should be as accessible as looking at a clock, now the Ambient Orb can make a variety of information just a glance away.

The Orb arrives set to indicate the Dow - glowing more green to indicate market movement up and red to indicate movement down, or yellow when the market is calm. If the market is up or down more than 1.5% the Orb will pulsate. It can be customized to a set of free channels, such as market indices (Dow, Nasdaq, S&P 500) or weather in select cities. Optionally, you can upgrade to access more premium channels, such as your customized portfolio, local weather, pollen count, or IM buddy watch. There’s also a developer interface where any semi-savvy web programmer can control the color of their Orb with a simple http “get” call. Track how full your hard drive is, traffic on your website, Slashdot posts, or your credit-card debt.

The Ambient Orb is simply plugged into any standard 110V power outlet and it is up and running on a nationwide wireless network - no internet connection required. The Orb does not attach to a PC. The channel for the Orb can be selected via a web interface and will update in a short period of time. Depending on which channel the Orb is monitoring, it will receive updates every few minutes, or perhaps once per hour for some channels.

This is just one example of the alternative interfaces to digital networked information that we are beginning to see. Another example is the Microsoft Surface, a giant touchscreen computer that “turns an ordinary tabletop into a vibrant, interactive surface.

We are also importing vast amounts of information about the physical outside world into our digital environments. According to Morville, we are right on the verge of a convergence of mobile devices and location awareness (i.e., our devices will know where we are and will be able to share that information with others). For example, Amazon.com sells a wristwatch with a built in GPS that you can lock onto your child’s wrist so that you could check on his/her location. It also has a breadcrumbing feature which allows you to see where your child has been. Interestingly enough, the customer reviews of this product on Amazon.com mentioned nothing about privacy (or lack thereof); to the contrary, comments included complaints that the watch doesn’t work well enough!

Indeed, the technology is just about here, but we need to determine just how we want to use it. Where is it useful?

Morville told us about Tim Hibbard’s website where you can know where he is at any given moment. This kind of service will become more popular when we can control who can access the information.

And look at the iPhone. This is truly having the web in our pockets; rich, full featured. Morville speculates that soon Tim will be able to wander around with his iPhone and watch himself move!!!

The Internet of Things

Morville thinks it will be about 30 years until this is fully propagated. Imagine having the ability to lie in bed and do a Google search in order to find your socks.

Cisco has a wireless location appliance that gives you the ability to use RFID to tag and track highly valuable objects. Hospitals apparently misplace wheelchairs all the time, so they use this technology to save staff time in searching for wheelchairs. Who knew? So, what are some potential ways we can use this technology to save time and money in terms of tracking objects? Perhaps we can use it for wayfinding.

Morville mentioned Jennifer Tomblin and Amal Graafstra, the couple who had RFID chips implanted in their hands so they can access each other’s computers and front doors (works like a key card). They consider this the ultimate expression of love.

Google Street View

Google is taking pictures and posting them so that anyone around the world can see them. This is interesting, true, but it poses some serious privacy questions. Imagine being caught on camera in some compromising situation…

David Brin, in his book The Transparent Society, asks an interesting question: “do we want to be able to watch the watchers?”

More questions from Morville

  • how do we create bigger needles for the bigger haystacks?
  • how do we describe the uniqueness of our information objects?

Microsoft Bob

Well, it won’t be with something like Microsoft Bob and it won’t be with information visualization either.

Rummy

“There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don’t know. But there are also unknown unknowns. There are things we don’t know we don’t know.”

So who is really going to help us? In his article “Revenge of the Librarians,” Morville argued that the Internet would turn anyone into a librarian.

Now people can’t shut up about metadata. The stuff from catalog cards is now the subject of intense discussions. Metadata has become sexy:

  • free tagging of flickr and del.icio.us
  • religious/political zeal for folksonomies – let the users do it!
  • it’s a free world!

David Weinberger, author of Everything is Miscellaneous, says, “The old way creates a tree. The new rakes leaves together.” In other words, tags self organize into clusters.

BUT…

This morning, Peter Morville said, “Clusters of leaves eventually rot and return to the ground and become food for trees which live a very long time and come in many sizes shapes and colors.” He wrote a response to Weinberger’s book called (Not) Everything is Miscellaneous.

Morville asserts that we need to find ways to bring the old and the new together. We need to find ways to take the fast moving world and create a feedback loop to the lower layers. There are a lot of websites that are experimenting with this now, such as Etsy.com, “peer-to-peer commerce with tagging. (There is a good overview of the site by Michael Arrington on TechCrunch.)

But Morville doesn’t think we shouldn’t be throwing tags everywhere – look at Amazon.com. Tags don’t work well on Amazon.com, even though the similar setup (user tagging) of LibraryThing does. LibraryThing’s founder, Tim Spalding, wrote an article on this topic called “When Tags Work and When They Don’t: Amazon and LibraryThing.”

Morville’s opinion of Web 2.0:

  • there is a lot of traditional information architecture that goes into designing web 2.0, but
  • there is a whole lot of info architecture that goes on OUTSIDE web 2.0

On the future of findability

  • we will still be entering keywords 10 years from now
  • the search interface is a primary interface for commerce and government

Is this the end of browsing, then? “No,” says Morville. Even at the end of a Google search, we find a page that needs to be browsed.

One size will not fit all

    we need to look at search as a system
  • we need to understand who are users are and what they are looking for and how they search
  • we need to look at our interfaces – get rid of rot, etc.

The Long Tail

We need to continue exploring distributed algorithmic approaches but we shouldn’t ignore the what Morville calls the “Fat Head.” Search is a true learning process – search is the most important way that we learn - and we should be careful not to optimize for simplicity at the expense of associative learning.

Endeca is a good example – faceted classification:

  • works the way users work
  • the results provide many possible next steps for refining query

Some examples of Endeca implementations include

  • NCSU’s new version of its library catalog
  • UC Berkeley’s FLAMENCO interface:
  • The Flamenco search interface framework has the primary design goal of allowing users to move through large information spaces in a flexible manner without feeling lost. A key property of the interface is the explicit exposure of category metadata, to guide the user toward possible choices, and to organize the results of keyword searches. The interface uses hierarchical faceted metadata in a manner that allows users to both refine and expand the current query, while maintaining a consistent representation of the collection’s structure. This use of metadata is integrated with free-text search, allowing the user to follow links, then add search terms, then follow more links, without interrupting the interaction flow.

    FLAMENCO stands for FLexible information Access using MEtadata in Novel COmbinations.

BUT, this sort of thing is hard to replicate on the public web. Clusty, the clustering search engine, is an attempt, but it doesn’t work very well; it can be more confusing than useful.

Possible solutions?

  • Google’s new universal search with links across top of screen - but the interface is problematic (need to get rid of ads!)
  • Flickr’s automatic clustering based on tags - they let the tags speak for themselves
  • Google Book Search - expansion of the searchable web
  • Everyzing.com - uses automatic speech-to-text software; provides searchable transcripts and then links to audio and video clips

The intersection between people and content

There are areas like “social search” that are being researched:

  • Google’s page rank algorithm is a good example. Ever notice how Google favors blogs in its search results?

  • Wikipedia – an interesting mashup of old and new; traditional global navigation on top and sides – supports findability and establishes perceived authority/credibility; plus the ability for user editing

Morville says that things are not going to stay tied to the web; people are experimenting with ways to tag physical things, like with Semapedia.org:

Our goal is to connect the virtual and physical world by bringing the right information from the internet to the relevant place in physical space.

To accomplish this, we invite you to create Semapedia-Tags which are in fact cellphone-readable physical hyperlinks. You can create such Tags easily yourself by choosing and pasting a Wikipedia URL into the form above. Once created, you put the Tags up at their according physical location. You just hyperlinked your world! Others can now use their cellphone to ‘click’ your Tag and access the information you provided them.

Another example is Delicious Library which allows you to scan barcodes and create your own personal digital library. You can even link up with your neighbors and create a neighborhood library. Wikipedia gives the following overview about Delicious Library:

Delicious Library is a media cataloguing application for Mac OS X, developed by Delicious Monster. The software allows users to manage their collections of books, movies, CDs and video games by either entering them manually or by using a Bluetooth scanner, an iSight camera, or a USB-keyboard-type barcode scanner (such as a modified CueCat). In its first month Delicious Library generated $250,000 worth of sales.

Some additional reading suggested by Morville:

But this is not enough, says Morville. He told the story of the three stone cutters, each of whom was asked “what are you doing?” The first said “I’m making a living.” The second said “I’m doing the best job of stone cutting in the county.” The third said “I’m building a cathedral.” It’s this third stone cutter who gets the big picture.

We need to apply this to libraries. Libraries lift us up and inspire us.

Morville concluded his presentation by providing a list of links:

IA Therefore I Am

Peter Morville
morville@semanticstudios.com

Semantic Studios
http://semanticstudios.com/

Ambient Findability
http://findability.org/

IA Institute
http://iainstitute.org/

A PDF file of the slides from Morville’s presentation can be found at here: http://semanticstudios.com/alcts.pdf

Monday, June 25, 2007

ALA Annual 2007: Electronic Resources Interest Group

ERMS Continues: More on Standards and Systems
Saturday, June 23, 2007, 10:30 AM-12:00 PM

After the enthusiasm of acquiring a new electronic resource management system (ERMS) subsides, libraries are often overwhelmed by the extent of infrastructure adjustments and sheer manual keying that accompany the care and feeding of an ERM system. This program addressed some emerging standards from ONIX for Serials, ONIX for Licensing Terms, and the NISO Standardized Usage Statistics Harvesting Initiative (SUSHI) that could expedite the assimilation of information from content providers (via aggregators and PAMS) into a library’s electronic resource management system.

The speakers for this excellent program were Kathy Klemperer (Library and Information Systems Consulting, Acton, MA) and Linda Miller (Library of Congress).

Linda Miller gave an interesting overview of LC’s implementation of Innovative Interfaces, Inc.’s ERM to work with their Ex Libris Voyager system, a decision LC made in order to help deal with the increasing challenges associated with electronic resource management:

  • moving windows and embargoes on coverage
  • package gymnastics (packaging and repackaging)
  • license negotiation and translation
  • password management
  • supporting and enhancing the discovery process
  • tracking performance issues

LC’s ERMS implementation team is made up of an “energetic and enthusiastic” group of catalogers and acquisitions specialists with an “impressive array of language skills.” As they were putting the ERMS in place, the team realized that new skills would be needed in order to make the best use of the system (a combining of acquisitions and cataloging to build a knowledge base). These skills include:

  • license interpretation
  • FTP and other file loading/processing techniques
  • hunting for good cataloging copy…sometimes in new places
  • reconciling internal cataloging policy
  • defining the “perfect” access level record
  • developing a more sophisticated knowledge of the e-resource marketplace

The III ERMS provides LC the ability to track:

  • license details (terms of usage, start and end dates)
  • cost data, renewal data, usage statistics
  • updated coverage/holdings; current URLs
  • trial management
  • performance monitoring (i.e., keeping track of problems with electronic access)
  • subscription management

The III ERMS structure includes resource records (e.g., ABI/INFORM), license records, contact records (e.g., the vendor ProQuest) and provides a mechanism for tracking specific journals titles in packages and their holdings.

Currently, LC is analyzing license data and manually keying it into the ERM client and end-users are able to view terms of usage in LC’s Electronic Resource Online Catalog (III’s Web OPAC).

Miller spoke about some of the challenges LC has encountered during the implementation process, specifically licensing and holdings data. LC has found that interpreting and converting licenses is the most labor-intensive activity (sometimes requiring 1-3 hours per license) related to implementing the EMRS because few licenses are written in “plain English” which necessitated LC’s editing the licenses in order to make them more readable. Holdings information has proved problematic because many electronic resources lack persistent identifiers (e.g., ISSNs), plus the discrepancy between how public access management services (PAMS) (e.g., Serials Solutions) record titles and how the titles are actually cataloged makes automatic loads of holdings data problematic (matching is difficult and loads are often aborted).

Another problem with holdings information results from the fact that some PAMS don’t provide serials enumeration (volume number, issue number, etc.). Often, the information from the PAMS will be in the form of month and year only which results in confusion from a user’s perspective (print holdings are much more specifically displayed in many cases as compared to PAMS-supplied holdings information). Miller noted that even when the holdings information IS present, many PAMS lack ISSNs or other proprietary numbers (which are needed for matching). According to Miller, in May, over 3,000 automatic record updates failed because of the lack of a unique identifier that could be used for matching.

Kathy Klemperer gave a wonderful presentation explaining data communications formats for ERMS. Communications standards are key in ensuring the success of exchanging data, despite the quality of that data.

There are three current areas in which communications standards are being developed that are relevant to ERMS:

  • online holdings (which journals and volumes)
  • license management (who can use them)
  • usage statistics (how much are they being used and is there a trend)

Online holdings relates to the transmission of library-specific holdings information and this information is used frequently to generate A-Z journal lists, for link resolution, access management, and check-in and claiming.

One standard for online holdings is ONIX for Serials. ONIX is a “family of XML formats for communicating information about serial products and subscription information.” The development of ONIX for Serials has been a joint project of EDItEUR and NISO.

One ONIX format is the Serials Online Holdings (SOH) format, which is used for communicating library-specific electronic serials holdings details from publication access management systems (PAMS) to libraries. This format has been implemented by a number of vendors, including Serials Solutions, EBSCO and TDNet as senders and Innovative Interfaces and OCLC as recipients. According to Klemperer, “the SOH message is of great benefit in the maintenance of library catalogs and other end-user applications such as link resolvers and A-to-Z lists, where up-to-date and accurate online holdings statements are needed.”

Another ONIX format is “the Serials Products and Subscriptions (SPS) format, which is used for communicating information about serial subscription products, optionally with prices and specific subscription information. The SPS format is of use in a number of situations, including the transmission of price catalogs from publishers to agents; the transmission of price quotes from publishers or agents to libraries; and the transmission of a library’s subscription list among publishers, agents and libraries.”

The third ONIX format for serials is the Serials Release Notification (SRN) format, which is used for communicating information about the print publication or electronic availability of one or more serial releases. Klemperer says that “content suppliers, content consumers and intermediaries will all find it advantageous to send and/or receive Serial Release Notifications. Such notifications will serve to advertise the availability of new content, will help in minimizing unnecessary claims, and will make possible the automatic maintenance of precise holdings in online catalogs and link resolvers.”

In terms of license management, Klemperer noted that the ideal would include the ability to:

  • transmit license terms between parties
  • load pre-parsed publisher licenses into an ERMS (no need to parse the license manually)
  • compare publisher licenses to the library’s model license and
  • share negotiated licenses among consortium members

According to EDItEUR, the ONIX Publisher License (ONIX-PL) format “is intended to support the communication of license terms for electronic resources from a publisher to a user institution (directly or through a subscription agent), to enable the license terms to be loaded into an electronic resources management system maintained by the receiving institution.” ONIX-PL

  • supports communication of license terms from one party to another
  • is suitable for loading into an ERMS
  • includes mapping to and from the ERMI list of licensing terms
  • uses open source editing tools: OPLE (”OPLE is The ONIX-PL editor. It is a web application for creating and editing ONIX-PL licence expressions. For those curious about the name, OPLE is also a homonym for opal, just as ONIX is a homonym for onyx.”–http://ople.sourceforge.net/)

Other license-related activities of note, according to Klemperer, include:

In terms of e-journal usage statistics, Klemperer listed several needs:

  • the ability to load usage statistics into the ERMS
  • the ability to consolidate and compare usage among journals, vendors and members of consortia
  • the ability to perform calculations based on other criteria such as subscription costs, number of FTE and user categories

One usage statistics initiative is the COUNTER code of practice, which includes guidelines on what to count and how to count it. This code has been widely adopted by publishers, but it is important to remember that it is not a communications format: by itself COUNTER does not allow the automatic downloading, consolidation or comparison of statistics.

NISO’s Standardized Usage Statistics Harvesting Initiative (SUSHI), on the other hand, is a communications format that provides a mechanism for standardized delivery of COUNTER statistics. SUSHI is both a communication protocol and a data format. Klemperer explained that some of the benefits of using SUSHI include

  • automated downloads - no more manual retrieval of statistics from publisher sites
  • ability of the receiving system to consolidate and compare usage among jounrlas
  • ability of receiving system to perform calcluations base don other criteria such as subscription costs, number of FTE, and user categories

According to Klemperer, many publishers and providers are adopting SUSHI, in addition to many ERMS and some intermediaries such as database aggregators and subscription agents. Some new intermediaries like conversion and consolidation services are also adopting SUSHI.

So, what can we do in order to benefit from these standards?

Klemperer suggests that we

  • adopt e-resource management standards
  • pressure our vendors (publishers, aggregators, subscription agents) to adopt
  • include compliance in our RFPs
Sunday, June 24, 2007

Libraryman’s Book Deal

Michael “Libraryman” Porter has signed a book deal with ALA Editions. I don’t know any details, but I had to be the first person to blog about it…

That’s all.

« Previous Entries | Next Entries »