Information Overload

My class mate over at Library Corner wrote a very interesting blog post titled “The Future is Ours”. In it she mentions how we are taking control of writing our own histories via blogs, twitter, and other social media. Instead of history being written by the victor history will be written by its subject. She also brings up a valid concern, the ability for people to keep up with all these personal histories. For instance if you have a hundred blogs from a hundred different view points of a certain event your average reader isn’t going to read a hundred blogs but one or two of the most well rated ones. Even as our ability to disseminate information increases via the internet information overload becomes a real problem.

Information overload is a concern that can only really be handled through organization. And the author of Library Corner makes a very good point that Metadata is one of the ways to organize this information. With proper metadata use and attaching a tweet to a popular hashtag it is possible for even a little known author to have their information seen by others.


Format Fun

Snow Byte and the Seven Formats is brilliant. I loved everything about this article. It was witty it was funny and it made me feel smart which is a nice perk. But really, the way the story grew around storage formats was downright fun. It introduces some terms like “obsolescence” while merely hinting at others such as the prince Dublin referencing the Dublin Core elements. It also demonstrates the benefits of a digital repository, letting Snow Byte get access to her files after the wicked Queen deleted them all.

I don’t know if the author actually intended the story to be read and understood by children or if that comment was more of a jest. I believe some of the information might be a little complex specifically Dublin’s discussion of the XML wrapper around the information, then again perhaps that is my own lack of familiarity showing through. In either case I think this story brings up many important points concerning information storage and treats them in a light, fun way. I could see this story being a jumping off point for discussions in class or just a really fun relaxing read trying to catch all the references made throughout.

Dublin Core Metadata Terms

I’ve worked a little bit with Dublin Core and felt I had a fair grasp of how it worked and the fifteen core elements that it contained. So I was a bit shocked when I looked at the Dublin Core Metadata Terms and saw a long list of information in tabulated format. At first I was more than a bit intimidated, but after taking a closer look it started making a bit more sense. Rather than listing the DC elements this document lists and defines all the terms associated with the metadata schema. It rather reminds me of the data dictionaries mentioned here and discussed in my last post.

However, rather than give a simple list of terms to define the metadata the “Dublin Core Metadata Terms” document fully describes the term. This document is indeed meta-metadata or metadata about metadata, it lists the term, gives a definition of what it is used for and further documents how it relates to the terms around it. I specifically appreciated that the links provided in each “entry” of the lists gave links which gave even more information such as whether the entry was the most up to date term used or if it had been supplanted by a newer one.

Thoughts on “Considering Emulation for Digital Preservation”

I really appreciated John’s post on emulation and digital preservation which can be found on his blog here. While I have briefly covered the concerns of outdated technology such as floppy discs and cassettes, I haven’t spent a lot of time on the issue. It was nice to get the perspective of someone who has clearly dealt with the emulation technology. While I still need to read some more background articles on emulation, I think that John’s article showed how a current technological practice can be used to help preserve and use niche technologies that might otherwise fall by the wayside.

Personalized Search or Drone?

Richard Nurse’s blog post on “full library discovery” caught my attention on two levels. First was recognition. When discussing “full library discovery” my first thought was of the University of Alabama’s library search feature Scout. Scout, while it may not go as far as “full library discovery” as I am not sure if it pulls information from libguides and study aids, allows users to search across the catalog, journals, and even databases. I’ve used it a little myself, but I often find that what it gains in recall it lacks in precision making it less useful than other finding aides.

The other point that resonated with me is the question of how personalized should searches become. If users have to log-in to  a system how much should search results be personalized and in what ways? The examples given come from an academic background and involve returning items used in previous years, by successful students, etc. I think I’ve seen and read too many science fiction stories to believe that guiding and controlling search results will lead to a best outcome. On the other hand having a best search system which highlights the most common or “useful” sources could save users a great deal of time. As mentioned in the comments of the post, I think the most important aspect of modified or personalized search results is making sure the user knows that the search has been edited and giving them the power to perform an unlimited search.

So how do these two revelations fit together? I think it comes to transparency in search results; what is highlighted and what is hidden. In a “full library search” there are often items that are not searched. I know there are several databases that I often use for my research that are not searchable via Scout. Likewise in a personalized search some sources will be excluded. Finding some sort of medium is vital and making sure that the user knows what is available both upfront and what might be hidden under another search layer is just good Librarianship.

Persistent Identifiers…wha?

I’ll admit this article mostly left me stumped. It could be that I’m not completely certain of the terms being used, but I was mostly scratching my head. I could understand some of what, I think, were the main points. I thought the models based on the discovered questions had an organization that I could follow. It makes sense that before knowing what steps should be taken to persistently preserve an identifier, it must be known what needs a persistent identifier and so forth.

I even found it interesting that persistent identifiers could be applied not only to concrete documents, but to relationships and such abstractions as FRBR’s concepts of Work and Manifestation. But when the article came to trying to explain what “persist” meant I simply could follow the logic. I couldn’t tell if the article meant that a certain flyer, no longer available on the web, was still referenced by another document because it was mentioned or something more high-tech than that. I think I’ll try some of the background readings and come back to this article.

A policy checklist for enabling persistence of identifiers