JISC-PoWR

Preservation of Web Resources: a JISC-sponsored project

Draft Programme for Workshop 3

Posted by Marieke Guy on August 27th, 2008

The draft programme for the third JISC-PoWR workshop (Friday 12th September 2008, University of Manchester) is now available:

Presentation. 1. Introduction to JISC-PoWR (Kevin Ashley, ULCC)
Presentation. 2. Records Management vs. Web Management (Marieke Guy, UKOLN)
Breakout Session: Web Preservation in your organisation
Presentation. 3. Web Preservation and Web 2.0 (Brian Kelly, UKOLN)
Presentation. 4. Legal issues (Jordan Hatcher, Opencontentlawyer)

LUNCH

Presentation. 5. The JISC-PoWR Workshops - Inputs and Outcomes (Marieke Guy, UKOLN)
Presentation. 6. The JISC-PoWR Handbook - Explaining Web Preservation (Kevin Ashley, ULCC)
Presentation. 7. The JISC-PoWR Handbook - Identifying Web Issues (Richard Davis)

COFFEE

Breakout Session: The next steps for Web Preservation in your organisation
Presentation. 8. The JISC-PoWR Handbook - Recommended Approaches (Ed Pinsent, ULCC)
Presentation. 9. Future possibilities
Final Thoughts

More information is available on the Workshop 3 page.

Places are still available. You can register using the Online Registration Form (note that this link takes you out of the JISC-PoWR blog to a Google Doc form).

Posted in Workshops, Events | No Comments »

5 Reasons why you should do Web Preservation

Posted by Marieke Guy on August 18th, 2008

For those who still need to convince their senior management here are five reasons why you should embed Web preservation strategies within your institution:

1. You need to protect your institution
University Web sites contain evidence of institutional activity which is not recorded elsewhere and may be lost if the Web site is not archived or regular snapshots are taken. If you do not record certain information you are in danger of failing to comply with legal acts such as FOI and DPA, you may be breaking contractual and auditing obligations and put your institution at risk. This risk management approach has been taken to countless other digital resources (for example email - Curation of emails), it is only a matter of time before it is a standard approach to Web sites.

2. Starting a Web preservation programme will make you look like a ‘forward thinking’ university
You could be one of the first to start an official ‘Web preservation’ programme which will be great marketing fodder. (Remember the first UK Universities to offer blogs to students (Warwick), launch a YouTube channel and offer downloadable lectures using iTunes (University College London)? How about the first to get sued by a student for changing the course specification and having no record of the previous entry? Universities have already been sued over Web site accessibility, copyright of material on their site and allowing plagarism to take place.) Embedding Web preservation strategies will also help you think about the continuity of resources, dead links etc.

3. It could save you money
Web resources cost money to create and failing to repurpose and reuse them will waste money. Although Web preservation may have an initial cost, once the process has begun the savings can be great. Having a good strategy in place (which also should include selection and deletion where appropriate) will save both money and energy in the long run. Brian Kelly’s recent UK Web Focus Blog post on the environmental issues involved in digital preservation touches on this. As Owen Steven suggests in his comment it may make sense to link digital preservation to commercialism.

4. You have a responsibility to the people who use your resources
Students and staff may make serious choices based on Web site information and you have a responsibility to make sure a record is kept of this information.

5. You have a responsibility to the people who may need to use your resources in the future
Many of resources your institution publishes are unique and deleting them may mean that invaluable scholarly, cultural and scientific resources (heritage records) will be unavailable to future generations.


These reasons should give your senior management food for thought. These drivers and others will be expanded on in the JISC-PoWR handbook.

You can find out more on how to get started on a Web Preservation strategy by attending our upcoming workshop on Embedding Web Preservation Strategies Within Your Institution.

Posted in Preservation | 1 Comment »

Heritage Records and the Changing Filter through which we View our World

Posted by Marieke Guy on August 11th, 2008

At both of the JISC-PoWR workshops delegates have been keen for the project team to spell out the reasons why institutions might want to preserve Web resources. These ‘drivers’ then give fuel to their case for the funds needed to archive the institutional Web site.

The idea of ‘heritage records’ is one that is often mentioned. Using Web sites as a ‘cultural snap shot’ has the potential to be a highly useful activity.

In his interesting and functional text Managing the Crowd: Rethinking Records Management for the Web 2.0 World Steve Bailey puts forward the point that deciding what will be important in the future is a tricky business. As he explains in the section on appraisal, retention and destruction: “The passage of time inevitably changes the filter through which we view our world and assess its priorities.”

Steve gives the example of the current plethora of Web sites that offer what we might call ‘quack’ remedies for medical problems. These sites may not seem to be of great interest right now but they may be invaluable to future historians who wish to demonstrate the distrust of the medical profession exhibited in 21st century western culture.

James Curral in his recent plenary talk at the recent Institutional Web Management Workshop used the example of blog posts made by soldiers out in Iraq and Afghanistan to demonstrate the irony of modern technology; these highly informative records could easily be lost while the diaries of World War II soldiers remain accessible.

Preservation mistakes have been made aplenty in the past. The destruction of much of the BBC’s flagship programmes in the 1970s has been well documented and in 2001 the BBC launched a a treasure hunt campaign to locate recordings of pre-1980 television or radio programmes. Ironically the Web site is no longer being updated, though it is still hosted on the BBc server.

So who can know what the future will bring? Which Web resources will we wish we had kept? Which student blog writer will go on to be a future prime minister or an infamous criminal? What bit of the terrabytes is the most important?

As Steve Bailey points out there is no crystal ball. It has always has been, and always will be, very difficult to predict what resources may prove to be valuable to future generations.

Although this offers little recompense for those making these choices, it does at least argue the case that we do need to preserve and we need to do so soon.

Posted in Web 1.0, Challenges, Records management, Preservation | 2 Comments »

Workshop 3 - Embedding Web Preservation Strategies Within Your Institution

Posted by Marieke Guy on August 9th, 2008

Bookings are now open for the third JISC-PoWR workshop to be held at the Flexible Learning Space, University of Manchester on Friday 12th September 2008. The workshop entitled Embedding Web Preservation Strategies Within Your Institution is free to attend and open to Web, information and records managers working in HE/FE Institutions and related HE and FE agencies.

For information on how to reserve a place see the workshop 3 page.

Posted in Workshops, Events | No Comments »

Preservation Experts Suggest That The Term “Digital Preservation” Is Harmful

Posted by Brian Kelly on August 8th, 2008

A recent post entitled “Digital Preservation” term considered harmful?” on the Digital Curation blog begins with the words:

Over the past few weeks I have become acutely aware that the term “digital preservation” may be becoming a problem.

Not quite what one might expect from Chris Rusbridge, director of the Digital Curation Centre (DCC)! And James Currall, who recently gave a plenary talk on Web site preservation issues at UKOLN’s IWMW 2008 event, appears to have been responsible for such heresy with his view that:

The digital preservation community has become very good at talking to itself and convincing ‘paid-up’ members of the value of preserving digital information, but the language used and the way that the discourse is constructed is unlikely to make much impact on either decision-makers or the creators of the digital information (academics, administrators, etc.).

But I have to say that I think that these views reflect the experiences we have had in the JISC PoWR project. Indeed Alison Wildish was quite open about this in her presentation at the first JISC PoWR workshop.

While we have to use “digital preservation” in appropriate contexts, including technical and other in-house discussions, and digital curation is appropriate in other contexts, terms that reflect the outcomes are more persuasive. The outcome of successful digital preservation is that digital resources remain accessible and usable over the long term.

and concludes by arguing that:

… outcome-related phrases like “long term accessibility” or “usability over time” are better than the process-oriented phrase “digital preservation”.

Amen to that! This reflects my views on the need to take a user-focussed approach to Web site development, with long term accessibility and usability simply mean that we need to think about the users in the future and not just those we have today.  And perhaps that’s the approach we have to take in order to ’sell’ preservation to somewhat sceptical Web developers.

Should our slogan be “Web preservation is dead! Long live long term accessibility! Long live usability over time!” I wonder?

Posted in Digital preservation | 1 Comment »

RSS Feeds Of Changes To Web Pages

Posted by Brian Kelly on August 6th, 2008

Lorcan Dempsey picked up on the work of the JISC PoWR project in a blog post entitled The institutional record and web archiving. Lorcan described the presentation given at the first JISC PoWR workshop by Alison Wildish and Lizzie Richmond in which they described the changes to the University of Bath printed prospectus over the lifetime of the University of Bath.  Lorcan drew parallels between this print publication and the digital environment:

The University would always have kept the print manifestation; what now to do with the web manifestation? One of the interesting changes they note over this time is the ‘rise of the logo’, and tracing changes in how the institution presents itself over time is also interesting.

In a response to Lorcan’s post Tony Hirst referenced a blog post by Michael Nolan on the Edge Hill Web Services team blog in which Michael pointed out “one [example of interesting use of RSS] that caught my eye was the University of Warwick’s recent changes feed which allows you to subscribe to find out when the homepage changes. Better still, they have this for every page in their CMS.

An example of this can be seen for the Research page on the University of Warck Web site. Although not nornmally visibile to most end users who visit this page, there is a link to an RSS feed of recent changes to the page. Using tools such as the Greasemonkey RSS Panel (available for Firefox) you can view the changes, as shown below.

News feed of change on a page on the University of Warwick Web site 

In his comment on Lorcan’s blog Tony Hirst went on to suggest that ”A change feed, like on a wiki, could be one way (maybe) of facilitating 1st, 2nd or 3rd party web page archiving?“. I think Tony might be right. And maybe we are seeing the University of Warwick pioneering this approach, as the feed of recent changes seems to be provided by their in-house Sitebuilder 2 software, “the University’s web publishing tool“.

Perhaps when institutions are next procuring a CMS system they should be asking if vendors provide RSS feeds of changes to pages.  

Posted in Web 1.0 | 1 Comment »

JISC PoWR Workshop 2: Preservation and Web 2.0

Posted by Brian Kelly on July 31st, 2008

The second JISC PoWR workshop was held on 23rd July 2008 as part of UKOLN’s annual institutional Web management workshop, IWMW 2008.

This workshop provided an opportunity to review the outcomes of the first workshop, in which members of the JISC PoWR team and the 30+ participants identified some of the challenges to be faced in preserving content held on institutional Web services and explored some of the ways in which these challenges can be addressed. The slides for this review are available on Slideshare and are embedded below.


The main focus of the second workshop, however, was to look at the additional challenges which need to be addressed in a Web 2.0 context, when the content may be more dynamic, hosted by third party services and created by a wide range of users.

A PowerPoint presentation was used to initiate discussions based on a number of scenarios including use of blogs, wikis, Twitter, communications tools, social networks, ‘amplified events’ and use of third party repository services such as Slideshare - which is appropriate as this presentation is itself available on Slideshare and is embedded below.


This presentation doesn’t have any answers to these challenges - it was intended to initiate the debate at the workshop. Some of the approaches which may be relevant to the various scenarios have already been discussed on this blog including use of wikis, student blogs, use of Slideshare, instant messaging and Twitter and the wider set of discussions which took place at the workshop will feed into the final JISC PoWR handbook.

It is worth noting that this presentation was spotlighted on the Slideshare home page. This has helped to increase the visibility of the work of the JISC PoWR project: a week after the presentation hed been given there had been 713 views of the slides. It should also be noted that other Slideshare users had assigned various tags to the presentation (including data-portability, digital-preservation, sioc and preservation). As can be seen if you follow these links, we are beginning to see use of such social Web technologuies which can help users to discover related resources of interest to the digital preservation community.  This, to me, is a good example of the potential benefits which Web 2.0 can provide to those with n interest in the presevation of Web resources.

Posted in Workshops, Web 2.0 | 1 Comment »

The History of Your Institution’s Web Site

Posted by Brian Kelly on July 21st, 2008

A recent blog post by Lorcan Dempsey on “The institutional web presence again” provided a link to a page on “the history of U.Va. on the web” which provides details of 14 year’s history for the University of Virginia’s Web site from 1994-2008.

The page provides details of the Web usage statistics in the early years, with screen images shown of major changes to the home page from 1997 (unfortunately no screen images are available for the first three years of the service).

Information is provided on the people and groups responsible for the design, the changes which were made as new technologies became available, significant additional content that was added and details of awards which the site won.

This is an approach which I feel all institutions should consider taking.  And let’s start recording the history of those early years quickly, before the first generation of institutional Web managers start to retire, leave or forget the details of the institution’s Web history.

University of Virginia Web Site History

Posted in Web 1.0 | No Comments »

Preservation And Instant Messaging

Posted by Brian Kelly on July 21st, 2008

Background

The Web 2.0 environment has a strong emphasis on communications between individuals and not just one-way publishing. This pattern of usage places additional challenges for institutions wishing to ensure that records are kept of the dialogue which takes place. And these challenges may well need to be addressed within the context of policies on the preservation of Web resources as increasingly digital communications technologies will have Web interfaces.

We will be publishing a series of posts looking at different aspects of Web 2.0. In this initial post we will provide a brief case study on use of instant messaging to support communications between two institutions. The case study will attempt to draw out some of the general policy issues which should be applicable more widely.

Use of IM for the QA Focus Project

This example describes the approaches taken to use of instant messaging to support communications between the project partners for the JISC-funded QA Focus project which was launched in January 2002. The project partners were UKOLN (based at the University of Bath) and, initially, ILRT, University of Bristol. However after the end of the first year of the project ILRT withdrew form the project and were replaced by AHDS, who were based in London.

In order to minimise the amount of travel and to help to provide closely integrated working across the project partners it was agreed to make use of instant messaging technologies. As well as enabling the team members to have speedy contact with each other it was also recognised that official project meetings could be held using the technology. It was appreciated that in this context there was a need to have a slightly formal protocol for managing the meetings, to compensate for the limitations of online meetings. And in addition to the best practices for managing the online meetings it was also agreed that a record of the transcript would be kept, and that this record would be copied across to the Intranet along with other formal documents.

After AHDS replaced by ILRT as project partners we decided to change our IM client from Yahoo Messenger to MSN Messenger. It was either during this change of IM tools or whilst making use of another IM client (I can’t recollect the exact details) that we noticed that different IM applications work in slightly different ways. This includes whether a transcript of dialogue is kept automatically and whether new participants to a group chat will see only new discussions or discussions which have taken place previously (which has the potential to cause embarrassments at the least).

The experiences we gained in use of IM led the project partners to develop a policy on use of IM (which covered issues such as the possible dangers of interruptions, as well as keeping records of formal meetings held on IM). The policy also clarified use of IM in an informal context, with their being no guarantee that records would be kept.

The policy stated that:

  • IM software may be used for formal scheduled meetings. In such cases standard conventions for running meetings should be used. For example an agenda should be produced, actions clearly defined, changes of topics flagged and a record of the meeting kept.
  • IM software may be used for direct communications between individual team members. For example it may be used for working on particular tasks, to clarify issues when working on collaborative tasks and to support team working. IM may be particularly suited for short term tasks for which no archive is needed and other team members need not be involved - for example, arranging a meeting place.
  • Highly confidential information will not be sent using IM, due to the lack of strong encryption.

General Issues

The general issues arising from this case study include:

  • The need to ensure that the users of the IM technologies and those involved in developing policies related to its use have a good understanding of how the technologies work together with an understanding of the differences between different IM systems.
  • The need for simple documented policy statements

Posted in Web 2.0 | No Comments »

Preservation and Slideshare

Posted by Brian Kelly on July 18th, 2008

Slideshare is a popular externally hosted Web 2.0 service for providing access to presentations. And as I’ve described on the UK Web Focus blog, there is evidence to demonstrate its impact in maximising awareness of presentations - and this might include both awareness of research activities, as described in my post, but also marketing activities.

But what about the risks associated in making use of a third party service in this way? What will happen if, for example, the Slideshare’s business model is flawed and the company goes bankrupt? Rather than making use of a Web 2.0 service shouldn’t we be providing Slideshare’s functionality in-house?

I feel this is the wrong response: it would be similar to saying that we should not allow third party organisations to manage our savings - but we all have bank accounts. And, although we know from recent experiences in the UK that there can be risks when using banks, we don’t shut down our accounts when we became aware if incidents such as Northern Rock financial difficulties. Rather we assess the risks and then manage the risks (in the case of savings, this might be to limit one’s saving to a maximum of £35,000 with any single bank, as this amount is guaranteed by the Government).

In the case of Slideshare an in-house solution would not only be costly to replicate its functionality, but it would also be unlikely to provide the impact and popularity which Slideshare has.

The challenge then is to assess possible risks and to explore mechanisms for managing such risks. The approach I take is to look at the popularity of the service and its user community (an approach, incidentally, which has also been recommended when selecting open source software). The Techcrunch service can be useful if providing information on the financial background to many Web 2.0 companies and its information on Slideshare seems reassuring, with a post in May 2008 described how SlideShare had Secured $3M for Embeddable Presentations.

The risk management approach I have taken is to store a managed master copy of the slides on the UKOLN Web site and ensure that links to this resource are provided on Slideshare.  As can be seen from the image,  the URL is included  on the title slide and in the accompanying metadata. In addition the URL is also included in the footer of the hard copy printouts. I also provide a Creative Commons licence for the resource, which seeks to avoid any legal barriers to future curation of the resource and allow the resource to be downloaded from the Slideshare site.

Metadata provided on the Slideshare service

This approach aims to ensure that the master resource is kept at a stable managed location, allows users to make a copy of the resource (if, for example, the Slideshare service suffers from performance or reliability problems) and allows uses to bookmark or cite the managed master version of the file.

Posted in Web 2.0 | 4 Comments »