<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/wordpress-mu-1.2.5" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: What&#8217;s the average lifespan of a Web page?</title>
	<link>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/</link>
	<description>Preservation of Web Resources: a JISC-sponsored project</description>
	<pubDate>Tue, 16 Mar 2010 16:32:23 +0000</pubDate>
	<generator>http://wordpress.org/?v=wordpress-mu-1.2.5</generator>

	<item>
		<title>By: Avi Rappoport</title>
		<link>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-33659</link>
		<dc:creator>Avi Rappoport</dc:creator>
		<pubDate>Wed, 03 Mar 2010 20:20:14 +0000</pubDate>
		<guid>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-33659</guid>
		<description>I found the source:

http://www.sciam.com/0397issue/0379kahle.html

Kahle, Brewster, Preserving the Internet, Scientific American, March 1997</description>
		<content:encoded><![CDATA[<p>I found the source:</p>
<p><a href="http://www.sciam.com/0397issue/0379kahle.html" rel="nofollow" onclick="javascript:urchinTracker ('/outbound/comment/www.sciam.com');">http://www.sciam.com/0397issue/0379kahle.html</a></p>
<p>Kahle, Brewster, Preserving the Internet, Scientific American, March 1997</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Web Page Life Span &#171; The ADL Librarian</title>
		<link>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18712</link>
		<dc:creator>Web Page Life Span &#171; The ADL Librarian</dc:creator>
		<pubDate>Tue, 18 Aug 2009 00:39:46 +0000</pubDate>
		<guid>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18712</guid>
		<description>[...] this context, I came across, a posting, &#8220;What’s the average lifespan of a Web page?&#8220;. It was nice to read a sort of scientific discussion of this nuisance. The posting pulls [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] this context, I came across, a posting, &#8220;What’s the average lifespan of a Web page?&#8220;. It was nice to read a sort of scientific discussion of this nuisance. The posting pulls [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ResourceShelf &#187; Blog Archive &#187; What’s the average lifespan of a Web page?</title>
		<link>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18452</link>
		<dc:creator>ResourceShelf &#187; Blog Archive &#187; What’s the average lifespan of a Web page?</dc:creator>
		<pubDate>Wed, 12 Aug 2009 21:45:42 +0000</pubDate>
		<guid>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18452</guid>
		<description>[...] Marieke Guy does an impressive job pulling together several estimates and the underlying papers where they come from. It&#8217;s one challenging question and getting even more so each day in this time of Twitter and similar services. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Marieke Guy does an impressive job pulling together several estimates and the underlying papers where they come from. It&#8217;s one challenging question and getting even more so each day in this time of Twitter and similar services. [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Day</title>
		<link>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18442</link>
		<dc:creator>Michael Day</dc:creator>
		<pubDate>Wed, 12 Aug 2009 16:24:52 +0000</pubDate>
		<guid>http://jiscpowr.jiscinvolve.org/2009/08/12/whats-the-average-lifespan-of-a-web-page/#comment-18442</guid>
		<description>As the quotation from my &lt;em&gt;Collecting and preserving the World Wide Web&lt;/em&gt; report implies, Alexa's 75 day estimate was also referred to in a 2001 scientific paper written by Lawrence &lt;em&gt;et al&lt;/em&gt;. [1]. Ironically, given that paper's subject matter, the only reference provided there was to Alexa's main Web page. From that, I couldn't trace the original percentage (or the methodology used to generate it), so I just used the figure as a general indication of volatility.

That said, as you imply, these types of figures can hide a great deal of complexity, in that certain types of Web content are likely to be more stable than others. Moreover, in practice, it probably only makes sense to think about link decay within particular contexts. For example, the article by Lawrence &lt;em&gt;et al&lt;/em&gt;. cited above was specifically concerned with Web references in scientific publications, with a case study based on computer science journal and conference papers and technical reports. Their study found that the percentage of invalid links varied over time, e.g. from 23% in 1999 to 53% in 1994. Even then, much of the content (around 80%) was actually still available, and could be found using search engines or other methods.

Over the past few years, similar studies have been made of link decay in medical and general science journals [2], followed by similar analysis of various other domains. The evidence here also tends to suggest that link volatility is higher than that of the content itself [3]. Looking that these kinds of detailed studies might actually be more helpful than providing general link decay figures that appear to mean very little. I'm NOT saying that Web link volatility isn't a problem, just that it is difficult to understand what link decay percentages mean without an understanding of exactly what they were trying to measure.

&lt;strong&gt;References:&lt;/strong&gt;

[1] Steve Lawrence, David M. Pennock, Gary William Flake, Robert Krovetz, Frans M. Coetzee, Eric Glover, Finn Arup Nielsen, Andries Kruger, and C. Lee Giles, "Persistence of Web References in Scientific Research," &lt;em&gt;Computer&lt;/em&gt; 34(2), February 2001, DOI: 26-3110.1109/2.901164

[2] Robert P. Dellavalle, Eric J. Hester, Lauren F. Heilig, Amanda L. Drake, Jeff W. Kuntzman, Marla Graber, and Lisa M. Schilling, "Going, Going, Gone: Lost Internet References," &lt;em&gt;Science&lt;/em&gt;, 302(5646), 787-788, 31 October 2003, DOI: 10.1126/science.1088234

[3] Jonathan D. Wren, "URL Decay in MEDLINE - a 4-year Follow-up Study," &lt;em&gt;Bioinformatics&lt;/em&gt;, 24(11), 1381-85, 1 June 2008, DOI: 10.1093/bioinformatics/btn127</description>
		<content:encoded><![CDATA[<p>As the quotation from my <em>Collecting and preserving the World Wide Web</em> report implies, Alexa&#8217;s 75 day estimate was also referred to in a 2001 scientific paper written by Lawrence <em>et al</em>. [1]. Ironically, given that paper&#8217;s subject matter, the only reference provided there was to Alexa&#8217;s main Web page. From that, I couldn&#8217;t trace the original percentage (or the methodology used to generate it), so I just used the figure as a general indication of volatility.</p>
<p>That said, as you imply, these types of figures can hide a great deal of complexity, in that certain types of Web content are likely to be more stable than others. Moreover, in practice, it probably only makes sense to think about link decay within particular contexts. For example, the article by Lawrence <em>et al</em>. cited above was specifically concerned with Web references in scientific publications, with a case study based on computer science journal and conference papers and technical reports. Their study found that the percentage of invalid links varied over time, e.g. from 23% in 1999 to 53% in 1994. Even then, much of the content (around 80%) was actually still available, and could be found using search engines or other methods.</p>
<p>Over the past few years, similar studies have been made of link decay in medical and general science journals [2], followed by similar analysis of various other domains. The evidence here also tends to suggest that link volatility is higher than that of the content itself [3]. Looking that these kinds of detailed studies might actually be more helpful than providing general link decay figures that appear to mean very little. I&#8217;m NOT saying that Web link volatility isn&#8217;t a problem, just that it is difficult to understand what link decay percentages mean without an understanding of exactly what they were trying to measure.</p>
<p><strong>References:</strong></p>
<p>[1] Steve Lawrence, David M. Pennock, Gary William Flake, Robert Krovetz, Frans M. Coetzee, Eric Glover, Finn Arup Nielsen, Andries Kruger, and C. Lee Giles, &#8220;Persistence of Web References in Scientific Research,&#8221; <em>Computer</em> 34(2), February 2001, DOI: 26-3110.1109/2.901164</p>
<p>[2] Robert P. Dellavalle, Eric J. Hester, Lauren F. Heilig, Amanda L. Drake, Jeff W. Kuntzman, Marla Graber, and Lisa M. Schilling, &#8220;Going, Going, Gone: Lost Internet References,&#8221; <em>Science</em>, 302(5646), 787-788, 31 October 2003, DOI: 10.1126/science.1088234</p>
<p>[3] Jonathan D. Wren, &#8220;URL Decay in MEDLINE - a 4-year Follow-up Study,&#8221; <em>Bioinformatics</em>, 24(11), 1381-85, 1 June 2008, DOI: 10.1093/bioinformatics/btn127</p>
]]></content:encoded>
	</item>
</channel>
</rss>
