ULCC Publications Archive

ArchivePress: 
A Really Simple Solution to Archiving Blog Content

Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: 
A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA.

[img]
Preview
PDF (Paper) - Published Version

/Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: 
A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA./thumbnails/1/small.png


252Kb
[img]PDF (Slides for presentation) - Presentation

/Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: 
A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA./thumbnails/2/small.png


1096Kb

Official URL: http://escholarship.org/uc/item/7zs156mb

Abstract

ArchivePress is a new technical solution for collecting and archiving content from blogs. Current solutions are commonly based on typical web archiving activities, whereby a crawler is configured to harvest a copy of the blog and return the copy to a web archive. This approach is perfectly acceptable if the requirement is that the site is presented as an integral whole. However, ArchivePress is based upon the premise that blogs are a distinct class of web-based resource, in which the post, not the page, is atomic, and certain properties, such as layouts and colours, are demonstrably superfluous for many (if not most) users. As a result, an approach that builds on the functionality provided by web feeds to capture only selected aspects of the blog offers more potential. This is particularly the case when institutions wish to develop collections of aggregated blog content from a range of different sources. The presentation will describe our research to develop such an approach, including work to define the significant properties of blogs, details of the technical development, and pilot collections against which the tool has been tested.

Export Citation
Item Type:Conference or Workshop Item (Paper)
Additional Information:No further information
Uncontrolled Keywords:blogs, blog archiving, web preservation
Subjects:Digital Archives and Libraries > Projects
Digital Archives and Libraries > Digital Preservation
Divisions:Digital Archives
ID Code:101
Deposited By:Mr Richard Davis
Deposited On:06 Dec 2009 22:39
Last Modified:21 Apr 2010 12:38
See downloads' graphs for this item

Fulltext Downloads

Repository Staff Only: item control page


Comments

Add a Comment


Notes

Add a Note - this will be visible to you alone, while you are logged in.

Note title [optional]:

Tag this item (You may enter a comma separated list):