Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA.
| PDF (Paper) - Published Version /Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA./thumbnails/1/small.png 252Kb | |
| PDF (Slides for presentation) - Presentation /Pennock, Maureen and Davis, Richard M. (2009) ArchivePress: A Really Simple Solution to Archiving Blog Content. In: Sixth International Conference on Preservation of Digital Objects (iPRES 2009), 5-6 October 2009, California Digital Library, San Francisco, USA./thumbnails/2/small.png 1096Kb |
Official URL: http://escholarship.org/uc/item/7zs156mb
Abstract
ArchivePress is a new technical solution for collecting and archiving content from blogs. Current solutions are commonly based on typical web archiving activities, whereby a crawler is configured to harvest a copy of the blog and return the copy to a web archive. This approach is perfectly acceptable if the requirement is that the site is presented as an integral whole. However, ArchivePress is based upon the premise that blogs are a distinct class of web-based resource, in which the post, not the page, is atomic, and certain properties, such as layouts and colours, are demonstrably superfluous for many (if not most) users. As a result, an approach that builds on the functionality provided by web feeds to capture only selected aspects of the blog offers more potential. This is particularly the case when institutions wish to develop collections of aggregated blog content from a range of different sources. The presentation will describe our research to develop such an approach, including work to define the significant properties of blogs, details of the technical development, and pilot collections against which the tool has been tested.
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| Additional Information: | No further information |
| Uncontrolled Keywords: | blogs, blog archiving, web preservation |
| Subjects: | Digital Archives and Libraries > Projects Digital Archives and Libraries > Digital Preservation |
| Divisions: | Digital Archives |
| ID Code: | 101 |
| Deposited By: | Mr Richard Davis |
| Deposited On: | 06 Dec 2009 22:39 |
| Last Modified: | 21 Apr 2010 12:38 |
Fulltext Downloads
Repository Staff Only: item control page

