A review of – E-Archiving:
An Overview of Some Repository Management Software Tools
The introduction of electronic journals (E-journals) has changed the traditional scholarly communication. The transformations are in ways of speed up the research consequences, providing the access of hyperlinks of citation, and heightening the text with a variety of illustrations (Cantara, 2003). In a recent article from ARIADNE, Prudlo addresses an overview of some most widely known repository management software tools including LOCKSS, EPrints, and DSpace respectively. His main point is to bring the attention of digital preservation to the library mainly, but also to the scholars and publisher as well. Furthermore, Prudlo mentions several questions in many perspectives, in terms of what the material and software to choose, the cost of the system, long-term preservation issue, and know-how requirement (2005). These questions are primarily brought to the librarian. In addition, Prudlo’s article has an implication of phenomena of digitalised documentations across the world. Carpenter (2005) states a trend that transforms the way of preservation from paper-based to digitalized format. He mentions some issues that
“Physical carriers of digital material deteriorate; digital data can become corrupted; the hardware that reads particular carriers wears out and cannot be replaced when it has become obsolete; file formats become obsolete in the course of software evolution, as backward compatibility is lost over a succession of versions; older versions of software, even when these are available, may not work on new hardware or operating systems. Valuable digital assets of institutions are at risk of loss, in the medium-term as well as long-term future.” (2005, ¶ 1)
This also supports Prudlo’s view on the importance of digital repository and capability of preservation in long-term perspective.
Libraries used to purchase the license of an electronic journal in order to obtain journals they required in print, and the license has its time restriction. However, with the help of ICTs, the access is provided via databases and e-journal subscriptions. In other words, the library owns the journal as long as it keeps the journal archived when subscribing to a print journal (Prudlo, 2005).
Nearly one hundred institutions use Lots of Copies Keep Stuff Safe (LOCKSS) in the United States and in Europe. LOCKSS is open use to any library around the world. The subscription of the journal allows the library to extend its possession of the journal as long as the original copy is well archived by the library (Prudlo, 2005). Even with the expiration of the license would not terminate the access to the content of the journal in the library. In fact, due to the nature of LOCKSS, slowly collection of electronic journals, the system keeps an original copy of the journal to ensure the consistency of the content. Meanwhile, since LOCKSS does not suitable for web sites that update changes frequently, archiving governmental published documents on the web is an alternative way instead. In fact, LOCKSS does not provide immediate update so that makes the cost cheaper and requires less time for installation the system (2005). Furthermore, the decentralised architecture design makes LOCKSS a safe system when the attack of the system happens. Prudlo addresses that approximately “six redundant copies of a title are required to safeguard a title’s long-term preservation (2005, ¶ 5).” In other words, LOCKSS attempts to keep as many copies as possible to minimize the system attack. However, when too much redundancy occurs, it is most likely to decrease the efficiency and the capability of the system operation. In addition, LOCKSS is not yet overcome the problem of format migration (2005). This means that LOCKSS is not able to provide the service of conversion to the current format that used for preservation in the new system.
Developed at the University of Southampton in the UK, EPrints is an open source package that complies with Open Archives Initiative (OAI)-compliant. OAI helps EPrints to be available to cross-archive searching so that a registered archive will be automatically “included in a global program of metadata harvesting and other added-value services run by academic and scientific institutions across the globe (Prudlo, 2005, ¶ 16).“ Meanwhile, there are more than one hundred and forty repositories that use the EPrints software internationally (2005). EPrints is created to collect, store and provide Web access to material and provides access to institutions. It is mainly used as a means to manage the form of books, posters, or conference papers in research (2005). The installation of EPrints is time-consuming and requires specific operating system such as Linux, Solaris, or MacOSX. Accordingly, EPrints enables the authors to submit self-publishing material by providing friendly user interface so that authors can upload the material in the way they preferred. In fact, uploaded materials are held in “buffer” first. The material submitted will be reviewed first then finally deposited based on the decision made by the administrator.
The DSpace is open source software developed by the join venture of the Massachusetts Institute of Technology Libraries and Hewlett-Packard. Over one hundred organizations worldwide use DSpace. Its initiative is to archive huge and various types of content, including “articles, datasets, images, audio files, video files, computer programs, and reformatted digital library collections (Prudlo, 2005, ¶24).” DSpace uses UNIX-type operating system such as LINUX or Solaris, and an experienced system administrator is required to install and configure the system (Prudlo, 2005). DSpace software has much strength. First, it uses customized workflow process to describe digital works. Second, digital works of an institution are distributed onto the Web; thus, it enables users’ accessibility of searching and retrieving materials. Third, the preserved works in form of digital are stored in long-term aspect.
There are many advantages for each of preservation software tools and disadvantages come along as well. The library needs to consider which one is best suitable with some restrictions such as annual budget, size of the library, area of collection, and so on. In long-term perspective, EPrints and DSpace are both fulfill the requirement via providing the access on the Web (Prudlo, 2005). Although E-archiving is still a brand new concept, there are already many repository software management tools on their way. Nonetheless some people are in doubt on the future of digital works, it is better to prepare for it than it’s too late, especially for FIS students to be aware of this issue.
Reference lists –
Cantara, L. (2003). Archiving electronic journals. Research founded by the Andrew W. Mellon Foundation. Retrieved October 25, 2005, from http://www.diglib.org/preserve/introduction.html
Carpenter, L. (2005). Supporting digital preservation and asset management in institutions. Ariadne. 43. Retrieved October 24, 2005, from http://www.ariadne.ac.uk/issue43/carpenter/
EPrints Web site http://eprints.org/
DSpace Web site http://www.dspace.org/
LOCKSS Web site http://lockss.stanford.edu/
Prudlo, M. (2005). E-Archiving: An overview of some repository management software tools. Ariadne. 43. Retrieved October 24, 2005, from http://www.ariadne.ac.uk/issue43/prudlo/
