Developing services for open eprint archives范文[英语论文]

资料分类免费英语论文 责任编辑:王教授更新时间:2017-04-25
提示:本资料为网络收集免费论文,存在不完整性。建议下载本站其它完整的收费论文。使用可通过查重系统的论文,才是您毕业的保障。
The rapid growth of scholarly information resources available in electronic form and their organisation by digital libraries is proving fertile ground for the development of sophisticated new services, of which citation linking will be one indispensable example. Many new projects, partnerships and commercial agreements have been announced to build citation linking applications. This describes the Open Citation (OpCit) project, which will focus on linking s held in freely accessible eprint archives such as the Los Alamos physics archives and the other distributed archives, and which will build on the work of the Open Archives initiative to make the data held in such archives available to compliant services. The emphasises the work of the project in the context of emerging digital library information environments, explores how a range of new linking tools might be combined and identifies ways in which different linking applications might converge. Some early results of linked pages from the OpCit project are ed.
Keywords: electronic publishing, digital library information architectures, reference linking, distributed collections, eprint archives, Open Archives

Introduction
The process of scholarly communication, in particular the aggregation of formal academic s in journals, is probably about to enter its fastest period of change since 1994-6. The immediate platform for this change was the emergence of the World Wide Web as a popular medium in 1994 and the subsequent conversion of most established journals to electronic facsimiles delivered over the Web, a process which began to build momentum in 1996 (Hitchcock et al. 1996). While growth in the number of e-journals continues to accelerate towards an estimated 10 000 (Maclennan 1999, Hunter 1999), it is prior developments, such as the establishment of free electronic archives, or eprint archives, that are beginning to influence wider changes.

Launched in 1991, the importance of the Los Alamos physics eprint archive, the first and preeminent archive of its kind (Ginsparg 1994), cannot be underestimated, but its practical ramifications have so far been largely confined to its home community in physics. Many have questioned, because of cultural differences between different academic disciplines (Kling and McKim 1999), whether the eprint model will be accepted beyond physics. That contention will be challenged by the most significant new eprint archives to have emerged since 1991: PubMed Central, launched at the beginning of this year and sponsored by the National Institutes of Health, covering all fields in biomedical and life sciences (Varmus 1999); and the Computing Research Repository (CoRR), sponsored by the ACM and the British Computer Society (Halpern and Lagoze 1998). An immediate effect of PubMedCentral has been the announcement by some biomedical publishers, notably the British Medical Journal (Delamothe and Smith 1999) and the Current Science Group (Anon. 1999), of freely accessible archives of electronic copies of current and past s, both pre- and post-publication. As well as discipline-based archives, institutionally-based initiatives such as Scholars Forum are planned. (Buck et al. 1999)

The essential feature of the Los Alamos eprint archive model is author self-archiving based on a free-to-archive, free-to-access service. Compared with journals, eprint archives provide an almost wholly automated and highly efficient organisational framework and distribution mechanism based on the Internet, but without many of the additional services that journals provide, such as peer review, and other services that mostly require human intervention. As more archives attract more s the test for journals is how they respond to the effective loss of exclusivity that most depend on, and how they cope with the new economics of journal publishing in which every facet of traditional value-adding is re-evaluated against the for-free services.

There is another dimension too: globalisation. With no geographical or financial barriers the next inevitable step is, if not universalisation where archives all adopt the same technical infrastructure, then integration of the archives based on new services. In this agenda archives are not only freely accessible to users but are  open to independent, third-party computational processes - for an example, see the UPS prototype (Van de Sompel et al. 2017) - on which these services will be built. This approach has been formalised by the Open Archives initiative, a group that includes archive managers, potential service developers and academic librarians, in its Santa Fe Convention (Van de Sompel and Lagoze 2017).

The publishing industry anticipates that links on citations within scholarly s will be one of the primary new services driving integration between scholarly sources (Needleman et al. 1999). Linking services cannot be implemented piecemeal on such a scale. A number of journal publishers will collaborate through CrossRef  to use a Digital Object Identifier (Davidson and Douglas 1998) based system to form reference links between their, separately maintained, journal contents (Atkins et al. 2017). Most traditional journals service providers - aggregators, subscription agents and secondary publishers (e.g. Brunelle and Johnson 1999) - as well as new producers, highlight the important role of citation links in their services.

This considers the implications of the new wave of eprint archives and the development of open archives. It focusses on an Open Archives service being developed by the Open Citation (OpCit) project, funded by the joint NSF-JISC international digital libraries programme. The project will build citation links uniting large, high-profile and distributed archives but the service is planned to be extensible to other services that provide access to refereed scholarly s. 

Open archives: separating content from services
Eprint archives are noticeably entering a new phase. Not only have significant new archives launched, but new services are being developed to complement the traditional content management functions of the archives. The key feature is that many of these new services will be independent of the underlying archive contents, and there are good reasons for this. Growth in the number of archives and greater use create opportunities for third-party developers, and such developments thus do not add to the low-cost base of most archives and can be developed either commercially or non-commercially.

More importantly, as the use of archives crosses boundaries between disciplines, the role of new services will be to enable the user to view an integrated set, or selected subset, of all archives. User customisation features to set the scope of a service are likely to become increasingly common. For archives predicated on free and open access, capitalising on this feature means not constraining users within the boundaries of a single discipline or archive. Cross-disciplinary navigation support such as indexing, searching and linking should provide a consistent interface and seamless service regardless of which archives are accessed by the user, who need have no knowledge of the structure of an archive or, apart from perhaps noting the archive source and publisher identities, from where a viewed document originated.

Integration of free eprint archives through independent services is the preferred view of the Open Archives initiative of the way in which such archives may pave the way for a unified global scholarly literature, encompassing not just the archives but journals and other contributing literatures.

This level of service will be achieved by interoperability agreements between the archives, for example the protocols through which services can communicate with the archives, and metadata forms that will expose the data structures and the contents of archives. Methods chosen must encourage widespread adoption, but also ensure conformity of safe and trusted practices that do not compromise the integrity of the archives.

A well established protocol for data communication in digital library applications is Dienst, developed at Cornell University (Dienst 1999). An implementation of part of the Dienst protocol has been adopted by the Open Archives initiative for data harvesting. The OpCit project, in which the Dienst research group is one of the principal partners, is similarly building on this core technology for its linking applications.

Linking the open archives: the OpCit project
Linking is an apparently simple concept, especially the model implemented by the Web. A unidirectional point-and-click event presents the user with the page from the location pointed at by a locator, the URL, that is authored into the linking page. This simplified form of hypertext linking, it has long been argued, is inadequate to support robust services required for large-scale information environments, such as the contents of scholarly communications organised via digital libraries.

In terms of reference linking services, an early pioneer and predecessor of the OpCit project was the Open Journal project (Hitchcock et al. 1998a). The information environment envisaged in that project centred on published journal contents but was essentially unbounded, allowing links to take users to other types of materials, such as abstracting and indexing services, dictionaries and biological databases. In practice different implementations of Open Journals had to be bounded, principally to manage the user interface more effectively.

Originally supported by four publishers, by its end 12 publishers were involved. It is safe to assume that many more scholarly publishers now recognise the importance and power of links in electronic documents, especially links which implement the long-established, non-electronic form of linking inherent to the scholarly or scientific , the reference.

With this wider participation has come the recognition, as Caplan and Arms (1999) show, that reference linking may not be as simple as originally envisaged. There are a number of reasons for this. Expectations are high, of links from almost every reference in every electronically-accessible , both backwards in time as provided by a typical reference list in a , and forwards in time in a manner made familiar by the Science Citation Indexes (Hitchcock et al. 1998b). Constraining these expectations are the need fo accuracy and reliability of links, and availability of the necessary contents in electronic form (Hitchcock et al. 1998c). There may be multiple, but not identical, versions of the same document at multiple locations (Caplan and Flecker 1999). Finally, there are the financial and authorisation barriers imposed by commercial journals and services. Access requirements can differ for each user, for each access location, for each document and for each document location. Competing publishers may have become allies in cross-publisher reference linking, but although there are various possible solutions to the problem of linking across distributed collections, there is as yet no convincing demonstration or detail of how this might be achieved in an exclusively commercial environment (Atkins 1999).

In this context, linking is more than just a technical process but must be viewed as part of the social and business phenomena that are shaping the new information environments. This is recognised in the scope and partnerships that form the OpCit project. Ironically, in the first environments the project will explore, selected open archives, such is the accessibility of so much content that the project could almost reduce the problem to its technical issues, but the longer-term commitment is to collaborating with others in the wider environments discussed below.

网站原创范文除特殊说明外一切图文作品权归所有;未经官方授权谢绝任何用途转载或刊发于媒体。如发生侵犯作品权现象,英语论文题目,保留一切法学追诉权。()
更多范文欢迎访问我们主页 当然有需求可以和我们 联系交流。-X()英语论文范文
免费论文题目: