Search CORE

9,905 research outputs found

A modular methodology for converting large, complex books into usable, accessible and standards-compliant ebooks

Author: Dawson A.
McCulloch E.
Publication venue: Arts and Humanities Data Service
Publication date: 01/04/2006
Field of study

This report describes the methodology used for ebook creation for the Glasgow Digital Library (GDL), and provides detailed instructions on how the same methodology could be used elsewhere. The document includes a description and explanation of the processes for ebook creation followed by a tutorial

University of Strathclyde Institutional Repository

Managing and Sharing Data; a best practice guide for researchers

Author: Bishop Libby
Corti Louise
Horton Laurence
Van den Eynden Veerle
Woollard Matthew
Publication venue: UK Data Archive
Publication date: 01/01/2011
Field of study

University of Essex Research Repository

PDF-Malware Detection: A Survey and Taxonomy of Current Techniques

Author: Aniello L.
Baldoni R.
Elingiusti M.
Querzoni L.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Portable Document Format, more commonly known as PDF, has become, in the last 20 years, a standard for document exchange and dissemination due its portable nature and widespread adoption. The flexibility and power of this format are not only leveraged by benign users, but from hackers as well who have been working to exploit various types of vulnerabilities, overcome security restrictions, and then transform the PDF format in one among the leading malicious code spread vectors. Analyzing the content of malicious PDF files to extract the main features that characterize the malware identity and behavior, is a fundamental task for modern threat intelligence platforms that need to learn how to automatically identify new attacks. This paper surveys existing state of the art about systems for the detection of malicious PDF files and organizes them in a taxonomy that separately considers the used approaches and the data analyzed to detect the presence of malicious code. © Springer International Publishing AG, part of Springer Nature 2018

Crossref

Southampton (e-Prints Soton)

Archivio della ricerca- Università di Roma La Sapienza

Data DNA: The Next Generation of Statistical Metadata

Author: Cynthia M. Taeuber
Daniel W. Gillman
Laura Smith
Publication venue: 'Brookings Institution Press'
Publication date: 03/03/2007
Field of study

Describes the components of a complete statistical metadata system and suggests ways to create and structure metadata for better access and understanding of data sets by diverse users

IssueLab

Survey-based naming conventions for use in OBO Foundry ontology development

Author: Chris F.
Daniel Schober
E. Kusnierczyk
Lomax Waclaw
Mungall Jane
Philippe Rocca-Serra
Smith Barry
Susanna-Assunta Sansone
Suzanna Lewis
Taylor Chris
Publication venue
Publication date: 01/01/2009
Field of study

A wide variety of ontologies relevant to the biological and medical domains are available through the OBO Foundry portal, and their number is growing rapidly. Integration of these ontologies, while requiring considerable effort, is extremely desirable. However, heterogeneities in format and style pose serious obstacles to such integration. In particular, inconsistencies in naming conventions can impair the readability and navigability of ontology class hierarchies, and hinder their alignment and integration. While other sources of diversity are tremendously complex and challenging, agreeing a set of common naming conventions is an achievable goal, particularly if those conventions are based on lessons drawn from pooled practical experience and surveys of community opinion. We summarize a review of existing naming conventions and highlight certain disadvantages with respect to general applicability in the biological domain. We also present the results of a survey carried out to establish which naming conventions are currently employed by OBO Foundry ontologies and to determine what their special requirements regarding the naming of entities might be. Lastly, we propose an initial set of typographic, syntactic and semantic conventions for labelling classes in OBO Foundry ontologies. Adherence to common naming conventions is more than just a matter of aesthetics. Such conventions provide guidance to ontology creators, help developers avoid flaws and inaccuracies when editing, and especially when interlinking, ontologies. Common naming conventions will also assist consumers of ontologies to more readily understand what meanings were intended by the authors of ontologies used in annotating bodies of data

PhilPapers

Motivations and Experiences of UK Students Studying Abroad:Statistical Sources - Summary Metadata Report

Author: Findlay A.
Geddes A.
Smith F.
Publication venue: Department for Business Innovation and Skills
Publication date: 01/01/2010
Field of study

University of Dundee Online Publications

An integrated approach to preparing, publishing, presenting and preserving theses

Author: Sefton Peter
Publication venue
Publication date: 01/01/2007
Field of study

[Abstract]: This paper describes progress on a project funded by the Australian government to create Free software; the Integrated Content Environment for research and scholarship (ICE-RS). ICE-RS is a multi-faceted project which will add value to finished theses by making them available in both HTML and PDF, as well as providing a mechanism for packaging multimedia theses. The project will also concentrate on providing services for thesis production, with version control, automated backup and collaboration services. The paper begins with the established content management system that is the basis for the project, ICE-RS , originally developed to create courseware packages. ICE includes distributed, version controlled collaboration, using word processing software and works on multiple platforms, with standard document formats. We survey other approaches to content authoring and publishing for ETDs. We showcase exploratory work on integration of the thesis writing process with Institutional Repository software including publishing theses in both PDF and HTML with preservation and descriptive metadata. The presentation will include demonstrations of thesis production at all stages of development from proposal to completion. In a more speculative vein, we will discuss opportunities for institutions to provide new levels of support for candidates via automated thesis “dashboard” progress reports, supervisor and examiner annotation and comment and support for copyright considerations as early as possible in the process

CiteSeerX

University of Southern Queensland ePrints

Creating and customizing digital library collections with the Greenstone Librarian Interface

Author: Witten Ian H.
Publication venue: 'Institute of Mathematics, University of Tsukuba'
Publication date: 01/01/2004
Field of study

The Greenstone digital library software is a comprehensive system for building and distributing digital library collections. It provides a new way of organizing information and publishing it on the Internet. This paper describes how digital library collections can be created and customized with the new Greenstone Librarian Interface. Its basic features allow users to add documents and metadata to collections, create new collections whose structure mirrors existing ones, and build collections and put them in place so for users to view. More advanced users can design and customize new collection structures. At the most advanced level, the Librarian Interface gives expert users interactive access to the full power of Greenstone, which could formerly be tapped only by running Perl scripts manually

Research Commons@Waikato