12,904 research outputs found
A modular methodology for converting large, complex books into usable, accessible and standards-compliant ebooks
This report describes the methodology used for ebook creation for the Glasgow Digital Library (GDL), and provides detailed instructions on how the same methodology could be used elsewhere. The document includes a description and explanation of the processes for ebook creation followed by a tutorial
The Value of User-Visible Internet Cryptography
Cryptographic mechanisms are used in a wide range of applications, including
email clients, web browsers, document and asset management systems, where
typical users are not cryptography experts. A number of empirical studies have
demonstrated that explicit, user-visible cryptographic mechanisms are not
widely used by non-expert users, and as a result arguments have been made that
cryptographic mechanisms need to be better hidden or embedded in end-user
processes and tools. Other mechanisms, such as HTTPS, have cryptography
built-in and only become visible to the user when a dialogue appears due to a
(potential) problem. This paper surveys deployed and potential technologies in
use, examines the social and legal context of broad classes of users, and from
there, assesses the value and issues for those users
Integrating the common variability language with multilanguage annotations for web engineering
Web applications development involves managing a high diversity of files and resources like code, pages or style sheets, implemented in different languages. To deal with the automatic generation of
custom-made configurations of web applications, industry usually adopts annotation-based approaches even though the majority of studies encourage the use of composition-based approaches to implement
Software Product Lines. Recent work tries to combine both approaches to get the complementary benefits. However, technological companies are reticent to adopt new development paradigms
such as feature-oriented programming or aspect-oriented programming.
Moreover, it is extremely difficult, or even impossible, to apply
these programming models to web applications, mainly because of
their multilingual nature, since their development involves multiple
types of source code (Java, Groovy, JavaScript), templates (HTML,
Markdown, XML), style sheet files (CSS and its variants, such as
SCSS), and other files (JSON, YML, shell scripts). We propose to
use the Common Variability Language as a composition-based approach
and integrate annotations to manage fine grained variability
of a Software Product Line for web applications. In this paper, we (i)
show that existing composition and annotation-based approaches,
including some well-known combinations, are not appropriate to
model and implement the variability of web applications; and (ii)
present a combined approach that effectively integrates annotations
into a composition-based approach for web applications. We implement
our approach and show its applicability with an industrial
real-world system.Universidad de MĂĄlaga. Campus de Excelencia Internacional AndalucĂa Tech
Scenarios and research issues for a network of information
This paper describes ideas and items of work within the
framework of the EU-funded 4WARD project. We present
scenarios where the current host-centric approach to infor-
mation storage and retrieval is ill-suited for and explain
how a new networking paradigm emerges, by adopting the
information-centric network architecture approach, which
we call Network of Information (NetInf). NetInf capital-
izes on a proposed identifier/locator split and allows users
to create, distribute, and retrieve information using a com-
mon infrastructure without tying data to particular hosts.
NetInf introduces the concepts of information and data ob-
jects. Data objects correspond to the particular bits and
bytes of a digital object, such as text file, a specific encod-
ing of a song or a video. Information objects can be used
to identify other objects irrespective of their particular dig-
ital representation. After discussing the benefits of such an
indirection, we consider the impact of NetInf with respect
to naming and governance in the Future Internet. Finally,
we provide an outlook on the research scope of NetInf along
with items for future work
Self-supervised automated wrapper generation for weblog data extraction
Data extraction from the web is notoriously hard. Of the types of resources available on the web, weblogs are becoming increasingly important due to the continued growth of the blogosphere, but remain poorly explored. Past approaches to data extraction from weblogs have often involved manual intervention and suffer from low scalability. This paper proposes a fully automated information extraction methodology based on the use of web feeds and processing of HTML. The approach includes a model for generating a wrapper that exploits web feeds for deriving a set of extraction rules automatically. Instead of performing a pairwise comparison between posts, the model matches the values of the web feeds against their corresponding HTML elements retrieved from multiple weblog posts. It adopts a probabilistic approach for deriving a set of rules and automating the process of wrapper generation. An evaluation of the model is conducted on a dataset of 2,393 posts and the results (92% accuracy) show that the proposed technique enables robust extraction of weblog properties and can be applied across the blogosphere for applications such as improved information retrieval and more robust web preservation initiatives
The SIMBAD astronomical database
Simbad is the reference database for identification and bibliography of
astronomical objects. It contains identifications, `basic data', bibliography,
and selected observational measurements for several million astronomical
objects. Simbad is developed and maintained by CDS, Strasbourg. Building the
database contents is achieved with the help of several contributing institutes.
Scanning the bibliography is the result of the collaboration of CDS with
bibliographers in Observatoire de Paris (DASGAL), Institut d'Astrophysique de
Paris, and Observatoire de Bordeaux. When selecting catalogues and tables for
inclusion, priority is given to optimal multi-wavelength coverage of the
database, and to support of research developments linked to large projects. In
parallel, the systematic scanning of the bibliography reflects the diversity
and general trends of astronomical research.
A WWW interface to Simbad is available at: http://simbad.u-strasbg.fr/SimbadComment: 14 pages, 5 Postscript figures; to be published in A&A
- âŠ