Organizing the Internet

Author: Desmarais Norman
Publication venue: DigitalCommons@Providence
Publication date: 10/01/1999

This paper examines XML and its relationship to SGML (Standard Generalized Markup Language) and HTML (HyperText Markup Language). It examines the importance of metatags and of the XML Document Type Definition (DTD) and its proposed alternatives. It looks at the differences between the two types of XML documents: “valid” and “well-formed”.

DigitalCommons@Providence

HELIN Digital Commons

GATE -- an Environment to Support Research and Development in Natural Language Engineering

Author: Cunningham Hamish; Gaizauskas Robert; Humphreys Kevin; Rodgers Peter; Wilks Yorick
Publication venue: IEEE Computer Society
Publication date: 01/01/1996

We describe a software environment to support research and development in natural language (NL) engineering. This environment -- GATE (General Architecture for Text Engineering) -- aims to advance research in the area of machine processing of natural languages by providing a software infrastructure on top of which heterogeneous NL component modules may be evaluated and refined individually or combined into larger application systems. Thus, GATE aims to support both researchers and developers working on component technologies (e.g. parsing, tagging, morphological analysis) and those developing end-user applications (e.g. information extraction, text summarisation, document generation, machine translation, and second language learning). GATE will promote reuse of component technology, permit specialisation and collaboration in large-scale projects, and allow for the comparison and evaluation of alternative technologies. The first release of GATE is now available.

CiteSeerX

Kent Academic Repository

Multiple hierarchies: new aspects of an old solution

Author: Witt Andreas
Publication date: 01/01/2004

In this paper, we present the Multiple Annotation approach, which solves two problems: annotating overlapping structures, and annotating documents according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, alternative annotations can be modeled, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a basis for several applications.

CiteSeerX

Hochschulschriftenserver - Universität Frankfurt am Main

EDI - XML Standards and Technologies in the Agri-Food Industry

Author: Füzesi István; Herdon Miklós
Publication venue: Magyar Agrárinformatikai Szövetség
Publication date: 01/01/2008

Due to globalisation, new technological developments, and the complexity of food supply processes, the European food sector is becoming increasingly complex. Consumers’ trust in food, affected by a number of food crises, is low. Today, consumers increasingly expect safe and high-quality food and demand information about the origin of their food. The economic health of the food industry can also be greatly affected by food crises; therefore, efficient and effective mechanisms are required to assist the food industry in tracking and tracing products along the food chain. In this paper, we discuss the criteria for an efficient and effective traceability system from an IT perspective (mainly data exchange) and we identify key requirements for ICT-enabled traceability.

Repository of the Academy's Library

Development of Use Cases, Part I

Author: Bolzer Oliver; Bry François; Furche Tim; Kraus Sebastian; Schaffert Sebastian
Publication date: 06/03/2004

For determining the requirements and constructs appropriate for a Web query language, or in fact any language, use cases are essential. The W3C has published two sets of use cases for XML and RDF query languages. In this article, solutions for these use cases are presented using Xcerpt, a novel Web and Semantic Web query language that combines access to standard Web data such as XML documents with access to Semantic Web metadata such as RDF resource descriptions, together with reasoning abilities and rules familiar from logic programming. To the best knowledge of the authors, this is the first in-depth study of how to solve use cases for accessing XML and RDF in a single language. Integrated access to data and metadata has been recognized by industry and academia as one of the key challenges in data processing for the next decade. This article is a contribution towards addressing this challenge by demonstrating, through practical and recognized use cases, the usefulness of reasoning abilities, rules, and semistructured query languages for accessing both data (XML) and metadata (RDF).

Open Access LMU

Special Libraries, Spring 1995

Author: Special Libraries Association
Publication venue: SJSU ScholarWorks
Publication date: 01/04/1995

Volume 86, Issue 2

SJSU ScholarWorks

Recent development in XML-IR

Author: Alwan RF; Lu J; Rashid BT; Yip YJ
Publication venue: University of Huddersfield Press
Publication date: 01/11/2008

The Web is characterized by a huge number of heterogeneous data sources, which have different media support and format representation. Because XML can represent files of different formats, it can play an important role in IR, since it is becoming a standard form for data representation and exchange over the Web. Under this assumption, the problem of querying heterogeneous sources can be reduced to the problem of querying XML data sources. This paper shows the influence of XML on IR techniques and methodologies during the last five years by surveying over 400 papers published in different conferences and journals.

University of Salford Institutional Repository

University of Huddersfield Repository

Enhanced Reality Fieldwork: the Context-aware Archaeological Assistant

Author: Morse David R.; Pascoe Jason; Ryan Nick S.
Publication venue: Tempus Reparatum
Publication date: 01/10/1998

Kent Academic Repository