Multimodal integration of disparate information sources with attribution

Author
Publication venue: Sloan School of Management, Massachusetts Institute of Technology
Publication date: 01/01/1997

Cover title. Includes bibliographical references (p. [9]-[10]). Thomas Y. Lee & Stephane Bressan

DSpace@MIT

Survey over Existing Query and Transformation Languages

Author: Bolzer Oliver
Bry François
Furche Tim
Horrocks Ian
Kraus Michael
Orsini Renzo
Schaffert Sebastian
Publication date: 01/01/2004

A widely acknowledged obstacle for realizing the vision of the Semantic Web is the inability of many current Semantic Web approaches to cope with data available in such diverging representation formalisms as XML, RDF, or Topic Maps. A common query language is the first step to allow transparent access to data in any of these formats. To further the understanding of the requirements and approaches proposed for query languages in the conventional as well as the Semantic Web, this report surveys a large number of query languages for accessing XML, RDF, or Topic Maps. This is the first systematic survey to consider query languages from all these areas. From the detailed survey of these query languages, a common classification scheme is derived that is useful for understanding and differentiating languages within and among all three areas.

CiteSeerX

Open Access LMU

Linked Data - the story so far

Author: Berners-Lee Tim
Bizer Christian
Heath Tom
Publication venue: 'IGI Global'
Publication date: 01/01/2009

The term “Linked Data” refers to a set of best practices for publishing and connecting structured data on the Web. These best practices have been adopted by an increasing number of data providers over the last three years, leading to the creation of a global data space containing billions of assertions: the Web of Data. In this article, the authors present the concept and technical principles of Linked Data, and situate these within the broader context of related technological developments. They describe progress to date in publishing Linked Data on the Web, review applications that have been developed to exploit the Web of Data, and map out a research agenda for the Linked Data community as it moves forward.

Southampton (e-Prints Soton)

MAnnheim DOCument Server

Benchmarking database systems for Genomic Selection implementation

Author: Guignon Valentin
Jones Elizabeth
Larmande Pierre
Matthews Dave
Nti-Addae Yaw
Petel Adrien
Renner Jon
Robbins Kelly
Sempere Guilhem
Syed Raza
Ulat Victor Jun
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019

Motivation: With high-throughput genotyping systems now available, it has become feasible to fully integrate genotyping information into breeding programs. To make use of this information effectively requires DNA extraction facilities and marker production facilities that can efficiently deploy the desired set of markers across samples with a rapid turnaround time that allows for selection before crosses need to be made. In reality, breeders often have a short window of time to make decisions by the time they are able to collect all their phenotyping data and receive corresponding genotyping data. This presents a challenge to organize information and utilize it in downstream analyses to support decisions made by breeders. In order to implement genomic selection routinely as part of breeding programs, one would need an efficient genotyping data storage system. We selected and benchmarked six popular open-source data storage systems, including relational database management and columnar storage systems. Results: We found that data extract times are greatly influenced by the orientation in which genotype data is stored in a system. HDF5 consistently performed best, in part because it can more efficiently work with both orientations of the allele matrix.

Agritrop

CGSpace

Horizon / Pleins textes

The DIGMAP geo-temporal web gazetteer service

Author: Borbinha José
Manguinhas H.
Martins Bruno
Siabato Vaca Willington Libardo
Publication venue: E.T.S.I. en Topografía, Geodesia y Cartografía (UPM)
Publication date: 01/01/2009

This paper presents the DIGMAP geo-temporal Web gazetteer service, a system providing access to names of places, historical periods, and associated geo-temporal information. Within the DIGMAP project, this gazetteer serves as the unified repository of geographic and temporal information, assisting in the recognition and disambiguation of geo-temporal expressions over text, as well as in resource searching and indexing. We describe the data integration methodology, the handling of temporal information and some of the applications that use the gazetteer. Initial evaluation results show that the proposed system can adequately support several tasks related to geo-temporal information extraction and retrieval.

Archivo Digital UPM

Improving Annotation Process and Increase the Performance of Tag Data

Author: A. Harikrishna
Mr. K. Bhaskarnaik
Publication venue: Global Journals Inc. (US)
Publication date: 21/02/2015

Nowadays many organizations create and share textual descriptions of their products, services, and actions. Such collections contain a significant amount of structured data that remains buried in the unstructured text. Information extraction algorithms can facilitate the extraction of structured relations, but they are costly and often inaccurate, particularly when working on top of text that does not contain the structured information. Another approach is to generate the structured metadata by identifying documents that are likely to contain information of interest, information that will later be valuable for querying. This approach is based on the idea that humans are more likely to add the necessary metadata at creation time. The process builds on the Collaborative Adaptive Data Sharing (CADS) platform approach, which improves the visibility of documents with respect to the query workload by up to 50 percent. A further probing algorithm using a Bayesian approach was therefore included, which can improve the visibility of documents or data with respect to the query and content workload by more than 50 percent.

Global Journal of Computer Science and Technology (GJCST)

Modeling information systems from the viewpoint of active documents

Author: András Benczúr
Bálint Molnár
Publication venue: Springer Nature
Publication date: 01/01/2015

Crossref

Springer - Publisher Connector

ELTE Digital Institutional Repository (EDIT)

The advent of a new lexicographical Portuguese project

Author: Almeida Bruno
Carvalho Sara
Costa Rute
Khan Anas
Khemakhem Mohamed
Ramos Margarida
Romary Laurent
Salgado Ana de Castro
Silva Raquel
Tasovac Toma
Publication date: 01/07/2021

UID/LIN/03213/2013. MORDigital is a newly funded Portuguese lexicographic project that aims to produce high-quality and searchable digital versions of the first three editions (1789; 1813; 1823) of the Diccionario da Lingua Portugueza by António de Morais Silva, preserving and making accessible this important work of European heritage. This paper will describe the current state of the art, the project, its objectives and the methodology proposed, the latter of which is based on a rigorous linguistic analysis and will also include steps necessary for the ontologisation of knowledge contained in and relating to the text. A section will be dedicated to the various investigation domains of the project description. The output of the project will be made available via a dedicated platform.

Repositório da Universidade Nova de Lisboa