Search CORE

1,452 research outputs found

Geospatial Semantics

Author: Abadi
Abdalla
Abdalla
Adams
Adams
Adams
Adams
Agarwal
Agirre
Alameh
Alani
Allen
Amitay
Ashburner
Athanasis
Auer
Auer
Auer
Aumueller
Baeza-Yates
Baglioni
Ballatore
Ballatore
Battle
Battle
Bellini
Berners-Lee
Bishr
Bishr
Bishr
Bittner
Bizer
Bizer
Bizer
Blei
Bolstad
Bowers
Brauner
Brickley
Brodaric
Brosset
Buitelaar
Buitelaar
Bunescu
Burrough
Buscaldi
Buscaldi
Buscaldi
Carral
Chen
Clarke
Clarke
Cohn
Compton
Couclelis
Couclelis
Couclelis
Cresswell
Cresswell
Cruz
Cruz
Cucerzan
Davies
Di Donato
Droegemeier
Duckham
Egenhofer
Egenhofer
Egenhofer
Egenhofer
Erling
Ermilov
Euzenat
Fallahi
Feng
Fisher
Fonseca
Fonseca
Fonseca
Fonseca
Fonseca
Fox
Frank
Frank
Frank
Frank
Freksa
Freksa
Frontiera
Gangemi
Gangemi
Gangemi
Gao
Gao
Gates
Gelernter
Gelernter
Gelsey
Gey
Gibson
Golledge
Golledge
Goodchild
Goodchild
Goodchild
Goodchild
Goodwin
Grenon
Grothe
Gruber
Guarino
Guarino
Gutierrez
Gärdenfors
Hakimpour
Hart
Harvey
Hastings
Heath
Hess
Hill
Hitzler
Hollenstein
Hu
Hu
Hu
Hu
Hu
Hu
Hu
Hu
Intagorn
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Jones
Jones
Jones
Jones
Jones
Jones
Ju
Kennedy
Keßler
Keßler
Keßler
Keßler
Kim
Klippel
Klippel
Krisnadhi
Krisnadhi
Kuhn
Kuhn
Kuhn
Lambrix
Lecun
Lehmann
Leidner
Leidner
Lemmens
Li
Li
Li
Li
Ligozat
Lin
Longley
Lutz
Lutz
Maceachren
Maedche
Mai
Mallenby
Manning
Manning
Mark
Mark
Martins
Mata
Mata-Rivera
McCurley
Mckenzie
Mckenzie
Mckenzie
Mendes
Meyer
Miller
Moncla
MONTELLO
Montello
Montello
Montello
Mostern
Navarrete
Nowak
Ouksel
Overell
Patroumpas
Perry
Perry
Prieto-Díaz
Pundt
Purves
Purves
Pérez
Randell
Raskin
Rattenbury
Renz
Rice
Rodríguez
Rodríguez
Rogers
Rosch
Rosch
Russell
Samal
Sanderson
Sankoff
Scheider
Schlieder
Schuurman
Schuurman
Sehgal
Sen
Shamsfard
Shankar
Shvaiko
Shyu
Silva
Sinha
Smith
Smith
Smith
Sorokine
Southall
Stevens
Stoeckl
Stokes
Sunna
Third
Tomai
Tomai
Tuan
Uryupina
Usery
Vasardani
Visser
Wallgrün
Wang
Wang
Wang
Wang
White
Wiegand
Winter
Winter
Worboys
Yang
Ye
Yue
Zhang
Zhao
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 10/08/2017
Field of study

Geospatial semantics is a broad field that involves a variety of research areas. The term semantics refers to the meaning of things, and is in contrast with the term syntactics. Accordingly, studies on geospatial semantics usually focus on understanding the meaning of geographic entities as well as their counterparts in the cognitive and digital world, such as cognitive geographic concepts and digital gazetteers. Geospatial semantics can also facilitate the design of geographic information systems (GIS) by enhancing the interoperability of distributed systems and developing more intelligent interfaces for user interactions. During the past years, a lot of research has been conducted, approaching geospatial semantics from different perspectives, using a variety of methods, and targeting different problems. Meanwhile, the arrival of big geo data, especially the large amount of unstructured text data on the Web, and the fast development of natural language processing methods enable new research directions in geospatial semantics. This chapter, therefore, provides a systematic review on the existing geospatial semantic research. Six major research areas are identified and discussed, including semantic interoperability, digital gazetteers, geographic information retrieval, geospatial Semantic Web, place semantics, and cognitive geographic concepts.Comment: Yingjie Hu (2017). Geospatial Semantics. In Bo Huang, Thomas J. Cova, and Ming-Hsiang Tsou et al. (Eds): Comprehensive Geographic Information Systems, Elsevier. Oxford, U

arXiv.org e-Print Archive

Crossref

Knowledge Rich Natural Language Queries over Structured Biological Databases

Author: Chu W. W.
Goldsmith E. J.
InterProlog
Kossmann D.
Lawrence C.
Maio C. D.
Mir S.
Mou X.
Nandi A.
Novik L.
Safran M.
Swofford D. L.
Publication venue
Publication date: 30/03/2017
Field of study

Increasingly, keyword, natural language and NoSQL queries are being used for information retrieval from traditional as well as non-traditional databases such as web, document, image, GIS, legal, and health databases. While their popularity are undeniable for obvious reasons, their engineering is far from simple. In most part, semantics and intent preserving mapping of a well understood natural language query expressed over a structured database schema to a structured query language is still a difficult task, and research to tame the complexity is intense. In this paper, we propose a multi-level knowledge-based middleware to facilitate such mappings that separate the conceptual level from the physical level. We augment these multi-level abstractions with a concept reasoner and a query strategy engine to dynamically link arbitrary natural language querying to well defined structured queries. We demonstrate the feasibility of our approach by presenting a Datalog based prototype system, called BioSmart, that can compute responses to arbitrary natural language queries over arbitrary databases once a syntactic classification of the natural language query is made

arXiv.org e-Print Archive

Crossref

SeqHound: biological sequence and structure database as a platform for bioinformatics research

Author: Bader Gary D
Betel Doron
Dumontier Michel
Hogue Christopher WV
Isserlin Ruth
Lieu Hao
Michalickova Katerina
Publication venue: BioMed Central
Publication date: 01/01/2002
Field of study

BACKGROUND: SeqHound has been developed as an integrated biological sequence, taxonomy, annotation and 3-D structure database system. It provides a high-performance server platform for bioinformatics research in a locally-hosted environment. RESULTS: SeqHound is based on the National Center for Biotechnology Information data model and programming tools. It offers daily updated contents of all Entrez sequence databases in addition to 3-D structural data and information about sequence redundancies, sequence neighbours, taxonomy, complete genomes, functional annotation including Gene Ontology terms and literature links to PubMed. SeqHound is accessible via a web server through a Perl, C or C++ remote API or an optimized local API. It provides functionality necessary to retrieve specialized subsets of sequences, structures and structural domains. Sequences may be retrieved in FASTA, GenBank, ASN.1 and XML formats. Structures are available in ASN.1, XML and PDB formats. Emphasis has been placed on complete genomes, taxonomy, domain and functional annotation as well as 3-D structural functionality in the API, while fielded text indexing functionality remains under development. SeqHound also offers a streamlined WWW interface for simple web-user queries. CONCLUSIONS: The system has proven useful in several published bioinformatics projects such as the BIND database and offers a cost-effective infrastructure for research. SeqHound will continue to develop and be provided as a service of the Blueprint Initiative at the Samuel Lunenfeld Research Institute. The source code and examples are available under the terms of the GNU public license at the Sourceforge site http://sourceforge.net/projects/slritools/ in the SLRI Toolkit

University of Toronto Research Repository

Maastricht University Research Portal

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Detection of the inferred interaction network in hepatocellular carcinoma from EHCO (Encyclopedia of Hepatocellular Carcinoma genes Online)

Author: Chang Chao-Hui
Chen Chang-Han
Hsu Chun-Nan
Hsu Wen-Lian
Huang Chi-Ying F
Kao Cheng-Yan
Lai Jin-Mei
Lee DT
Lee Gen-Cher
Lee Sheng-An
Lin Chih-Yun
Lin Kuan-Ting
Liu Chia-Hung
Shiue Yow-Ling
Su Li-Jen
Sung Ting-Yi
Tseng Huei-Hun
Yeh Chang-Wei
Yeh Hsu-Hua
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: The significant advances in microarray and proteomics analyses have resulted in an exponential increase in potential new targets and have promised to shed light on the identification of disease markers and cellular pathways. We aim to collect and decipher the HCC-related genes at the systems level. RESULTS: Here, we build an integrative platform, the Encyclopedia of Hepatocellular Carcinoma genes Online, dubbed EHCO , to systematically collect, organize and compare the pileup of unsorted HCC-related studies by using natural language processing and softbots. Among the eight gene set collections, ranging across PubMed, SAGE, microarray, and proteomics data, there are 2,906 genes in total; however, more than 77% genes are only included once, suggesting that tremendous efforts need to be exerted to characterize the relationship between HCC and these genes. Of these HCC inventories, protein binding represents the largest proportion (~25%) from Gene Ontology analysis. In fact, many differentially expressed gene sets in EHCO could form interaction networks (e.g. HBV-associated HCC network) by using available human protein-protein interaction datasets. To further highlight the potential new targets in the inferred network from EHCO, we combine comparative genomics and interactomics approaches to analyze 120 evolutionary conserved and overexpressed genes in HCC. 47 out of 120 queries can form a highly interactive network with 18 queries serving as hubs. CONCLUSION: This architectural map may represent the first step toward the attempt to decipher the hepatocarcinogenesis at the systems level. Targeting hubs and/or disruption of the network formation might reveal novel strategy for HCC treatment

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

National Chung Hsing University Institutional Repository

National Health Research Institues

PubMed Central

National Taiwan University Repository

Recommended from our members

Grid-based semantic integration of heterogeneous data resources: Implementation on a HealthGrid

Author: Naseer Aisha
Publication venue: Brunel University, School of Information Systems, Computing and Mathematics
Publication date: 01/01/2007
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and was awarded by Brunel University.The semantic integration of geographically distributed and heterogeneous data resources still remains a key challenge in Grid infrastructures. Today's mainstream Grid technologies hold the promise to meet this challenge in a systematic manner, making data applications more scalable and manageable. The thesis conducts a thorough investigation of the problem, the state of the art, and the related technologies, and proposes an Architecture for Semantic Integration of Data Sources (ASIDS) addressing the semantic heterogeneity issue. It defines a simple mechanism for the interoperability of heterogeneous data sources in order to extract or discover information regardless of their different semantics. The constituent technologies of this architecture include Globus Toolkit (GT4) and OGSA-DAI (Open Grid Service Architecture Data Integration and Access) alongside other web services technologies such as XML (Extensive Markup Language). To show this, the ASIDS architecture was implemented and tested in a realistic setting by building an exemplar application prototype on a HealthGrid (pilot implementation). The study followed an empirical research methodology and was informed by extensive literature surveys and a critical analysis of the relevant technologies and their synergies. The two literature reviews, together with the analysis of the technology background, have provided a good overview of the current Grid and HealthGrid landscape, produced some valuable taxonomies, explored new paths by integrating technologies, and more importantly illuminated the problem and guided the research process towards a promising solution. Yet the primary contribution of this research is an approach that uses contemporary Grid technologies for integrating heterogeneous data resources that have semantically different. data fields (attributes). It has been practically demonstrated (using a prototype HealthGrid) that discovery in semantically integrated distributed data sources can be feasible by using mainstream Grid technologies, which have been shown to have some Significant advantages over non-Grid based approaches

Brunel University Research Archive

GeMInA, Genomic Metadata for Infectious Agents, a geospatial surveillance pathogen database

Author: Angiuoli Sam
Arze Cesar
Felix Victor
Ganapathy Anu
Ghedin Elodie
Gussman Aaron
Hall Neil
Mahurkar Anup
Nadendla Suvarna
Phillippy Katherine
Schriml Lynn M.
White Owen
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

The Gemina system (http://gemina.igs.umaryland.edu) identifies, standardizes and integrates the outbreak metadata for the breadth of NIAID category A–C viral and bacterial pathogens, thereby providing an investigative and surveillance tool describing the Who [Host], What [Disease, Symptom], When [Date], Where [Location] and How [Pathogen, Environmental Source, Reservoir, Transmission Method] for each pathogen. The Gemina database will provide a greater understanding of the interactions of viral and bacterial pathogens with their hosts and infectious diseases through in-depth literature text-mining, integrated outbreak metadata, outbreak surveillance tools, extensive ontology development, metadata curation and representative genomic sequence identification and standards development. The Gemina web interface provides metadata selection and retrieval of a pathogen's; Infection Systems (Pathogen, Host, Disease, Transmission Method and Anatomy) and Incidents (Location and Date) along with a hosts Age and Gender. The Gemina system provides an integrated investigative and geospatial surveillance system connecting pathogens, pathogen products and disease anchored on the taxonomic ID of the pathogen and host to identify the breadth of hosts and diseases known for these pathogens, to identify the extent of outbreak locations, and to identify unique genomic regions with the DNA Signature Insignia Detection Tool

CiteSeerX

PubMed Central

FragKB: Structural and Literature Annotation Resource of Conserved Peptide Fragments and Residues

Author: Alfonso Valencia
Ashish V. Tendulkar
Fabio Rapallo
Gonzalo López
Martin Krallinger
Pramod P. Wangikar
Victor de la Torre
Publication venue: Public Library of Science
Publication date: 18/03/2010
Field of study

BACKGROUND: FragKB (Fragment Knowledgebase) is a repository of clusters of structurally similar fragments from proteins. Fragments are annotated with information at the level of sequence, structure and function, integrating biological descriptions derived from multiple existing resources and text mining. METHODOLOGY: FragKB contains approximately 400,000 conserved fragments from 4,800 representative proteins from PDB. Literature annotations are extracted from more than 1,700 articles and are available for over 12,000 fragments. The underlying systematic annotation workflow of FragKB ensures efficient update and maintenance of this database. The information in FragKB can be accessed through a web interface that facilitates sequence and structural visualization of fragments together with known literature information on the consequences of specific residue mutations and functional annotations of proteins and fragment clusters. FragKB is accessible online at http://ubio.bioinfo.cnio.es/biotools/fragkb/. SIGNIFICANCE: The information presented in FragKB can be used for modeling protein structures, for designing novel proteins and for functional characterization of related fragments. The current release is focused on functional characterization of proteins through inspection of conservation of the fragments

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Data mining and fusion

Author: Addis M. J.
Choi F.
Taylor S. J.
Upstill C.
Watkins E. R.
Publication venue: s.n.
Publication date: 01/04/2006
Field of study

Southampton (e-Prints Soton)

Processing SPARQL queries with regular expressions in RDF databases

Author: Han WS
Hune cho
Jeong-Hoon Lee
Lee J
Lee J
Minh-Duc Pham
YU HWANJO
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background: As the Resource Description Framework (RDF) data model is widely used for modeling and sharing a lot of online bioinformatics resources such as Uniprot (dev.isb-sib.ch/projects/uniprot-rdf) or Bio2RDF (bio2rdf.org), SPARQL - a W3C recommendation query for RDF databases - has become an important query language for querying the bioinformatics knowledge bases. Moreover, due to the diversity of users' requests for extracting information from the RDF data as well as the lack of users' knowledge about the exact value of each fact in the RDF databases, it is desirable to use the SPARQL query with regular expression patterns for querying the RDF data. To the best of our knowledge, there is currently no work that efficiently supports regular expression processing in SPARQL over RDF databases. Most of the existing techniques for processing regular expressions are designed for querying a text corpus, or only for supporting the matching over the paths in an RDF graph. Results: In this paper, we propose a novel framework for supporting regular expression processing in SPARQL query. Our contributions can be summarized as follows. 1) We propose an efficient framework for processing SPARQL queries with regular expression patterns in RDF databases. 2) We propose a cost model in order to adapt the proposed framework in the existing query optimizers. 3) We build a prototype for the proposed framework in C++ and conduct extensive experiments demonstrating the efficiency and effectiveness of our technique. Conclusions: Experiments with a full-blown RDF engine show that our framework outperforms the existing ones by up to two orders of magnitude in processing SPARQL queries with regular expression patterns.X113sciescopu

포항공과대학교

Temporal Representation in Semantic Graphs

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref