1,421 research outputs found

Extracting Geospatial Entities from Wikipedia

Author: Jeremy Witmer
Jugal Kalita
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009

This paper addresses the challenge of extracting geospatial data from the article text of the English Wikipedia. In the first phase of our work, we create a training corpus and select a set of word-based features to train a Support Vector Machine (SVM) for the task of geospatial named entity recognition. We target for testing a corpus of Wikipedia articles about battles and wars, as these have a high incidence of geospatial content. The SVM recognizes place names in the corpus with a very high recall, close to 100%, with an acceptable precision. The set of geospatial NEs is then fed into a geocoding and resolution process, whose goal is to determine the correct coordinates for each place name. As many place names are ambiguous, and do not immediately geocode to a single location, we present a data structure and algorithm to resolve ambiguity based on sentence and article context, so the correct coordinates can be selected. We achieve an f-measure of 82%, and create a set of geospatial entities for each article, combining the place names, spatial locations, and an assumed point geometry. These entities can enable geospatial search on and geovisualization of Wikipedia.
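The abstract does not spell out the paper's data structure or algorithm, so the following is only a minimal sketch of one common way to resolve ambiguous place names from context: place names with a single geocoding candidate act as anchors, and each ambiguous name takes the candidate closest to those anchors. The function names `haversine_km` and `resolve`, the input layout, and the sample coordinates (approximate) are all assumptions made for illustration, and every name is assumed to have at least one candidate.

```python
from math import radians, sin, cos, asin, sqrt


def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def resolve(candidates):
    """Pick one coordinate per place name (hypothetical helper, not the paper's algorithm).

    candidates maps each place name to a non-empty list of (lat, lon) candidates.
    Names with exactly one candidate serve as context anchors; each ambiguous
    name takes the candidate with the smallest total distance to the anchors.
    """
    anchors = [options[0] for options in candidates.values() if len(options) == 1]
    resolved = {}
    for name, options in candidates.items():
        if len(options) == 1 or not anchors:
            # Unambiguous, or no context to compare against: keep the first candidate.
            resolved[name] = options[0]
        else:
            resolved[name] = min(
                options,
                key=lambda o: sum(haversine_km(o, a) for a in anchors),
            )
    return resolved


# Illustrative context: "Paris" is ambiguous, "Versailles" is not.
sentence_places = {
    "Paris": [(33.66, -95.56), (48.86, 2.35)],  # Paris, Texas vs. Paris, France
    "Versailles": [(48.80, 2.13)],
}
result = resolve(sentence_places)
```

Here `result["Paris"]` resolves to the French candidate because it lies far closer to the unambiguous anchor. A real system would likely weight sentence-level context more heavily than article-level context, which this sketch does not attempt.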

CiteSeerX

Crossref

A Survey of Volunteered Open Geo-Knowledge Bases in the Semantic Web

Author: A. Ballatore
A. Buccella
A. Burton-Jones
A. Gangemi
A. Gore
A. Gómez-Pérez
A. Polleres
A. Schwering
A. Turner
B. Smith
C. Bizer
C. Jones
C. Keßler
C. Keßler
C. Manning
C.B. Jones
D. Buscaldi
D. Coleman
D. Nadeau
D. Strasunskas
D. Sui
F. Baader
F. Fonseca
F. Giunchiglia
F. Giunchiglia
F. Harvey
F. Gey
F.J. Lopez-Pellicer
G. Bordogna
G. Fu
G. Tré De
G. Weikum
J. Giles
J. Goodwin
J. Howe
J. Leveling
K. Janowicz
K. Janowicz
L. Vaccari
L.L. Hill
M. Egenhofer
M. Goodchild
M. Goodchild
M. Grassi
M. Haklay
M. Haklay
M. Haklay
M. Kitsuregawa
M. Lutz
N. Choi
N. Guarino
N. Guarino
P. Burrough
P. Magnus
P. Roget
P. Singh
P.D. Smart
R. Fouad
R. Rada
S. Auer
S. Auer
S. Freitas
S. Hahmann
S. Overell
S. Schade
S. Staab
S. Vaid
S. Winter
T. Berners-Lee
T. Mandl
T. Mandl
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Over the past decade, rapid advances in web technologies, coupled with innovative models of spatial data collection and consumption, have generated robust growth in geo-referenced information, resulting in spatial information overload. Increasing 'geographic intelligence' in traditional text-based information retrieval has become a prominent approach to respond to this issue and to fulfill users' spatial information needs. Numerous efforts in the Semantic Geospatial Web, Volunteered Geographic Information (VGI), and the Linking Open Data initiative have converged in a constellation of open knowledge bases, freely available online. In this article, we survey these open knowledge bases, focusing on their geospatial dimension. Particular attention is devoted to the crucial issue of the quality of geo-knowledge bases, as well as of crowdsourced data. A new knowledge base, the OpenStreetMap Semantic Network, is outlined as our contribution to this area. Research directions in information integration and Geographic Information Retrieval (GIR) are then reviewed, with a critical discussion of their current limitations and future prospects.

arXiv.org e-Print Archive

Crossref

The DIGMAP geo-temporal web gazetteer service

Author: Borbinha José
Manguinhas H.
Martins Bruno
Siabato Vaca Willington Libardo
Publication venue: E.T.S.I. en Topografía, Geodesia y Cartografía (UPM)
Publication date: 01/01/2009

This paper presents the DIGMAP geo-temporal Web gazetteer service, a system providing access to names of places, historical periods, and associated geo-temporal information. Within the DIGMAP project, this gazetteer serves as the unified repository of geographic and temporal information, assisting in the recognition and disambiguation of geo-temporal expressions in text, as well as in resource searching and indexing. We describe the data integration methodology, the handling of temporal information, and some of the applications that use the gazetteer. Initial evaluation results show that the proposed system can adequately support several tasks related to geo-temporal information extraction and retrieval.

Archivo Digital UPM

Neogeography: The Challenge of Channelling Large and Ill-Behaved Data Streams

Author: Habib Mena B.
Keulen Maurice van
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2011

Neogeography is the combination of user-generated data and experiences with mapping technologies. In this article we present a research project to extract valuable structured information with a geographic component from unstructured user-generated text in wikis, forums, or SMS messages. The extracted information should be integrated to form collective knowledge about a certain domain. This structured information can then be used to help users from the same domain who want to get information through a simple question-answering system. The project intends to help worker communities in developing countries share their knowledge, providing a simple and cheap way to contribute and benefit using the available communication technology.

Maastricht University Research Portal

University of Twente Research Information

GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding

Author: Chen Muhao
Chiang Yao-Yi
Li Zekun
Zhou Wenxuan
Publication date: 22/10/2023

Humans subconsciously engage in geospatial reasoning when reading articles. We recognize place names and their spatial relations in text and mentally associate them with their physical locations on Earth. Although pretrained language models can mimic this cognitive process using linguistic context, they do not utilize valuable geospatial information in large, widely available geographical databases, e.g., OpenStreetMap. This paper introduces GeoLM, a geospatially grounded language model that enhances the understanding of geo-entities in natural language. GeoLM leverages geo-entity mentions as anchors to connect linguistic information in text corpora with geospatial information extracted from geographical databases. GeoLM connects the two types of context through contrastive learning and masked language modeling. It also incorporates a spatial coordinate embedding mechanism to encode distance and direction relations to capture geospatial context. In the experiment, we demonstrate that GeoLM exhibits promising capabilities in supporting toponym recognition, toponym linking, relation extraction, and geo-entity typing, which bridge the gap between natural language processing and geospatial sciences. The code is publicly available at https://github.com/knowledge-computing/geolm.

Comment: Accepted to EMNLP23 mai

arXiv.org e-Print Archive

A geo-temporal information extraction service for processing descriptive metadata in digital libraries

Author: Borbinha José
Manguinhas H.
Martins Bruno
Siabato Vaca Willington Libardo
Publication venue: E.T.S.I. en Topografía, Geodesia y Cartografía (UPM)
Publication date: 01/01/2009

In the context of digital map libraries, resources are usually described according to metadata records that define the relevant subject, location, time-span, format and keywords. Regarding locations and time-spans, metadata records are often incomplete or provide information in a way that is not machine-understandable (e.g. textual descriptions). This paper presents techniques for extracting geotemporal information from text, using relatively simple text mining methods that leverage a Web gazetteer service. The idea is to go from human-made geotemporal referencing (i.e. using place and period names in textual expressions) to geo-spatial coordinates and time-spans. A prototype system, implementing the proposed methods, is described in detail. Experimental results demonstrate the efficiency and accuracy of the proposed approaches.
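The step from place and period names to coordinates and time-spans can be pictured with a minimal sketch. It assumes a tiny in-memory lookup table standing in for the Web gazetteer service, which the abstract does not describe in detail; the names `PLACES`, `PERIODS`, and `extract_geotemporal`, the sample record, and the approximate coordinates are all hypothetical and are not the DIGMAP API.

```python
import re

# Hypothetical stand-in for a gazetteer service: names mapped to
# approximate (lat, lon) coordinates and to (start_year, end_year) spans.
PLACES = {
    "lisbon": (38.72, -9.14),
    "madrid": (40.42, -3.70),
}
PERIODS = {
    "18th century": (1701, 1800),
    "19th century": (1801, 1900),
}


def _mentions(text, names):
    """Return the entries of `names` whose key occurs in `text` as a whole word or phrase."""
    lowered = text.lower()
    return {
        name: value
        for name, value in names.items()
        if re.search(r"\b" + re.escape(name) + r"\b", lowered)
    }


def extract_geotemporal(text):
    """Map place and period names found in a metadata description to
    coordinates and time-spans (simple dictionary matching only)."""
    return _mentions(text, PLACES), _mentions(text, PERIODS)


# Illustrative metadata description, invented for this example.
record = "Map of Lisbon and surroundings, drawn in the 19th century."
places, periods = extract_geotemporal(record)
```

For this record, `places` contains only the Lisbon entry and `periods` only the 19th-century span. Plain dictionary matching ignores ambiguous names and spelling variants, which a real gazetteer-backed service would need to handle.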

Archivo Digital UPM

Geospatial Semantics

Author: Abadi
Abdalla
Abdalla
Adams
Adams
Adams
Adams
Agarwal
Agirre
Alameh
Alani
Allen
Amitay
Ashburner
Athanasis
Auer
Auer
Auer
Aumueller
Baeza-Yates
Baglioni
Ballatore
Ballatore
Battle
Battle
Bellini
Berners-Lee
Bishr
Bishr
Bishr
Bittner
Bizer
Bizer
Bizer
Blei
Bolstad
Bowers
Brauner
Brickley
Brodaric
Brosset
Buitelaar
Buitelaar
Bunescu
Burrough
Buscaldi
Buscaldi
Buscaldi
Carral
Chen
Clarke
Clarke
Cohn
Compton
Couclelis
Couclelis
Couclelis
Cresswell
Cresswell
Cruz
Cruz
Cucerzan
Davies
Di Donato
Droegemeier
Duckham
Egenhofer
Egenhofer
Egenhofer
Egenhofer
Erling
Ermilov
Euzenat
Fallahi
Feng
Fisher
Fonseca
Fonseca
Fonseca
Fonseca
Fonseca
Fox
Frank
Frank
Frank
Frank
Freksa
Freksa
Frontiera
Gangemi
Gangemi
Gangemi
Gao
Gao
Gates
Gelernter
Gelernter
Gelsey
Gey
Gibson
Golledge
Golledge
Goodchild
Goodchild
Goodchild
Goodchild
Goodwin
Grenon
Grothe
Gruber
Guarino
Guarino
Gutierrez
Gärdenfors
Hakimpour
Hart
Harvey
Hastings
Heath
Hess
Hill
Hitzler
Hollenstein
Hu
Hu
Hu
Hu
Hu
Hu
Hu
Hu
Intagorn
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Janowicz
Jones
Jones
Jones
Jones
Jones
Jones
Ju
Kennedy
Keßler
Keßler
Keßler
Keßler
Kim
Klippel
Klippel
Krisnadhi
Krisnadhi
Kuhn
Kuhn
Kuhn
Lambrix
Lecun
Lehmann
Leidner
Leidner
Lemmens
Li
Li
Li
Li
Ligozat
Lin
Longley
Lutz
Lutz
Maceachren
Maedche
Mai
Mallenby
Manning
Manning
Mark
Mark
Martins
Mata
Mata-Rivera
McCurley
Mckenzie
Mckenzie
Mckenzie
Mendes
Meyer
Miller
Moncla
Montello
Montello
Montello
Montello
Mostern
Navarrete
Nowak
Ouksel
Overell
Patroumpas
Perry
Perry
Prieto-Díaz
Pundt
Purves
Purves
Pérez
Randell
Raskin
Rattenbury
Renz
Rice
Rodríguez
Rodríguez
Rogers
Rosch
Rosch
Russell
Samal
Sanderson
Sankoff
Scheider
Schlieder
Schuurman
Schuurman
Sehgal
Sen
Shamsfard
Shankar
Shvaiko
Shyu
Silva
Sinha
Smith
Smith
Smith
Sorokine
Southall
Stevens
Stoeckl
Stokes
Sunna
Third
Tomai
Tomai
Tuan
Uryupina
Usery
Vasardani
Visser
Wallgrün
Wang
Wang
Wang
Wang
White
Wiegand
Winter
Winter
Worboys
Yang
Ye
Yue
Zhang
Zhao
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 10/08/2017

Geospatial semantics is a broad field that involves a variety of research areas. The term semantics refers to the meaning of things, and is in contrast with the term syntactics. Accordingly, studies on geospatial semantics usually focus on understanding the meaning of geographic entities as well as their counterparts in the cognitive and digital world, such as cognitive geographic concepts and digital gazetteers. Geospatial semantics can also facilitate the design of geographic information systems (GIS) by enhancing the interoperability of distributed systems and developing more intelligent interfaces for user interactions. During the past years, a lot of research has been conducted, approaching geospatial semantics from different perspectives, using a variety of methods, and targeting different problems. Meanwhile, the arrival of big geo data, especially the large amount of unstructured text data on the Web, and the fast development of natural language processing methods enable new research directions in geospatial semantics. This chapter, therefore, provides a systematic review of the existing geospatial semantic research. Six major research areas are identified and discussed, including semantic interoperability, digital gazetteers, geographic information retrieval, geospatial Semantic Web, place semantics, and cognitive geographic concepts.

Comment: Yingjie Hu (2017). Geospatial Semantics. In Bo Huang, Thomas J. Cova, and Ming-Hsiang Tsou et al. (Eds): Comprehensive Geographic Information Systems, Elsevier. Oxford, U

arXiv.org e-Print Archive

Crossref