5,004 research outputs found
The spatial component of R&D networks
We study the role of geography in R&D networks by means of a quantitative,
micro-geographic approach. Using a large database that covers international R&D
collaborations from 1984 to 2009, we localize each actor precisely in space
through its latitude and longitude. This allows us to analyze the R&D network
at all geographic scales simultaneously. Our empirical results show that
despite the high importance of the city level, transnational R&D collaborations
at large distances are much more frequent than expected from similar networks.
This provides evidence for the ambiguity of distance in economic cooperation
which is also suggested by the existing literature. In addition we test whether
the hypothesis of local buzz and global pipelines applies to the observed R&D
network by calculating well-defined metrics from network theory.Comment: Working paper, 22 pages, 7 figure
Spatial Big Data Analytics: Classification Techniques for Earth Observation Imagery
University of Minnesota Ph.D. dissertation. August 2016. Major: Computer Science. Advisor: Shashi Shekhar. 1 computer file (PDF); xi, 120 pages.Spatial Big Data (SBD), e.g., earth observation imagery, GPS trajectories, temporally detailed road networks, etc., refers to geo-referenced data whose volume, velocity, and variety exceed the capability of current spatial computing platforms. SBD has the potential to transform our society. Vehicle GPS trajectories together with engine measurement data provide a new way to recommend environmentally friendly routes. Satellite and airborne earth observation imagery plays a crucial role in hurricane tracking, crop yield prediction, and global water management. The potential value of earth observation data is so significant that the White House recently declared that full utilization of this data is one of the nation's highest priorities. However, SBD poses significant challenges to current big data analytics. In addition to its huge dataset size (NASA collects petabytes of earth images every year), SBD exhibits four unique properties related to the nature of spatial data that must be accounted for in any data analysis. First, SBD exhibits spatial autocorrelation effects. In other words, we cannot assume that nearby samples are statistically independent. Current analytics techniques that ignore spatial autocorrelation often perform poorly such as low prediction accuracy and salt-and-pepper noise (i.e., pixels predicted as different from neighbors by mistake). Second, spatial interactions are not isotropic and vary across directions. Third, spatial dependency exists in multiple spatial scales. Finally, spatial big data exhibits heterogeneity, i.e., identical feature values may correspond to distinct class labels in different regions. Thus, learned predictive models may perform poorly in many local regions. My thesis investigates novel SBD analytics techniques to address some of these challenges. To date, I have been mostly focusing on the challenges of spatial autocorrelation and anisotropy via developing novel spatial classification models such as spatial decision trees for raster SBD (e.g., earth observation imagery). To scale up the proposed models, I developed efficient learning algorithms via computational pruning. The proposed techniques have been applied to real world remote sensing imagery for wetland mapping. I also had developed spatial ensemble learning framework to address the challenge of spatial heterogeneity, particularly the class ambiguity issues in geographical classification, i.e., samples with the same feature values belong to different classes in different spatial zones. Evaluations on three real world remote sensing datasets confirmed that proposed spatial ensemble learning outperforms current approaches such as bagging, boosting, and mixture of experts when class ambiguity exists
Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and Challenges
With advancements in GPS, remote sensing, and computational simulation, an
enormous volume of spatiotemporal data is being collected at an increasing
speed from various application domains, spanning Earth sciences, agriculture,
smart cities, and public safety. Such emerging geospatial and spatiotemporal
big data, coupled with recent advances in deep learning technologies, foster
new opportunities to solve problems that have not been possible before. For
instance, remote sensing researchers can potentially train a foundation model
using Earth imagery big data for numerous land cover and land use modeling
tasks. Coastal modelers can train AI surrogates to speed up numerical
simulations. However, the distinctive characteristics of spatiotemporal big
data pose new challenges for deep learning technologies. This vision paper
introduces various types of spatiotemporal big data, discusses new research
opportunities in the realm of deep learning applied to spatiotemporal big data,
lists the unique challenges, and identifies several future research needs
Viewpoints on emergent semantics
Authors include:Philippe Cudr´e-Mauroux, and Karl Aberer (editors),
Alia I. Abdelmoty, Tiziana Catarci, Ernesto Damiani,
Arantxa Illaramendi, Robert Meersman,
Erich J. Neuhold, Christine Parent, Kai-Uwe Sattler,
Monica Scannapieco, Stefano Spaccapietra,
Peter Spyns, and Guy De Tr´eWe introduce a novel view on how to deal with the problems of semantic interoperability in distributed systems. This view is based on the concept of emergent semantics, which sees both the representation of semantics and the discovery of the proper interpretation of symbols as the result of a self-organizing process performed by distributed agents exchanging symbols and having utilities dependent on the proper interpretation of the symbols. This is a complex systems perspective on the problem of dealing with semantics. We highlight some of the distinctive features of our vision and point out preliminary examples of its applicatio
A Survey of Location Prediction on Twitter
Locations, e.g., countries, states, cities, and point-of-interests, are
central to news, emergency events, and people's daily lives. Automatic
identification of locations associated with or mentioned in documents has been
explored for decades. As one of the most popular online social network
platforms, Twitter has attracted a large number of users who send millions of
tweets on daily basis. Due to the world-wide coverage of its users and
real-time freshness of tweets, location prediction on Twitter has gained
significant attention in recent years. Research efforts are spent on dealing
with new challenges and opportunities brought by the noisy, short, and
context-rich nature of tweets. In this survey, we aim at offering an overall
picture of location prediction on Twitter. Specifically, we concentrate on the
prediction of user home locations, tweet locations, and mentioned locations. We
first define the three tasks and review the evaluation metrics. By summarizing
Twitter network, tweet content, and tweet context as potential inputs, we then
structurally highlight how the problems depend on these inputs. Each dependency
is illustrated by a comprehensive review of the corresponding strategies
adopted in state-of-the-art approaches. In addition, we also briefly review two
related problems, i.e., semantic location prediction and point-of-interest
recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur
- …