191 research outputs found
End-to-end eScience: integrating workflow, query, visualization, and provenance at an ocean observatory
Journal ArticleData analysis tasks at an Ocean Observatory require integrative and and domain-specialized use of database, workflow, visualization systems. We describe a platform to support these tasks developed as part of the cyberinfrastructure at the NSF Science and Technology Center for Coastal Margin Observation and Prediction integrating a provenance-aware workflow system, 3D visualization, and a remote query engine for large-scale ocean circulation models. We show how these disparate tools complement each other and give examples of real scientific insights delivered by the integrated system. We conclude that data management solutions for eScience require this kind of holistic, integrative approach, explain how our approach may be generalized, and recommend a broader, application-oriented research agenda to explore relevant architectures
Urban Spatiotemporal Data Synthesis via Neural Disaggregation
The level of granularity of open data often conflicts the benefits it can
provide. Less granular data can protect individual privacy, but to certain
degrees, sabotage the promise of open data to promote transparency and assist
research. Similar in the urban setting, aggregated urban data at high-level
geographic units can mask out the underline particularities of city dynamics
that may vary at lower areal levels. In this work, we aim to synthesize
fine-grained, high resolution urban data, by breaking down aggregated urban
data at coarse, low resolution geographic units. The goal is to increase the
usability and realize the values as much as possible of highly aggregated urban
data. To address the issue of simplicity of some traditional disaggregation
methods -- 1) we experimented with numerous neural-based models that are
capable of modeling intricate non-linear relationships among features. Neural
methods can also leverage both spatial and temporal information concurrently.
We showed that all neural methods perform better than traditional
disaggregation methods. Incorporating the temporal information further enhances
the results. 2) We proposed a training approach for disaggregation task,
Chain-of-Training (COT), that can be incorporated into any of the
training-based models. COT adds transitional disaggregation steps by
incorporating intermediate geographic dimensions, which enhances the
predictions at low geographic level and boosts the results at higher levels. 3)
We adapted the idea of reconstruction (REC) from super-resolution domain in our
disaggregation case -- after disaggregating from low to high geographic level,
we then re-aggregate back to the low level from our generated high level
values. Both strategies improved disaggregation results on three datasets and
two cities we tested on
- …