267 research outputs found

    Earth Science Data Analysis in the Era of Big Data

    Anyone with even a cursory interest in information technology cannot help but recognize that "Big Data" is one of the most fashionable catchphrases of late. From accurate voice and facial recognition, language translation, and airfare prediction and comparison, to monitoring the real-time spread of flu, Big Data techniques have been applied to many seemingly intractable problems with spectacular successes. They appear to be a rewarding way to approach many currently unsolved problems. Few fields of research can claim a longer history with problems involving voluminous data than Earth science. The problems we are facing today with our Earth's future are more complex and carry potentially graver consequences than the examples given above. How has our climate changed? Besides natural variations, what is causing these changes? What are the processes involved and through what mechanisms are these connected? How will they impact life as we know it? In attempts to answer these questions, we have resorted to observations and numerical simulations with ever-finer resolutions, which continue to feed the "data deluge." Plausibly, many Earth scientists are wondering: How will Big Data technologies benefit Earth science research? As an example from the global water cycle, one subdomain among many in Earth science, how would these technologies accelerate the analysis of decades of global precipitation to ascertain the changes in its characteristics, to validate these changes in predictive climate models, and to infer the implications of these changes to ecosystems, economies, and public health? Earth science researchers need a viable way to harness the power of Big Data technologies to analyze large volumes and varieties of data with velocity and veracity.
Beyond providing speedy data analysis capabilities, Big Data technologies can also play a crucial, albeit indirect, role in boosting scientific productivity by facilitating effective collaboration within an analysis environment. To illustrate the effects of combining a Big Data technology with an effective means of collaboration, we relate the (fictitious) experience of an early-career Earth science researcher a few years beyond the present, interlaced and contrasted with reminiscences of its recent past (i.e., the present).

    SpF: Enabling Petascale Performance for Pseudospectral Dynamo Models

    Pseudospectral (PS) methods possess a number of characteristics (e.g., efficiency, accuracy, natural boundary conditions) that are extremely desirable for dynamo models. Unfortunately, dynamo models based upon PS methods face a number of daunting challenges, which include exposing additional parallelism, leveraging hardware accelerators, exploiting hybrid parallelism, and improving the scalability of global memory transposes. Although these issues are a concern for most models, solutions for PS methods tend to require far more pervasive changes to underlying data and control structures. Further, improvements in performance in one model are difficult to transfer to other models, resulting in significant duplication of effort across the research community. We have developed an extensible software framework for pseudospectral methods called SpF that is intended to enable extreme scalability and optimal performance. High-level abstractions provided by SpF unburden applications of the responsibility of managing domain decomposition and load balance while reducing the changes in code required to adapt to new computing architectures. The key design concept in SpF is that each phase of the numerical calculation is partitioned into disjoint numerical kernels that can be performed entirely in-processor. The granularity of domain decomposition provided by SpF is only constrained by the data-locality requirements of these kernels. SpF builds on top of optimized vendor libraries for common numerical operations such as transforms, matrix solvers, etc., but can also be configured to use open source alternatives for portability.
SpF includes several alternative schemes for global data redistribution and is expected to serve as an ideal testbed for further research into optimal approaches for different network architectures. In this presentation, we will describe the basic architecture of SpF as well as preliminary performance data and experience with adapting legacy dynamo codes. We will conclude with a discussion of planned extensions to SpF that will provide pseudospectral applications with additional flexibility with regard to time integration, linear solvers, and discretization in the radial direction.

    PROcess Based Diagnostics PROBE

    Many of the aspects of the climate system that are of the greatest interest (e.g., the sensitivity of the system to external forcings) are emergent properties that arise via the complex interplay between disparate processes. This is also true for climate models: most diagnostics are not a function of an isolated portion of source code, but rather are affected by multiple components and procedures. Thus, any model-observation mismatch is hard to attribute to any specific piece of code or imperfection in a specific model assumption. An alternative approach is to identify diagnostics that are more closely tied to specific processes -- implying that if a mismatch is found, it should be much easier to identify and address specific algorithmic choices that will improve the simulation. However, this approach requires looking at model output and observational data in a more sophisticated way than the more traditional production of monthly or annual mean quantities. The data must instead be filtered in time and space for examples of the specific process being targeted. We are developing a data analysis environment called PROcess-Based Explorer (PROBE) that seeks to enable efficient and systematic computation of process-based diagnostics on very large sets of data. In this environment, investigators can define arbitrarily complex filters and then seamlessly perform computations in parallel on the filtered output from their model. The same analysis can be performed on additional related data sets (e.g., reanalyses), thereby enabling routine comparisons between model and observational data. PROBE also incorporates workflow technology to automatically update computed diagnostics for subsequent executions of a model. In this presentation, we will discuss the design and current status of PROBE as well as share results from some preliminary use cases.

    Exploring the Inner Edge of the Habitable Zone with Fully Coupled Oceans

    Rotation plays an important role in regulating atmospheric and oceanic heat flow, cloud formation, and precipitation in planetary atmospheres. Using the Goddard Institute for Space Studies (GISS) three-dimensional General Circulation Model (3D-GCM), we demonstrate how varying rotation rate and increasing the incident solar flux on a planet are related to each other and may allow the inner edge of the habitable zone to be much closer than many previous habitable zone studies have indicated. This is shown in particular for fully coupled ocean runs -- some of the first that have been utilized in this context. Results with a 100 m mixed layer depth and our fully coupled ocean runs are compared with those of Yang et al. 2014, which demonstrates consistency across models. However, there are clear differences for rotation rates of 1-16x present Earth day lengths between the mixed layer and fully coupled ocean models, which points to the necessity of using fully coupled oceans whenever possible. The latter was recently demonstrated quite clearly by Hu & Yang 2014 in their aquaworld study with a fully coupled ocean when compared with similar mixed layer ocean studies and by Cullum et al. 2014. Atmospheric constituent amounts were also varied alongside adjustments to cloud parameterizations (results not shown here). While the latter have an effect on what a planet's global mean temperature is once the oceans reach equilibrium, they do not qualitatively change the overall relationship between the globally averaged surface temperature and incident solar flux for rotation rates ranging from 1 to 256 times the present Earth day length. At the same time, this study demonstrates that, given the lack of knowledge about the atmospheric constituents and clouds on exoplanets, there is still a large uncertainty as to where a planet will sit in a given star's habitable zone.

    Mean flow instabilities of two-dimensional convection in strong magnetic fields

    The interaction of magnetic fields with convection is of great importance in astrophysics. Two well-known aspects of the interaction are the tendency of convection cells to become narrow in the perpendicular direction when the imposed field is strong, and the occurrence of streaming instabilities involving horizontal shears. Previous studies have found that the latter instability mechanism operates only when the cells are narrow, and so we investigate the occurrence of the streaming instability for large imposed fields, when the cells are naturally narrow near onset. The basic cellular solution can be treated in the asymptotic limit as a nonlinear eigenvalue problem. In the limit of large imposed field, the instability occurs for asymptotically small Prandtl number. The determination of the stability boundary turns out to be surprisingly complicated. At leading order, the linear stability problem is the linearisation of the same nonlinear eigenvalue problem, and as a result, it is necessary to go to higher order to obtain a stability criterion. We establish that the flow can only be unstable to a horizontal mean flow if the Prandtl number is smaller than order , where B0 is the imposed magnetic field, and that the mean flow is concentrated in a horizontal jet of width in the middle of the layer. The result applies to stress-free or no-slip boundary conditions at the top and bottom of the layer

    GEOS-5 Chemistry Transport Model User's Guide

    The Goddard Earth Observing System version 5 (GEOS-5) General Circulation Model (GCM) makes use of the Earth System Modeling Framework (ESMF) to enable model configurations with many functions. One of the options of the GEOS-5 GCM is the GEOS-5 Chemistry Transport Model (GEOS-5 CTM), which is an offline simulation of chemistry and constituent transport driven by a specified meteorology and other model output fields. This document describes the basic components of the GEOS-5 CTM and is a user's guide on how to obtain and run simulations on the NCCS Discover platform. In addition, we provide information on how to change the model configuration input files to meet users' needs.

    Ontogeny tends to recapitulate phylogeny in digital organisms

    Biologists have long debated whether ontogeny recapitulates phylogeny and, if so, why. Two plausible explanations are that (i) changes to early developmental stages are selected against because they tend to disrupt later development and (ii) simpler structures often precede more complex ones in both ontogeny and phylogeny if the former serve as building blocks for the latter. It is difficult to test these hypotheses experimentally in natural systems, so we used a computational system that exhibits evolutionary dynamics. We observed that ontogeny does indeed recapitulate phylogeny; traits that arose earlier in a lineage's history also tended to be expressed earlier in the development of individuals. The relative complexity of traits contributed substantially to this correlation, but a significant tendency toward recapitulation remained even after accounting for trait complexity. This additional effect provides evidence that selection against developmental disruption also contributed to the conservation of early stages in development