14 research outputs found

    Use of Schema on Read in Earth Science Data Archives

    Get PDF
    Traditionally, NASA Earth Science data archives have file-based storage using proprietary data file formats, such as HDF and HDF-EOS, which are optimized to support fast and efficient storage of spaceborne and model data as they are generated. The use of file-based storage essentially imposes an indexing strategy based on data dimensions. In most cases, NASA Earth Science data uses time as the primary index, leading to poor performance in accessing data in spatial dimensions. For example, producing a time series for a single spatial grid cell involves accessing a large number of data files. With exponential growth in data volume due to the ever-increasing spatial and temporal resolution of the data, using file-based archives poses significant performance and cost barriers to data discovery and access. Storing and disseminating data in proprietary data formats imposes an additional access barrier for users outside the mainstream research community. At the NASA Goddard Earth Sciences Data Information Services Center (GES DISC), we have evaluated applying the schema-on-read principle to data access and distribution. We used Apache Parquet to store geospatial data, and have exposed data through Amazon Web Services (AWS) Athena, AWS Simple Storage Service (S3), and Apache Spark. Using the schema-on-read approach allows customization of indexing spatially or temporally to suit the data access pattern. The storage of data in open formats such as Apache Parquet has widespread support in popular programming languages. A wide range of solutions for handling big data lowers the access barrier for all users. This presentation will discuss formats used for data storage, frameworks with This presentation will discuss formats used for data storage, frameworks with support for schema-on-read used for data access, and common use cases covering data usage patterns seen in a geospatial data archive

    Ring-Opening Copolymerization of Epoxides with Cyclic Anhydrides Promoted by Bimetallic and Monometallic Phenoxyâ\u80\u93Imine Aluminum complexes

    No full text
    Two dinuclear aluminum alkyl complexes bearing a salen ligand with alkyl backbone of different length between the nitrogen atoms (1,3-propylene and 1,5-pentylene backbone) and the related hemisalen aluminum complex were tested as catalysts in the ring-opening copolymerization of cyclohexene oxide and limonene oxide with succinic anhydride and phthalic anhydride. The effects of different reaction conditions (co-catalyst and solvent) on the productivity and selectivity of the reactions were evaluated. The comparison of the catalytic behavior of the dinuclear complexes bearing the aluminum reactive centers at different distances as well as of their mononuclear counterpart suggested that the copolymerization follows a monometallic pathway differently from what was observed in the homopolymerization of CHO
    corecore