49 research outputs found

    Retrieval of Melt Ponds on Arctic Multiyear Sea Ice in Summer from TerraSAR-X Dual-Polarization Data Using Machine Learning Approaches: A Case Study in the Chukchi Sea with Mid-Incidence Angle Data

    Get PDF
    Melt ponds, a common feature on Arctic sea ice, absorb most of the incoming solar radiation and have a large effect on the melting rate of sea ice, which significantly influences climate change. Therefore, it is very important to monitor melt ponds in order to better understand the sea ice-climate interaction. In this study, melt pond retrieval models were developed using the TerraSAR-X dual-polarization synthetic aperture radar (SAR) data with mid-incidence angle obtained in a summer multiyear sea ice area in the Chukchi Sea, the Western Arctic, based on two rule-based machine learning approachesdecision trees (DT) and random forest (RF)in order to derive melt pond statistics at high spatial resolution and to identify key polarimetric parameters for melt pond detection. Melt ponds, sea ice and open water were delineated from the airborne SAR images (0.3-m resolution), which were used as a reference dataset. A total of eight polarimetric parameters (HH and VV backscattering coefficients, co-polarization ratio, co-polarization phase difference, co-polarization correlation coefficient, alpha angle, entropy and anisotropy) were derived from the TerraSAR-X dual-polarization data and then used as input variables for the machine learning models. The DT and RF models could not effectively discriminate melt ponds from open water when using only the polarimetric parameters. This is because melt ponds showed similar polarimetric signatures to open water. The average and standard deviation of the polarimetric parameters based on a 15 x 15 pixel window were supplemented to the input variables in order to consider the difference between the spatial texture of melt ponds and open water. Both the DT and RF models using the polarimetric parameters and their texture features produced improved performance for the retrieval of melt ponds, and RF was superior to DT. The HH backscattering coefficient was identified as the variable contributing the most, and its spatial standard deviation was the next most contributing one to the classification of open water, sea ice and melt ponds in the RF model. The average of the co-polarization phase difference and the alpha angle in a mid-incidence angle were also identified as the important variables in the RF model. The melt pond fraction and sea ice concentration retrieved from the RF-derived melt pond map showed root mean square deviations of 2.4% and 4.9%, respectively, compared to those from the reference melt pond maps. This indicates that there is potential to accurately monitor melt ponds on multiyear sea ice in the summer season at a local scale using high-resolution dual-polarization SAR data.open

    Subcutaneously implantable electromagnetic biosensor system for continuous glucose monitoring

    Get PDF
    Continuous glucose monitoring systems (CGMS) are becoming increasingly popular in diabetes management compared to conventional methods of self-blood glucose monitoring systems. They help understanding physiological responses towards nutrition intake, physical activities in everyday life and glucose control. CGMS available in market are of two types based on their working principle. Needle type systems with few weeks lifespan (e.g., enzyme-based Freestyle Libre) and implant type system (e.g., fluorescence-based Senseonics) with few months of lifespan are commercially available. An alternate to both working methods, herein, we propose electromagnetic-based sensor that can be subcutaneously implanted and capable of tracking minute changes in dielectric permittivity owing to changes in blood glucose level (BGL). Proof-of-concept of proposed electromagnetic-based implant sensor has been validated in intravenous glucose tolerance test (IVGTT) conducted on swine and beagle in a controlled environment. Sensor interface modules, mobile applications, and glucose mapping algorithms are also developed for continuous measurement in a freely moving beagle during oral glucose tolerance test (OGTT). The results of the short-term (1 h, IVGTT) and long-term (52 h, OGTT) test are summarized in this work. A close trend is observed between sensor frequency and BGL during GTT experiments on both animal species

    ICESat-2 ????????? ????????? ????????? ????????? ?????? ????????? ??????

    Get PDF
    As the Arctic melt ponds play an important role in determining the interannual variation of the sea ice extent and changes in the Arctic environment, it is crucial to monitor the Arctic melt ponds with high accuracy. Ice, Cloud, and Land Elevation Satellite-2 (ICESat-2), which is the NASA's latest altimeter satellite based on the green laser (532 nm), observes the global surface elevation. When compared to the CryoSat-2 altimetry satellite whose along-track resolution is 250 m, ICESat-2 is highly expected to provide much more detailed information about Arctic melt ponds thanks to its high along-track resolution of 70 cm. The basic products of ICESat-2 are the surface height and the number of reflected photons. To aggregate the neighboring information of a specific ICESat-2 photon, the segments of photons with 10 m length were used. The standard deviation of the height and the total number of photons were calculated for each segment. As the melt ponds have the smoother surface than the sea ice, the lower variation of the height over melt ponds can make the melt ponds distinguished from the sea ice. When the melt ponds were extracted, the number of photons per segment was used to classify the melt ponds covered with open-water and specular ice. As photons are much more absorbed in the water-covered melt pondsthan the melt ponds with the specular ice, the number of photons persegment can distinguish the water- and ice-covered ponds. As a result, the suggested melt pond detection method was able to classify the sea ice, water-covered melt ponds, and ice-covered melt ponds. A qualitative analysis was conducted using the Sentinel-2 optical imagery. The suggested method successfully classified the water- and ice-covered ponds which were difficult to distinguish with Sentinel-2 optical images. Lastly, the pros and cons of the melt pond detection using satellite altimetry and optical images were discussed

    Identification of DNA-Methylated CpG Islands Associated With Gene Silencing in the Adult Body Tissues of the Ogye Chicken Using RNA-Seq and Reduced Representation Bisulfite Sequencing

    Get PDF
    DNA methylation is an epigenetic mark that plays an essential role in regulating gene expression. CpG islands are DNA methylations regions in promoters known to regulate gene expression through transcriptional silencing of the corresponding gene. DNA methylation at CpG islands is crucial for gene expression and tissue-specific processes. At the current time, a limited number of studies have reported on gene expression associated with DNA methylation in diverse adult tissues at the genome-wide level. Expression levels are rarely affected by DNA methylation in normal adult tissues; however, statistical differences in gene expression level correlated with DNA methylation have recently been revealed. In this study, we examined 20 pairs of DNA methylomes and transcriptomes from RNA-seq and reduced representation bisulfite sequencing (RRBS) data using adult Ogye chicken tissues. A total of 3,133 CpG islands were identified from 20 tissue data in a single chicken sample which could affect downstream genes. Analyzing these CpG island and gene pairs, 121 significant units were statistically correlated. Among them, six genes (CLDN3, DECR2, EVA1B, NME4, NTSR1, and XPNPEP2) were highly significantly changed by altered DNA methylation. Finally, our data demonstrated how DNA methylation correlated to gene expression in normal adult tissues. Our source codes can be found at https://github.com/wjlim/correlation-between-rna-seq-and-RRBS

    Reducing time to discovery : materials and molecular modeling, imaging, informatics, and integration

    Get PDF
    This work was supported by the KAIST-funded Global Singularity Research Program for 2019 and 2020. J.C.A. acknowledges support from the National Science Foundation under Grant TRIPODS + X:RES-1839234 and the Nano/Human Interfaces Presidential Initiative. S.V.K.’s effort was supported by the U.S. Department of Energy (DOE), Office of Science, Basic Energy Sciences (BES), Materials Sciences and Engineering Division and was performed at the Oak Ridge National Laboratory’s Center for Nanophase Materials Sciences (CNMS), a U.S. Department of Energy, Office of Science User Facility.Multiscale and multimodal imaging of material structures and properties provides solid ground on which materials theory and design can flourish. Recently, KAIST announced 10 flagship research fields, which include KAIST Materials Revolution: Materials and Molecular Modeling, Imaging, Informatics and Integration (M3I3). The M3I3 initiative aims to reduce the time for the discovery, design and development of materials based on elucidating multiscale processing-structure-property relationship and materials hierarchy, which are to be quantified and understood through a combination of machine learning and scientific insights. In this review, we begin by introducing recent progress on related initiatives around the globe, such as the Materials Genome Initiative (U.S.), Materials Informatics (U.S.), the Materials Project (U.S.), the Open Quantum Materials Database (U.S.), Materials Research by Information Integration Initiative (Japan), Novel Materials Discovery (E.U.), the NOMAD repository (E.U.), Materials Scientific Data Sharing Network (China), Vom Materials Zur Innovation (Germany), and Creative Materials Discovery (Korea), and discuss the role of multiscale materials and molecular imaging combined with machine learning in realizing the vision of M3I3. Specifically, microscopies using photons, electrons, and physical probes will be revisited with a focus on the multiscale structural hierarchy, as well as structure-property relationships. Additionally, data mining from the literature combined with machine learning will be shown to be more efficient in finding the future direction of materials structures with improved properties than the classical approach. Examples of materials for applications in energy and information will be reviewed and discussed. A case study on the development of a Ni-Co-Mn cathode materials illustrates M3I3's approach to creating libraries of multiscale structure-property-processing relationships. We end with a future outlook toward recent developments in the field of M3I3.Peer reviewe

    Bioinformatics services for analyzing massive genomic datasets

    Get PDF
    The explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational resources and analysis pipelines. A promising solution for addressing this computational challenge is cloud computing, where CPUs, memory, storage, and programs are accessible in the form of virtual machines. Here, we present a cloud computing-based system, Bio-Express, that provides user-friendly, cost-effective analysis of massive genomic datasets. Bio-Express is loaded with predefined multi-omics data analysis pipelines, which are divided into genome, transcriptome, epigenome, and metagenome pipelines. Users can employ predefined pipelines or create a new pipeline for analyzing their own omics data. We also developed several web-based services for facilitating down-stream analysis of genome data. Bio-Express web service is freely available at https://www. bioexpress.re.kr/. ?? 2020, Korea Genome Organization

    Development of Red Tide Detection Algorithm using GOCI Image based on Random Forest

    No full text
    The socio-economic damages on the fishery and aquacultural industries caused by the red tide have been increased in Korea. The remote sensing techniques using the ocean color (OC) satellite imagery has been developed in order to observe the red tide. However, the Korean red tide information system (RTIS) is still relying on ship surveillance. It has limitations to cover the whole coastal area as well as take lots of cost and time. This study developed the random forest (RF) based red tide detection model using the Geostationary Ocean Color Imager (GOCI) satellite imagery which has a higher spatio-temporal resolution (i.e., 500 x 500m, hourly). The spectral characteristics, quantitative and qualitative analysis, and spatio-temporal analysis of red tides in the South Sea of Korea during July ??? August 2018 were examined. The RF model showed promising detection accuracy (R2 = 0.701) than the other three algorithms at high concentrations (over 1,000 cells/mL) quantitatively as well as qualitatively. (i.e., modified red tide index (MRI, R2 = 0.192), red-to-blue ratio (RBR, R2 = 0.683), and spectral shape (SS, R2 = 0.531)). The detection model can provide an accurate red tide alert map in near-realtime as well as contribute to reducing socio-economic damages from the red tides in Korea

    Ocean Fog Detection using Himawari-8 data over the Yellow sea with Machine Learning Approaches

    No full text
    Ocean fog (OF) is a phenomenon in which the visibility distance is less than 1km over the ocean due to the droplets. OF has a role not only as a moisture source, for the plants when it enters the land, also as an obstacle to maritime traffic. Many harbors set up fog detectors on the land to monitor OF occurrence near their port, but it covers a limited area. Recently, satellite remote sensing which covers wider area was usually applied on this criterion, but it is hard to identify the OF condition because of the complexity of generation condition and optical-thermal properties. Thus, in this study, machine learning approaches (e.g., random forest, support vector machine, logistic regression) were used to observe OF occurrence. As spatial coverage, temporal coverage is also important for maritime traffic, so the geostationary satellite (i.e., Himawari-8) data were used. The study area is the Yellow sea, which is suffering from OFs frequently. The Cloud-Aerosol Lidar with Orthogonal Polarization data were used to get OF location as follows Wu et. al. (2015). For the temporal seamless monitoring, infrared channels 0of Himawari-8 were used as the input images. From the input images, not only thermal feature such as mean, maximum and minimum, but also spatio-temporal feature such as roughness of buffer area, temporal anomaly. Additional post processing was applied to check the reliability of each OF pixels

    SEXCMD: Development and validation of sex marker sequences for whole-exome/genome and RNA sequencing.

    No full text
    Over the last decade, a large number of nucleotide sequences have been generated by next-generation sequencing technologies and deposited to public databases. However, most of these datasets do not specify the sex of individuals sampled because researchers typically ignore or hide this information. Male and female genomes in many species have distinctive sex chromosomes, XX/XY and ZW/ZZ, and expression levels of many sex-related genes differ between the sexes. Herein, we describe how to develop sex marker sequences from syntenic regions of sex chromosomes and use them to quickly identify the sex of individuals being analyzed. Array-based technologies routinely use either known sex markers or the B-allele frequency of X or Z chromosomes to deduce the sex of an individual. The same strategy has been used with whole-exome/genome sequence data; however, all reads must be aligned onto a reference genome to determine the B-allele frequency of the X or Z chromosomes. SEXCMD is a pipeline that can extract sex marker sequences from reference sex chromosomes and rapidly identify the sex of individuals from whole-exome/genome and RNA sequencing after training with a known dataset through a simple machine learning approach. The pipeline counts total numbers of hits from sex-specific marker sequences and identifies the sex of the individuals sampled based on the fact that XX/ZZ samples do not have Y or W chromosome hits. We have successfully validated our pipeline with mammalian (Homo sapiens; XY) and avian (Gallus gallus; ZW) genomes. Typical calculation time when applying SEXCMD to human whole-exome or RNA sequencing datasets is a few minutes, and analyzing human whole-genome datasets takes about 10 minutes. Another important application of SEXCMD is as a quality control measure to avoid mixing samples before bioinformatics analysis. SEXCMD comprises simple Python and R scripts and is freely available at https://github.com/lovemun/SEXCMD
    corecore