132 research outputs found

    Ensemble learning with dynamic weighting for response modeling in direct marketing

    Get PDF
    Response modeling, a key to successful direct marketing, has become increasingly prevalent in recent years. However, it practically suffers from the difficulty of class imbalance, i.e., the number of responding (target) customers is often much smaller than that of the non-responding customers. This issue would result in a response model that is biased to the majority class, leading to the low prediction accuracy on the responding customers. In this study, we develop an Ensemble Learning with Dynamic Weighting (ELDW) approach to address the above problem. The proposed ELDW includes two stages. In the first stage, all the minority class instances are combined with different majority class instances to form a number of training subsets, and a base classifiers is trained in each subset. In the second stage, the results of the base classifiers are dynamically integrated, in which two factors are considered. The first factor is the cross entropy of neighbors in each subset, and the second factor is the feature similarity to the minority class instances. In order to evaluate the performance of ELDW, we conduct experimental studies on 10 imbalanced benchmark datasets. The results show that compared with other state-of-the-art imbalance classification algorithms, ELDW achieves higher accuracy on the minority class. Last, we apply the ELDW to a direct marketing activity of an insurance company to identify the target customers under a limited budget

    Regression-based surface water fraction mapping using a synthetic spectral library for monitoring small water bodies

    Get PDF
    Small water bodies (SWBs), such as ponds and on-farm reservoirs, are a key part of the hydrological system and play important roles in diverse domains from agriculture to conservation. The monitoring of SWBs has been greatly facilitated by medium-spatial-resolution satellite images, but the monitoring accuracy is considerably affected by the mixed-pixel problem. Although various spectral unmixing methods have been applied to map sub-pixel surface water fractions for large water bodies, such as lakes and reservoirs, it is challenging to map SWBs that are small in size relative to the image pixel and have dissimilar spectral properties. In this study, a novel regression-based surface water fraction mapping method (RSWFM) using a random forest and a synthetic spectral library is proposed for mapping 10 m spatial resolution surface water fractions from Sentinel-2 imagery. The RSWFM inputs a few endmembers of water, vegetation, impervious surfaces, and soil to simulate a spectral library, and considers spectral variations in endmembers for different SWBs. Additionally, RSWFM applies noise-based data augmentation on pure endmembers to overcome the limitation often arising from the use of a small set of pure spectra in training the regression model. RSWFM was assessed in ten study sites and compared with the fully constrained least squares (FCLS) linear spectral mixture analysis, multiple endmember spectral mixture analysis (MESMA), and the nonlinear random forest (RF) regression without data-augmentation. The results showed that RSWFM decreases the water fraction mapping errors by ~ 30%, ~15%, and ~ 11% in root mean square error compared with the linear FCLS, MESMA unmixings, and the nonlinear RF regression without data-augmentation respectively. RSWFM has an accuracy of approximately 0.85 in R2 in estimating the area of SWBs smaller than 1 ha

    Interictal EEG features as computational biomarkers of West syndrome

    Get PDF
    BackgroundWest syndrome (WS) is a devastating epileptic encephalopathy with onset in infancy and early childhood. It is characterized by clustered epileptic spasms, developmental arrest, and interictal hypsarrhythmia on electroencephalogram (EEG). Hypsarrhythmia is considered the hallmark of WS, but its visual assessment is challenging due to its wide variability and lack of a quantifiable definition. This study aims to analyze the EEG patterns in WS and identify computational diagnostic biomarkers of the disease.MethodLinear and non-linear features derived from EEG recordings of 31 WS patients and 20 age-matched controls were compared. Subsequently, the correlation of the identified features with structural and genetic abnormalities was investigated.ResultsWS patients showed significantly elevated alpha-band activity (0.2516 vs. 0.1914, p < 0.001) and decreased delta-band activity (0.5117 vs. 0.5479, p < 0.001), particularly in the occipital region, as well as globally strengthened theta-band activity (0.2145 vs. 0.1655, p < 0.001) in power spectrum analysis. Moreover, wavelet-bicoherence analysis revealed significantly attenuated cross-frequency coupling in WS patients. Additionally, bi-channel coherence analysis indicated minor connectivity alterations in WS patients. Among the four non-linear characteristics of the EEG data (i.e., approximate entropy, sample entropy, permutation entropy, and wavelet entropy), permutation entropy showed the most prominent global reduction in the EEG of WS patients compared to controls (1.4411 vs. 1.5544, p < 0.001). Multivariate regression results suggested that genetic etiologies could influence the EEG profiles of WS, whereas structural factors could not.SignificanceA combined global strengthening of theta activity and global reduction of permutation entropy can serve as computational EEG biomarkers for WS. Implementing these biomarkers in clinical practice may expedite diagnosis and treatment in WS, thereby improving long-term outcomes

    Unmixing-based Spatiotemporal Image Fusion Based on the Self-trained Random Forest Regression and Residual Compensation

    Get PDF
    Spatiotemporal satellite image fusion (STIF) has been widely applied in land surface monitoring to generate high spatial and high temporal reflectance images from satellite sensors. This paper proposed a new unmixing-based spatiotemporal fusion method that is composed of a self-trained random forest machine learning regression (R), low resolution (LR) endmember estimation (E), high resolution (HR) surface reflectance image reconstruction (R), and residual compensation (C), that is, RERC. RERC uses a self-trained random forest to train and predict the relationship between spectra and the corresponding class fractions. This process is flexible without any ancillary training dataset, and does not possess the limitations of linear spectral unmixing, which requires the number of endmembers to be no more than the number of spectral bands. The running time of the random forest regression is about ~1% of the running time of the linear mixture model. In addition, RERC adopts a spectral reflectance residual compensation approach to refine the fused image to make full use of the information from the LR image. RERC was assessed in the fusion of a prediction time MODIS with a Landsat image using two benchmark datasets, and was assessed in fusing images with different numbers of spectral bands by fusing a known time Landsat image (seven bands used) with a known time very-high-resolution PlanetScope image (four spectral bands). RERC was assessed in the fusion of MODIS-Landsat imagery in large areas at the national scale for the Republic of Ireland and France. The code is available at https://www.researchgate.net/proiile/Xiao_Li52

    Deep Feature and Domain Knowledge Fusion Network for Mapping Surface Water Bodies by Fusing Google Earth RGB and Sentinel-2 images

    Get PDF
    Mapping surface water bodies from fine spatial resolution optical remote sensing imagery is essential for the understanding of the global hydrologic cycle. Although satellite data are useful for mapping, the limited spectral information captured by some satellite systems can be sub-optimal for the task. For example, the very high resolution images of Google Earth (GE) only contain RGB bands, which often means many water bodies and land objects are confused. Sentinel-2 (S2) imagery have a spectral resolution more suitable for mapping water bodies, but its medium spatial resolution limits the ability for detailed mapping of water-land boundaries. This letter proposes a deep feature and domain knowledge fusion network (DFDKFNet) for mapping surface water bodies by fusing GE and S2 images while incorporating domain knowledge. DFDKFNet uses the remote sensing indices of normalized difference water index (NDWI) and normalized difference vegetation index (NDVI) derived from the S2 image as the representative domain knowledge to better extract water bodies from terrestrial features. A similar pixel-based approach is used to downscaling the NDWI and NDVI maps to match the spatial resolution between the GE and S2 images. The DFDKFNet uses the GE and downscaled NDWI and NDVI images to extract the deep semantic features of water bodies, which are fused with the domain knowledge extracted from the NDWI and NDVI images. DFDKFNet was compared with several state-of-the-art algorithms, and the results show that DFDKFNet can enhance water body mapping accuracy

    Dynamic Transcriptome Analysis Reveals Potential Long Non-coding RNAs Governing Postnatal Pineal Development in Pig

    Get PDF
    Postnatal development and maturation of pineal gland is a highly dynamic period of tissue remodeling and phenotype maintenance, which is genetically controlled by programmed gene expression regulations. However, limited molecular characterization, particularly regarding long noncoding RNAs (lncRNA), is available for postnatal pineal at a whole transcriptome level. The present study first characterized the comprehensive pineal transcriptome profiles using strand-specific RNA-seq to illustrate the dynamic mRNA/lncRNA expression at three developmental stages (infancy, puberty, and adulthood). The results showed that 21,448 mRNAs and 8,166 novel lncRNAs were expressed in pig postnatal pineal gland. Among these genes, 3,573 mRNAs and 851 lncRNAs, including the 5-hydroxytryptamine receptors, exhibited significant dynamic regulation along maturation process, while the expression of homeobox genes didn’t show significant differences. Gene Ontology analysis revealed that the differentially expressed genes (DEGs) were significantly enriched in ion transport and synaptic transmission, highlighting the critical role of calcium signaling in postnatal pineal development. Additionally, co-expression analysis revealed the DEGs could be grouped into 12 clusters with distinct expression patterns. Many differential lncRNAs were functionally enriched in co-expressed clusters of genes related to ion transport, transcription regulation, DNA binding, and visual perception. Our study first provided an overview of postnatal pineal transcriptome dynamics in pig and demonstrated that dynamic lncRNA regulation of developmental transitions impact pineal physiology

    Fatty infiltration in the musculoskeletal system: pathological mechanisms and clinical implications

    Get PDF
    Fatty infiltration denotes the anomalous accrual of adipocytes in non-adipose tissue, thereby generating toxic substances with the capacity to impede the ordinary physiological functions of various organs. With aging, the musculoskeletal system undergoes pronounced degenerative alterations, prompting heightened scrutiny regarding the contributory role of fatty infiltration in its pathophysiology. Several studies have demonstrated that fatty infiltration affects the normal metabolism of the musculoskeletal system, leading to substantial tissue damage. Nevertheless, a definitive and universally accepted generalization concerning the comprehensive effects of fatty infiltration on the musculoskeletal system remains elusive. As a result, this review summarizes the characteristics of different types of adipose tissue, the pathological mechanisms associated with fatty infiltration in bone, muscle, and the entirety of the musculoskeletal system, examines relevant clinical diseases, and explores potential therapeutic modalities. This review is intended to give researchers a better understanding of fatty infiltration and to contribute new ideas to the prevention and treatment of clinical musculoskeletal diseases

    Monitoring high spatiotemporal water dynamics by fusing MODIS, Landsat, water occurrence data and DEM

    Get PDF
    Monitoring the spatiotemporal dynamics of surface water from remote sensing imagery is essential for understanding water's impact on the global ecosystem and climate change. There is often a tradeoff between the spatial and temporal resolutions of imagery acquired from current satellite sensors and as such various spatiotemporal image fusion methods have been explored to circumvent the challenges this situation presents (e.g., STARFM). However, some challenges persist in mapping surface water at the desired fine spatial and temporal resolution. Principally, the spatiotemporal changes of water bodies are often abrupt and controlled by topographic conditions, which are usually unaddressed in current spatiotemporal image fusion methods. This paper proposes the SpatioTemporal Surface Water Mapping (STSWM) method, which aims to predict Landsat-like, 30 m, surface water maps at an 8-day time step (same as the MODIS 8-day composite product) by integrating topographic information into the analysis. In addition to MODIS imagery acquired on the date of map prediction and a pair of MODIS and Landsat images acquired temporally close to the date of prediction, STSWM also uses the surface water occurrence (SWO, which represents the frequency with which water is present in a pixel) and DEM data to provide, respectively, topographic information below and above the water surface. These data are used to translate the coarse spatial resolution water distribution representation observed by MODIS into a 30 m spatial resolution water distribution map. The STSWM was used to generate an 8-day time series surface water maps of 30 m resolution in six inundation regions globally, and was compared with several other state-of-the-art spatiotemporal methods. The stratified random sampling design was used, and unbiased estimators of the accuracies were provided. The results show that STSWM generated the most accurate surface water map in which the spatial details of surface water were well-represented

    Intragrain impurity annihilation for highly efficient and stable perovskite solar cells

    Get PDF
    Intragrain impurities can impart detrimental effects on the efficiency and stability of perovskite solar cells, but they are indiscernible to conventional characterizations and thus remain unexplored. Using in situ scanning transmission electron microscopy, we reveal that intragrain impurity nano-clusters inherited from either the solution synthesis or post-synthesis storage can revert to perovskites upon irradiation stimuli, leading to the counterintuitive amendment of crystalline grains. In conjunction with computational modelling, we atomically resolve crystallographic transformation modes for the annihilation of intragrain impurity nano-clusters and probe their impacts on optoelectronic properties. Such critical fundamental findings are translated for the device advancement. Adopting a scanning laser stimulus proven to heal intragrain impurity nano-clusters, we simultaneously boost the efficiency and stability of formamidinium-cesium perovskite solar cells, by virtual of improved optoelectronic properties and relaxed intra-crystal strain, respectively. This device engineering, inspired and guided by atomic-scale in situ microscopic imaging, presents a new prototype for solar cell advancement
    corecore