97 research outputs found

    Ensembles of probability estimation trees for customer churn prediction

    Get PDF
    Customer churn prediction is one of the most, important elements tents of a company's Customer Relationship Management, (CRM) strategy In tins study, two strategies are investigated to increase the lift. performance of ensemble classification models, i.e (1) using probability estimation trees (PETs) instead of standard decision trees as base classifiers; and (n) implementing alternative fusion rules based on lift weights lot the combination of ensemble member's outputs Experiments ale conducted lot font popular ensemble strategics on five real-life chin n data sets In general, the results demonstrate how lift performance can be substantially improved by using alternative base classifiers and fusion tides However: the effect vanes lot the (Idol cut ensemble strategies lit particular, the results indicate an increase of lift performance of (1) Bagging by implementing C4 4 base classifiets. (n) the Random Subspace Method (RSM) by using lift-weighted fusion rules, and (in) AdaBoost, by implementing both

    The Five Factor Model of personality and evaluation of drug consumption risk

    Full text link
    The problem of evaluating an individual's risk of drug consumption and misuse is highly important. An online survey methodology was employed to collect data including Big Five personality traits (NEO-FFI-R), impulsivity (BIS-11), sensation seeking (ImpSS), and demographic information. The data set contained information on the consumption of 18 central nervous system psychoactive drugs. Correlation analysis demonstrated the existence of groups of drugs with strongly correlated consumption patterns. Three correlation pleiades were identified, named by the central drug in the pleiade: ecstasy, heroin, and benzodiazepines pleiades. An exhaustive search was performed to select the most effective subset of input features and data mining methods to classify users and non-users for each drug and pleiad. A number of classification methods were employed (decision tree, random forest, kk-nearest neighbors, linear discriminant analysis, Gaussian mixture, probability density function estimation, logistic regression and na{\"i}ve Bayes) and the most effective classifier was selected for each drug. The quality of classification was surprisingly high with sensitivity and specificity (evaluated by leave-one-out cross-validation) being greater than 70\% for almost all classification tasks. The best results with sensitivity and specificity being greater than 75\% were achieved for cannabis, crack, ecstasy, legal highs, LSD, and volatile substance abuse (VSA).Comment: Significantly extended report with 67 pages, 27 tables, 21 figure

    Anisotropic conductivity of Nd_{1.85}Ce_{0.15}CuO_{4-\delta} films at submillimeter wavelengths

    Full text link
    The anisotropic conductivity of thin Nd1.85_{1.85}Ce0.15_{0.15}CuO4δ_{4-\delta} films was measured in the frequency range 8 cm1<ν<^{-1}<\nu < 40 cm1^{-1} and for temperatures 4 K <T<300<T<300 K. A tilted sample geometry allowed to extract both, in-plane and c-axis properties. The in-plane quasiparticle scattering rate remains unchanged as the sample becomes superconducting. The temperature dependence of the in-plane conductivity is reasonably well described using the Born limit for a d-wave superconductor. Below T_{{\rm C}%} the c-axis dielectric constant ϵ1c\epsilon_{1c} changes sign at the screened c-axis plasma frequency. The temperature dependence of the c-axis conductivity closely follows the linear in T behavior within the plane.Comment: 4 pages, 4 figure

    Optical symmetries and anisotropic transport in high-Tc superconductors

    Full text link
    A simple symmetry analysis of in-plane and out-of-plane transport in a family of high temperature superconductors is presented. It is shown that generalized scaling relations exist between the low frequency electronic Raman response and the low frequency in-plane and out-of-plane conductivities in both the normal and superconducting states of the cuprates. Specifically, for both the normal and superconducting state, the temperature dependence of the low frequency B1gB_{1g} Raman slope scales with the cc-axis conductivity, while the B2gB_{2g} Raman slope scales with the in-plane conductivity. Comparison with experiments in the normal state of Bi-2212 and Y-123 imply that the nodal transport is largely doping independent and metallic, while transport near the BZ axes is governed by a quantum critical point near doping p0.22p\sim 0.22 holes per CuO2_{2} plaquette. Important differences for La-214 are discussed. It is also shown that the cc- axis conductivity rise for TTcT\ll T_{c} is a consequence of partial conservation of in-plane momentum for out-of-plane transport.Comment: 16 pages, 8 Figures (3 pages added, new discussion on pseudogap and charge ordering in La214

    Isoscaling in dissipative projectile breakup

    Get PDF
    Dynamical breakup of projectile-like fragments (PLF) following dissipative reactions of Ca-48 projectiles with Sn-112 and Sn-124 is shown to exhibit "isoscaling" regularities that can be understood in terms of phase space governed by ground state masses. Ambiguities in isoscaling parameters obscure information on nuclear symmetry energy at subnormal densities

    The genetic architecture of the human cerebral cortex

    Get PDF
    INTRODUCTION The cerebral cortex underlies our complex cognitive capabilities. Variations in human cortical surface area and thickness are associated with neurological, psychological, and behavioral traits and can be measured in vivo by magnetic resonance imaging (MRI). Studies in model organisms have identified genes that influence cortical structure, but little is known about common genetic variants that affect human cortical structure. RATIONALE To identify genetic variants associated with human cortical structure at both global and regional levels, we conducted a genome-wide association meta-analysis of brain MRI data from 51,665 individuals across 60 cohorts. We analyzed the surface area and average thickness of the whole cortex and 34 cortical regions with known functional specializations. RESULTS We identified 306 nominally genome-wide significant loci (P < 5 × 10−8) associated with cortical structure in a discovery sample of 33,992 participants of European ancestry. Of the 299 loci for which replication data were available, 241 loci influencing surface area and 14 influencing thickness remained significant after replication, with 199 loci passing multiple testing correction (P < 8.3 × 10−10; 187 influencing surface area and 12 influencing thickness). Common genetic variants explained 34% (SE = 3%) of the variation in total surface area and 26% (SE = 2%) in average thickness; surface area and thickness showed a negative genetic correlation (rG = −0.32, SE = 0.05, P = 6.5 × 10−12), which suggests that genetic influences have opposing effects on surface area and thickness. Bioinformatic analyses showed that total surface area is influenced by genetic variants that alter gene regulatory activity in neural progenitor cells during fetal development. By contrast, average thickness is influenced by active regulatory elements in adult brain samples, which may reflect processes that occur after mid-fetal development, such as myelination, branching, or pruning. When considered together, these results support the radial unit hypothesis that different developmental mechanisms promote surface area expansion and increases in thickness. To identify specific genetic influences on individual cortical regions, we controlled for global measures (total surface area or average thickness) in the regional analyses. After multiple testing correction, we identified 175 loci that influence regional surface area and 10 that influence regional thickness. Loci that affect regional surface area cluster near genes involved in the Wnt signaling pathway, which is known to influence areal identity. We observed significant positive genetic correlations and evidence of bidirectional causation of total surface area with both general cognitive functioning and educational attainment. We found additional positive genetic correlations between total surface area and Parkinson’s disease but did not find evidence of causation. Negative genetic correlations were evident between total surface area and insomnia, attention deficit hyperactivity disorder, depressive symptoms, major depressive disorder, and neuroticism. CONCLUSION This large-scale collaborative work enhances our understanding of the genetic architecture of the human cerebral cortex and its regional patterning. The highly polygenic architecture of the cortex suggests that distinct genes are involved in the development of specific cortical areas. Moreover, we find evidence that brain structure is a key phenotype along the causal pathway that leads from genetic variation to differences in general cognitive function

    Mapping and characterization of structural variation in 17,795 human genomes

    Get PDF
    A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0–11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing

    Learning multiple predicates

    No full text

    False Alarm Classification Model for Network-Based Intrusion Detection System

    No full text
    corecore