668 research outputs found

    Age estimation during the blow fly intra-puparial period: a qualitative and quantitative approach using micro-computed tomography

    Get PDF
    © The Author(s) 2017. Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The attached file is the published version of the article

    Identifying the Machine Learning Family from Black-Box Models

    Full text link
    [EN] We address the novel question of determining which kind of machine learning model is behind the predictions when we interact with a black-box model. This may allow us to identify families of techniques whose models exhibit similar vulnerabilities and strengths. In our method, we first consider how an adversary can systematically query a given black-box model (oracle) to label an artificially-generated dataset. This labelled dataset is then used for training different surrogate models (each one trying to imitate the oracle¿s behaviour). The method has two different approaches. First, we assume that the family of the surrogate model that achieves the maximum Kappa metric against the oracle labels corresponds to the family of the oracle model. The other approach, based on machine learning, consists in learning a meta-model that is able to predict the model family of a new black-box model. We compare these two approaches experimentally, giving us insight about how explanatory and predictable our concept of family is.This material is based upon work supported by the Air Force Office of Scientific Research under award number FA9550-17-1-0287, the EU (FEDER), and the Spanish MINECO under grant TIN 2015-69175-C4-1-R, the Generalitat Valenciana PROMETEOII/2015/013. F. Martinez-Plumed was also supported by INCIBE under grant INCIBEI-2015-27345 (Ayudas para la excelencia de los equipos de investigacion avanzada en ciberseguridad). J. H-Orallo also received a Salvador de Madariaga grant (PRX17/00467) from the Spanish MECD for a research stay at the CFI, Cambridge, and a BEST grant (BEST/2017/045) from the GVA for another research stay at the CFI.Fabra-Boluda, R.; Ferri Ramírez, C.; Hernández-Orallo, J.; Martínez-Plumed, F.; Ramírez Quintana, MJ. (2018). Identifying the Machine Learning Family from Black-Box Models. Lecture Notes in Computer Science. 11160:55-65. https://doi.org/10.1007/978-3-030-00374-6_6S556511160Angluin, D.: Queries and concept learning. Mach. Learn. 2(4), 319–342 (1988)Benedek, G.M., Itai, A.: Learnability with respect to fixed distributions. Theor. Comput. Sci. 86(2), 377–389 (1991)Biggio, B., et al.: Security Evaluation of support vector machines in adversarial environments. In: Ma, Y., Guo, G. (eds.) Support Vector Machines Applications, pp. 105–153. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-02300-7_4Blanco-Vega, R., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Analysing the trade-off between comprehensibility and accuracy in mimetic models. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 338–346. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30214-8_29Dalvi, N., Domingos, P., Sanghai, S., Verma, D., et al.: Adversarial classification. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 99–108. ACM (2004)Dheeru, D., Karra Taniskidou, E.: UCI machine learning repository (2017). http://archive.ics.uci.edu/mlDomingos, P.: Knowledge discovery via multiple models. Intell. Data Anal. 2(3), 187–202 (1998)Duin, R.P.W., Loog, M., Pȩkalska, E., Tax, D.M.J.: Feature-based dissimilarity space classification. In: Ünay, D., Çataltepe, Z., Aksoy, S. (eds.) ICPR 2010. LNCS, vol. 6388, pp. 46–55. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17711-8_5Fernández-Delgado, M., Cernadas, E., Barro, S., Amorim, D.: Do we need hundreds of classifiers to solve real world classification problems. J. Mach. Learn. Res. 15(1), 3133–3181 (2014)Ferri, C., Hernández-Orallo, J., Modroiu, R.: An experimental comparison of performance measures for classification. Pattern Recognit. Lett. 30(1), 27–38 (2009)Giacinto, G., Perdisci, R., Del Rio, M., Roli, F.: Intrusion detection in computer networks by a modular ensemble of one-class classifiers. Inf. Fusion 9(1), 69–82 (2008)Huang, L., Joseph, A.D., Nelson, B., Rubinstein, B.I., Tygar, J.: Adversarial machine learning. In: Proceedings of the 4th ACM Workshop on Security and Artificial Intelligence, pp. 43–58 (2011)Kuncheva, L.I., Whitaker, C.J.: Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach. Learn. 51(2), 181–207 (2003)Landis, J.R., Koch, G.G.: An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics 33, 363–374 (1977)Lowd, D., Meek, C.: Adversarial learning. In: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data mining, pp. 641–647. ACM (2005)Martınez-Plumed, F., Prudêncio, R.B., Martınez-Usó, A., Hernández-Orallo, J.: Making sense of item response theory in machine learning. In: Proceedings of 22nd European Conference on Artificial Intelligence (ECAI). Frontiers in Artificial Intelligence and Applications, vol. 285, pp. 1140–1148 (2016)Papernot, N., McDaniel, P., Goodfellow, I.: Transferability in machine learning: from phenomena to black-box attacks using adversarial samples. arXiv preprint arXiv:1605.07277 (2016)Papernot, N., McDaniel, P., Jha, S., Fredrikson, M., Celik, Z.B., Swami, A.: The limitations of deep learning in adversarial settings. In: 2016 IEEE European Symposium on Security and Privacy (EuroS&P), pp. 372–387. IEEE (2016)Papernot, N., McDaniel, P., Wu, X., Jha, S., Swami, A.: Distillation as a defense to adversarial perturbations against deep neural networks. In: 2016 IEEE Symposium on Security and Privacy (SP), pp. 582–597. IEEE (2016)Sesmero, M.P., Ledezma, A.I., Sanchis, A.: Generating ensembles of heterogeneous classifiers using stacked generalization. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 5(1), 21–34 (2015)Smith, M.R., Martinez, T., Giraud-Carrier, C.: An instance level analysis of data complexity. Mach. Learn. 95(2), 225–256 (2014)Tramèr, F., Zhang, F., Juels, A., Reiter, M.K., Ristenpart, T.: Stealing machine learning models via prediction APIs. In: USENIX Security Symposium, pp. 601–618 (2016)Valiant, L.G.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)Wallace, C.S., Boulton, D.M.: An information measure for classification. Comput. J. 11(2), 185–194 (1968)Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992

    Estrogen-dependent dynamic profile of eNOS-DNA associations in prostate cancer

    Get PDF
    In previous work we have documented the nuclear translocation of endothelial NOS (eNOS) and its participation in combinatorial complexes with Estrogen Receptor Beta (ERβ) and Hypoxia Inducible Factors (HIFs) that determine localized chromatin remodeling in response to estrogen (E2) and hypoxia stimuli, resulting in transcriptional regulation of genes associated with adverse prognosis in prostate cancer (PCa). To explore the role of nuclear eNOS in the acquisition of aggressive phenotype in PCa, we performed ChIP-Sequencing on chromatin-associated eNOS from cells from a primary tumor with poor outcome and from metastatic LNCaP cells. We found that: 1. the eNOS-bound regions (peaks) are widely distributed across the genome encompassing multiple transcription factors binding sites, including Estrogen Response Elements. 2. E2 increased the number of peaks, indicating hormone-dependent eNOS re-localization. 3. Peak distribution was similar with/without E2 with ≈ 55% of them in extragenic DNA regions and an intriguing involvement of the 5′ domain of several miRs deregulated in PCa. Numerous potentially novel eNOS-targeted genes have been identified suggesting that eNOS participates in the regulation of large gene sets. The parallel finding of downregulation of a cluster of miRs, including miR-34a, in PCa cells associated with poor outcome led us to unveil a molecular link between eNOS and SIRT1, an epigenetic regulator of aging and tumorigenicity, negatively regulated by miR-34a and in turn activating eNOS. E2 potentiates miR-34a downregulation thus enhancing SIRT1 expression, depicting a novel eNOS/SIRT1 interplay fine-tuned by E2-activated ER signaling, and suggesting that eNOS may play an important role in aggressive PCa

    Incidence of Diabetes in the Working Population in Spain: Results from the ICARIA Cohort

    Get PDF
    INTRODUCTION: Our objective was to evaluate the incidence of type 2 diabetes mellitus (T2DM) in a working population in Spain and to assess associations between its development and several risk factors. METHODS: The ICARIA (Ibermutuamur CArdiovascular RIsk Assessment) cohort (n = 627,523) includes ~3% of Spanish workers. This analysis was undertaken in individuals whose glycaemic status during the index period (May 2004-December 2007) was determined to be normal or indicative of prediabetes [fasting plasma glucose (FPG) 100-125 mg/dl] and who had at least one FPG measurement taken 9 months after a first measurement during follow-up (May 2004-June 2014) (n = 380,366). T2DM patients were defined as those with an FPG ? 126 mg/day and those who had already been diagnosed with T2DM or were taking antihyperglycaemic medications. RESULTS: The incidence rate of T2DM was 5.0 [95% confidence interval (CI) 4.9-5.1] cases per 1000 person-years. Under multivariate logistic regression analysis, the factor showing the strongest association with the occurrence of T2DM was the baseline FPG level, with the likelihood of T2DM almost doubling for every 5 mg/dl increase in baseline FPG between 100 and < 126 mg/dl. The presence of other cardiometabolic risk factors and being a blue-collar worker were also significantly associated with the occurrence of T2DM. CONCLUSIONS: The incidence of T2DM in the working population was within the range encountered in the general population and prediabetes was found to be the strongest risk factor for the development of diabetes. The workplace is an appropriate and feasible setting for the assessment of easily measurable risk factors, such as the presence of prediabetes and other cardiometabolic factors, to facilitate the early detection of individuals at higher risk of diabetes and the implementation of diabetes prevention programmes

    Determinant representations of scalar products for the open XXZ chain with non-diagonal boundary terms

    Full text link
    With the help of the F-basis provided by the Drinfeld twist or factorizing F-matrix for the open XXZ spin chain with non-diagonal boundary terms, we obtain the determinant representations of the scalar products of Bethe states of the model.Comment: Latex file, 28 pages, based on the talk given by W. -L. Yang at Statphys 24, Cairns, Australia, 19-23 July, 201

    The role of the right temporoparietal junction in perceptual conflict: detection or resolution?

    Get PDF
    The right temporoparietal junction (rTPJ) is a polysensory cortical area that plays a key role in perception and awareness. Neuroimaging evidence shows activation of rTPJ in intersensory and sensorimotor conflict situations, but it remains unclear whether this activity reflects detection or resolution of such conflicts. To address this question, we manipulated the relationship between touch and vision using the so-called mirror-box illusion. Participants' hands lay on either side of a mirror, which occluded their left hand and reflected their right hand, but created the illusion that they were looking directly at their left hand. The experimenter simultaneously touched either the middle (D3) or the ring finger (D4) of each hand. Participants judged, which finger was touched on their occluded left hand. The visual stimulus corresponding to the touch on the right hand was therefore either congruent (same finger as touch) or incongruent (different finger from touch) with the task-relevant touch on the left hand. Single-pulse transcranial magnetic stimulation (TMS) was delivered to the rTPJ immediately after touch. Accuracy in localizing the left touch was worse for D4 than for D3, particularly when visual stimulation was incongruent. However, following TMS, accuracy improved selectively for D4 in incongruent trials, suggesting that the effects of the conflicting visual information were reduced. These findings suggest a role of rTPJ in detecting, rather than resolving, intersensory conflict

    Genetic characterization of morphologically variant strains of Paracoccidioides brasiliensis

    Get PDF
    Molecular characterization of Paracoccidioides brasiliensis variant strains that had been preserved under mineral oil for decades was carried out by random amplified polymorphic DNA analysis (RAPD). On P. brasiliensis variants in the transitional phase and strains with typical morphology, RAPD produced reproducible polymorphic amplification products that differentiated them. A dendrogram based on the generated RAPD patterns placed the 14 P. brasiliensis strains into five groups with similarity coefficients of 72%. A high correlation between the genotypic and phenotypic characteristics of the strains was observed. A 750 bp-RAPD fragment found only in the wild-type phenotype strains was cloned and sequenced. Genetic similarity analysis using BLASTx suggested that this RAPD marker represents a putative domain of a hypothetical flavin-binding monooxygenase (FMO)-like protein of Neurospora crassa.FiocruzBritish Council Progra

    Antibodies to Serine Proteases in the Antiphospholipid Syndrome

    Get PDF
    It is generally accepted that the major autoantigen for antiphospholipid antibodies (aPL) in the antiphospholipid syndrome (APS) is β2-glycoprotein I (β2GPI). However, a recent study has revealed that some aPL bind to certain conformational epitope(s) on β2GPI shared by the homologous enzymatic domains of several serine proteases involved in hemostasis and fibrinolysis. Importantly, some serine protease–reactive aPL correspondingly hinder anticoagulant regulation and resolution of clots. These results extend several early findings of aPL binding to other coagulation factors and provide a new perspective about some aPL in terms of binding specificities and related functional properties in promoting thrombosis. Moreover, a recent immunological and pathological study of a panel of human IgG monoclonal aPL showed that aPL with strong binding to thrombin promote in vivo venous thrombosis and leukocyte adherence, suggesting that aPL reactivity with thrombin may be a good predictor for pathogenic potentials of aPL

    Identifying Selected Regions from Heterozygosity and Divergence Using a Light-Coverage Genomic Dataset from Two Human Populations

    Get PDF
    When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (FST) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of FST (S2FST) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5–65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species
    corecore