946 research outputs found
Identifying the Machine Learning Family from Black-Box Models
[EN] We address the novel question of determining which kind of machine learning model is behind the predictions when we interact with a black-box model. This may allow us to identify families of techniques whose models exhibit similar vulnerabilities and strengths. In our method, we first consider how an adversary can systematically query a given black-box model (oracle) to label an artificially-generated dataset. This labelled dataset is then used for training different surrogate models (each one trying to imitate the oracle¿s behaviour). The method has two different approaches. First, we assume that the family of the surrogate model that achieves the maximum Kappa metric against the oracle labels corresponds to the family of the oracle model. The other approach, based on machine learning, consists in learning a meta-model that is able to predict the model family of a new black-box model. We compare these two approaches experimentally, giving us insight about how explanatory and predictable our concept of family is.This material is based upon work supported by the Air Force Office of Scientific Research under award number FA9550-17-1-0287, the EU (FEDER), and the Spanish MINECO under grant TIN 2015-69175-C4-1-R, the Generalitat Valenciana PROMETEOII/2015/013. F. Martinez-Plumed was also supported by INCIBE under grant INCIBEI-2015-27345 (Ayudas para la excelencia de los equipos de investigacion avanzada en ciberseguridad). J. H-Orallo also received a Salvador de Madariaga grant (PRX17/00467) from the Spanish MECD for a research stay at the CFI, Cambridge, and a BEST grant (BEST/2017/045) from the GVA for another research stay at the CFI.Fabra-Boluda, R.; Ferri Ramírez, C.; Hernández-Orallo, J.; Martínez-Plumed, F.; Ramírez Quintana, MJ. (2018). Identifying the Machine Learning Family from Black-Box Models. Lecture Notes in Computer Science. 11160:55-65. https://doi.org/10.1007/978-3-030-00374-6_6S556511160Angluin, D.: Queries and concept learning. Mach. Learn. 2(4), 319–342 (1988)Benedek, G.M., Itai, A.: Learnability with respect to fixed distributions. Theor. Comput. Sci. 86(2), 377–389 (1991)Biggio, B., et al.: Security Evaluation of support vector machines in adversarial environments. In: Ma, Y., Guo, G. (eds.) Support Vector Machines Applications, pp. 105–153. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-02300-7_4Blanco-Vega, R., Hernández-Orallo, J., Ramírez-Quintana, M.J.: Analysing the trade-off between comprehensibility and accuracy in mimetic models. In: Suzuki, E., Arikawa, S. (eds.) DS 2004. LNCS (LNAI), vol. 3245, pp. 338–346. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30214-8_29Dalvi, N., Domingos, P., Sanghai, S., Verma, D., et al.: Adversarial classification. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 99–108. ACM (2004)Dheeru, D., Karra Taniskidou, E.: UCI machine learning repository (2017). http://archive.ics.uci.edu/mlDomingos, P.: Knowledge discovery via multiple models. Intell. Data Anal. 2(3), 187–202 (1998)Duin, R.P.W., Loog, M., Pȩkalska, E., Tax, D.M.J.: Feature-based dissimilarity space classification. In: Ünay, D., Çataltepe, Z., Aksoy, S. (eds.) ICPR 2010. LNCS, vol. 6388, pp. 46–55. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17711-8_5Fernández-Delgado, M., Cernadas, E., Barro, S., Amorim, D.: Do we need hundreds of classifiers to solve real world classification problems. J. Mach. Learn. Res. 15(1), 3133–3181 (2014)Ferri, C., Hernández-Orallo, J., Modroiu, R.: An experimental comparison of performance measures for classification. Pattern Recognit. Lett. 30(1), 27–38 (2009)Giacinto, G., Perdisci, R., Del Rio, M., Roli, F.: Intrusion detection in computer networks by a modular ensemble of one-class classifiers. Inf. Fusion 9(1), 69–82 (2008)Huang, L., Joseph, A.D., Nelson, B., Rubinstein, B.I., Tygar, J.: Adversarial machine learning. In: Proceedings of the 4th ACM Workshop on Security and Artificial Intelligence, pp. 43–58 (2011)Kuncheva, L.I., Whitaker, C.J.: Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach. Learn. 51(2), 181–207 (2003)Landis, J.R., Koch, G.G.: An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics 33, 363–374 (1977)Lowd, D., Meek, C.: Adversarial learning. In: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data mining, pp. 641–647. ACM (2005)Martınez-Plumed, F., Prudêncio, R.B., Martınez-Usó, A., Hernández-Orallo, J.: Making sense of item response theory in machine learning. In: Proceedings of 22nd European Conference on Artificial Intelligence (ECAI). Frontiers in Artificial Intelligence and Applications, vol. 285, pp. 1140–1148 (2016)Papernot, N., McDaniel, P., Goodfellow, I.: Transferability in machine learning: from phenomena to black-box attacks using adversarial samples. arXiv preprint arXiv:1605.07277 (2016)Papernot, N., McDaniel, P., Jha, S., Fredrikson, M., Celik, Z.B., Swami, A.: The limitations of deep learning in adversarial settings. In: 2016 IEEE European Symposium on Security and Privacy (EuroS&P), pp. 372–387. IEEE (2016)Papernot, N., McDaniel, P., Wu, X., Jha, S., Swami, A.: Distillation as a defense to adversarial perturbations against deep neural networks. In: 2016 IEEE Symposium on Security and Privacy (SP), pp. 582–597. IEEE (2016)Sesmero, M.P., Ledezma, A.I., Sanchis, A.: Generating ensembles of heterogeneous classifiers using stacked generalization. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 5(1), 21–34 (2015)Smith, M.R., Martinez, T., Giraud-Carrier, C.: An instance level analysis of data complexity. Mach. Learn. 95(2), 225–256 (2014)Tramèr, F., Zhang, F., Juels, A., Reiter, M.K., Ristenpart, T.: Stealing machine learning models via prediction APIs. In: USENIX Security Symposium, pp. 601–618 (2016)Valiant, L.G.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)Wallace, C.S., Boulton, D.M.: An information measure for classification. Comput. J. 11(2), 185–194 (1968)Wolpert, D.H.: Stacked generalization. Neural Netw. 5(2), 241–259 (1992
Role of mitochondrial raft-like microdomains in the regulation of cell apoptosis
Lipid rafts are envisaged as lateral assemblies of specific lipids and proteins that dissociate and associate rapidly and form functional clusters in cell membranes. These structural platforms are not confined to the plasma membrane; indeed lipid microdomains are similarly formed at subcellular organelles, which include endoplasmic reticulum, Golgi and mitochondria, named raft-like microdomains. In addition, some components of raft-like microdomains are present within ER-mitochondria associated membranes. This review is focused on the role of mitochondrial raft-like microdomains in the regulation of cell apoptosis, since these microdomains may represent preferential sites where key reactions take place, regulating mitochondria hyperpolarization, fission-associated changes, megapore formation and release of apoptogenic factors. These structural platforms appear to modulate cytoplasmic pathways switching cell fate towards cell survival or death. Main insights on this issue derive from some pathological conditions in which alterations of microdomains structure or function can lead to severe alterations of cell activity and life span. In the light of the role played by raft-like microdomains to integrate apoptotic signals and in regulating mitochondrial dynamics, it is conceivable that these membrane structures may play a role in the mitochondrial alterations observed in some of the most common human neurodegenerative diseases, such as Amyotrophic lateral sclerosis, Huntington's chorea and prion-related diseases. These findings introduce an additional task for identifying new molecular target(s) of pharmacological agents in these pathologies
Integrated Operational Taxonomic Units (IOTUs) in Echolocating Bats: A Bridge between Molecular and Traditional Taxonomy
Background: Nowadays, molecular techniques are widespread tools for the identification of biological entities. However,
until very few years ago, their application to taxonomy provoked intense debates between traditional and molecular
taxonomists. To prevent every kind of disagreement, it is essential to standardize taxonomic definitions. Along these lines,
we introduced the concept of Integrated Operational Taxonomic Unit (IOTU). IOTUs come from the concept of Operational
Taxonomic Unit (OTU) and paralleled the Molecular Operational Taxonomic Unit (MOTU). The latter is largely used as
a standard in many molecular-based works (even if not always explicitly formalized). However, while MOTUs are assigned
solely on molecular variation criteria, IOTUs are identified from patterns of molecular variation that are supported by at least
one more taxonomic characteristic.
Methodology/Principal Findings: We tested the use of IOTUs on the widest DNA barcoding dataset of Italian echolocating
bats species ever assembled (i.e. 31 species, 209 samples). We identified 31 molecular entities, 26 of which corresponded to
the morphologically assigned species, two MOTUs and three IOTUs. Interestingly, we found three IOTUs in Myotis nattereri,
one of which is a newly described lineage found only in central and southern Italy. In addition, we found a level of molecular
variability within four vespertilionid species deserving further analyses. According to our scheme two of them (i.e.
M. bechsteinii and Plecotus auritus) should be ranked as unconfirmed candidate species (UCS).
Conclusions/Significance: From a systematic point of view, IOTUs are more informative than the general concept of OTUs
and the more recent MOTUs. According to information content, IOTUs are closer to species, although it is important to
underline that IOTUs are not species. Overall, the use of a more precise panel of taxonomic entities increases the clarity in
the systematic field and has the potential to fill the gaps between modern and traditional taxonomy
Performance of CMS muon reconstruction in pp collision events at sqrt(s) = 7 TeV
The performance of muon reconstruction, identification, and triggering in CMS
has been studied using 40 inverse picobarns of data collected in pp collisions
at sqrt(s) = 7 TeV at the LHC in 2010. A few benchmark sets of selection
criteria covering a wide range of physics analysis needs have been examined.
For all considered selections, the efficiency to reconstruct and identify a
muon with a transverse momentum pT larger than a few GeV is above 95% over the
whole region of pseudorapidity covered by the CMS muon system, abs(eta) < 2.4,
while the probability to misidentify a hadron as a muon is well below 1%. The
efficiency to trigger on single muons with pT above a few GeV is higher than
90% over the full eta range, and typically substantially better. The overall
momentum scale is measured to a precision of 0.2% with muons from Z decays. The
transverse momentum resolution varies from 1% to 6% depending on pseudorapidity
for muons with pT below 100 GeV and, using cosmic rays, it is shown to be
better than 10% in the central region up to pT = 1 TeV. Observed distributions
of all quantities are well reproduced by the Monte Carlo simulation.Comment: Replaced with published version. Added journal reference and DO
Performance of CMS muon reconstruction in pp collision events at sqrt(s) = 7 TeV
The performance of muon reconstruction, identification, and triggering in CMS
has been studied using 40 inverse picobarns of data collected in pp collisions
at sqrt(s) = 7 TeV at the LHC in 2010. A few benchmark sets of selection
criteria covering a wide range of physics analysis needs have been examined.
For all considered selections, the efficiency to reconstruct and identify a
muon with a transverse momentum pT larger than a few GeV is above 95% over the
whole region of pseudorapidity covered by the CMS muon system, abs(eta) < 2.4,
while the probability to misidentify a hadron as a muon is well below 1%. The
efficiency to trigger on single muons with pT above a few GeV is higher than
90% over the full eta range, and typically substantially better. The overall
momentum scale is measured to a precision of 0.2% with muons from Z decays. The
transverse momentum resolution varies from 1% to 6% depending on pseudorapidity
for muons with pT below 100 GeV and, using cosmic rays, it is shown to be
better than 10% in the central region up to pT = 1 TeV. Observed distributions
of all quantities are well reproduced by the Monte Carlo simulation.Comment: Replaced with published version. Added journal reference and DO
X-ray emission from the Sombrero galaxy: discrete sources
We present a study of discrete X-ray sources in and around the
bulge-dominated, massive Sa galaxy, Sombrero (M104), based on new and archival
Chandra observations with a total exposure of ~200 ks. With a detection limit
of L_X = 1E37 erg/s and a field of view covering a galactocentric radius of ~30
kpc (11.5 arcminute), 383 sources are detected. Cross-correlation with Spitler
et al.'s catalogue of Sombrero globular clusters (GCs) identified from HST/ACS
observations reveals 41 X-rays sources in GCs, presumably low-mass X-ray
binaries (LMXBs). We quantify the differential luminosity functions (LFs) for
both the detected GC and field LMXBs, whose power-low indices (~1.1 for the
GC-LF and ~1.6 for field-LF) are consistent with previous studies for
elliptical galaxies. With precise sky positions of the GCs without a detected
X-ray source, we further quantify, through a fluctuation analysis, the GC LF at
fainter luminosities down to 1E35 erg/s. The derived index rules out a
faint-end slope flatter than 1.1 at a 2 sigma significance, contrary to recent
findings in several elliptical galaxies and the bulge of M31. On the other
hand, the 2-6 keV unresolved emission places a tight constraint on the field
LF, implying a flattened index of ~1.0 below 1E37 erg/s. We also detect 101
sources in the halo of Sombrero. The presence of these sources cannot be
interpreted as galactic LMXBs whose spatial distribution empirically follows
the starlight. Their number is also higher than the expected number of cosmic
AGNs (52+/-11 [1 sigma]) whose surface density is constrained by deep X-ray
surveys. We suggest that either the cosmic X-ray background is unusually high
in the direction of Sombrero, or a distinct population of X-ray sources is
present in the halo of Sombrero.Comment: 11 figures, 5 tables, ApJ in pres
Azimuthal anisotropy of charged particles at high transverse momenta in PbPb collisions at sqrt(s[NN]) = 2.76 TeV
The azimuthal anisotropy of charged particles in PbPb collisions at
nucleon-nucleon center-of-mass energy of 2.76 TeV is measured with the CMS
detector at the LHC over an extended transverse momentum (pt) range up to
approximately 60 GeV. The data cover both the low-pt region associated with
hydrodynamic flow phenomena and the high-pt region where the anisotropies may
reflect the path-length dependence of parton energy loss in the created medium.
The anisotropy parameter (v2) of the particles is extracted by correlating
charged tracks with respect to the event-plane reconstructed by using the
energy deposited in forward-angle calorimeters. For the six bins of collision
centrality studied, spanning the range of 0-60% most-central events, the
observed v2 values are found to first increase with pt, reaching a maximum
around pt = 3 GeV, and then to gradually decrease to almost zero, with the
decline persisting up to at least pt = 40 GeV over the full centrality range
measured.Comment: Replaced with published version. Added journal reference and DO
Search for new physics with same-sign isolated dilepton events with jets and missing transverse energy
A search for new physics is performed in events with two same-sign isolated
leptons, hadronic jets, and missing transverse energy in the final state. The
analysis is based on a data sample corresponding to an integrated luminosity of
4.98 inverse femtobarns produced in pp collisions at a center-of-mass energy of
7 TeV collected by the CMS experiment at the LHC. This constitutes a factor of
140 increase in integrated luminosity over previously published results. The
observed yields agree with the standard model predictions and thus no evidence
for new physics is found. The observations are used to set upper limits on
possible new physics contributions and to constrain supersymmetric models. To
facilitate the interpretation of the data in a broader range of new physics
scenarios, information on the event selection, detector response, and
efficiencies is provided.Comment: Published in Physical Review Letter
- …