1,947 research outputs found

    Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms

    Full text link
    Clustering non-Euclidean data is difficult, and one of the most used algorithms besides hierarchical clustering is the popular algorithm Partitioning Around Medoids (PAM), also simply referred to as k-medoids. In Euclidean geometry the mean-as used in k-means-is a good estimator for the cluster center, but this does not hold for arbitrary dissimilarities. PAM uses the medoid instead, the object with the smallest dissimilarity to all others in the cluster. This notion of centrality can be used with any (dis-)similarity, and thus is of high relevance to many domains such as biology that require the use of Jaccard, Gower, or more complex distances. A key issue with PAM is its high run time cost. We propose modifications to the PAM algorithm to achieve an O(k)-fold speedup in the second SWAP phase of the algorithm, but will still find the same results as the original PAM algorithm. If we slightly relax the choice of swaps performed (at comparable quality), we can further accelerate the algorithm by performing up to k swaps in each iteration. With the substantially faster SWAP, we can now also explore alternative strategies for choosing the initial medoids. We also show how the CLARA and CLARANS algorithms benefit from these modifications. It can easily be combined with earlier approaches to use PAM and CLARA on big data (some of which use PAM as a subroutine, hence can immediately benefit from these improvements), where the performance with high k becomes increasingly important. In experiments on real data with k=100, we observed a 200-fold speedup compared to the original PAM SWAP algorithm, making PAM applicable to larger data sets as long as we can afford to compute a distance matrix, and in particular to higher k (at k=2, the new SWAP was only 1.5 times faster, as the speedup is expected to increase with k)

    Temperature induced solubility transitions of various poly(2-oxazoline)s in ethanol-water solvent mixtures

    Get PDF
    The solution behavior of a series of poly(2-oxazoline)s with different side chains, namely methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, pentyl, hexyl, heptyl, octyl, nonyl, phenyl and benzyl, are reported in ethanol-water solvent mixtures based on turbidimetry investigations. The LCST transitions of poly(2-oxazoline) s with propyl side chains and the UCST transitions of the poly(2-oxazoline) s with more hydrophobic side chains are discussed in relation to the ethanol-water solvent composition and structure. The poly(2-alkyl-2-oxazoline) s with side chains longer than propyl only dissolved during the first heating run, which is discussed and correlated to the melting transition of the polymers

    Thermoresponsive poly(2-oxazoline) block copolymers exhibiting two cloud points: complex multistep assembly behavior

    Get PDF
    Aqueous solutions of poly(2-oxazoline) block copolymers consisting of a 2-ethyl-2-oxazoline block and a block consisting of a random copolymer of 2-ethyl-2-oxazoline and 2-n-propyl-2-oxazoline (PEtOx-block-P(EtOx-stat-PropOx)) have been studied by dynamic light scattering (DLS), static light scattering (SLS), and turbidimetry. Even at temperatures significantly below the lower critical solution temperature (LCST), polymer unimers are found to coexist with a few large aggregates with an open structure. When heated, the systems exhibit an intricate transmittance behavior whereby the samples becomes visually clear again after an initial cloud point and then exhibit a second cloud point at even higher temperatures. The DLS data indicate that the aggregates formed around the first cloud point restructure and fragment into smaller micelle-like structures ascribed to further dehydration of the more hydrophobic PPropOx containing block, causing the samples to become optically clear again. The observed fragmentation is confirmed by the SLS experiments. At even higher temperatures, both blocks become hydrophobic, causing the formation of large, compact aggregates, resulting in a second cloud point

    Efficient cationic ring-opening polymerization of diverse cyclic imino ethers: unexpected copolymerization behavior

    Get PDF
    The recently developed fast microwave-assisted cationic ring-opening polymerization procedure for 2-oxazolines seems to be ideally suited for slower polymerizing cyclic imino ether monomers. In this study we report the effect of the cyclic imino ether structure on the polymerization rate under exactly the same microwave-assisted conditions revealing that indeed less reactive cyclic imino ethers, including 2-oxazines as well as 4- and 5-substituted 2-oxazolines, can be polymerized to at least 50% conversion for the slowest monomer, namely 5-methyl-2-butyl-2-oxazoline, within 10 h. In addition, the copolymerization behavior of 4-ethyl-2-butyl-2-oxazoline with 2-methyl-2-oxazoline and 2-phenyl-2-oxazoline unexpectedly revealed faster incorporation of the less reactive 4-ethy1-2-buty1-2-oxazoline monomer compared to 2-phenyl-2-oxazoline due to the increased bulk of the latter monomer amplifying the sterical hindrance for polymerization onto the 4-ethyl-2-butyl-2-oxazolinium propagating species

    A reverse engineering approach to the suppression of citation biases reveals universal properties of citation distributions

    Get PDF
    The large amount of information contained in bibliographic databases has recently boosted the use of citations, and other indicators based on citation numbers, as tools for the quantitative assessment of scientific research. Citations counts are often interpreted as proxies for the scientific influence of papers, journals, scholars, and institutions. However, a rigorous and scientifically grounded methodology for a correct use of citation counts is still missing. In particular, cross-disciplinary comparisons in terms of raw citation counts systematically favors scientific disciplines with higher citation and publication rates. Here we perform an exhaustive study of the citation patterns of millions of papers, and derive a simple transformation of citation counts able to suppress the disproportionate citation counts among scientific domains. We find that the transformation is well described by a power-law function, and that the parameter values of the transformation are typical features of each scientific discipline. Universal properties of citation patterns descend therefore from the fact that citation distributions for papers in a specific field are all part of the same family of univariate distributions.Comment: 9 pages, 6 figures. Supporting information files available at http://filrad.homelinux.or

    Novel prokaryotic expression of thioredoxin-fused insulinoma associated protein tyrosine phosphatase 2 (IA-2), its characterization and immunodiagnostic application

    Get PDF
    Background The insulinoma associated protein tyrosine phosphatase 2 (IA-2) is one of the immunodominant autoantigens involved in the autoimmune attack to the beta-cell in Type 1 Diabetes Mellitus. In this work we have developed a complete and original process for the production and recovery of the properly folded intracellular domain of IA-2 fused to thioredoxin (TrxIA-2ic) in Escherichia coli GI698 and GI724 strains. We have also carried out the biochemical and immunochemical characterization of TrxIA-2icand design variants of non-radiometric immunoassays for the efficient detection of IA-2 autoantibodies (IA-2A). Results The main findings can be summarized in the following statements: i) TrxIA-2ic expression after 3 h of induction on GI724 strain yielded ≈ 10 mg of highly pure TrxIA-2ic/L of culture medium by a single step purification by affinity chromatography, ii) the molecular weight of TrxIA-2ic (55,358 Da) could be estimated by SDS-PAGE, size exclusion chromatography and mass spectrometry, iii) TrxIA-2ic was properly identified by western blot and mass spectrometric analysis of proteolytic digestions (63.25 % total coverage), iv) excellent immunochemical behavior of properly folded full TrxIA-2ic was legitimized by inhibition or displacement of [35S]IA-2 binding from IA-2A present in Argentinian Type 1 Diabetic patients, v) great stability over time was found under proper storage conditions and vi) low cost and environmentally harmless ELISA methods for IA-2A assessment were developed, with colorimetric or chemiluminescent detection. Conclusions E. coli GI724 strain emerged as a handy source of recombinant IA-2ic, achieving high levels of expression as a thioredoxin fusion protein, adequately validated and applicable to the development of innovative and cost-effective immunoassays for IA-2A detection in most laboratories.Fil: Guerra, Luciano Lucas. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Faccinetti, Natalia Ines. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Trabucchi, Aldana. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Rovitto, Bruno David. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Sabljic, Adriana Victoria. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Poskus, Edgardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Iacono, Ruben Francisco. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; ArgentinaFil: Valdez, Silvina Noemi. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Houssay. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni. Universidad de Buenos Aires. Facultad de Farmacia y Bioquímica. Instituto de Estudios de la Inmunidad Humoral Prof. Ricardo A. Margni; Argentin

    Two-loop Yang-Mills diagrams from superstring amplitudes

    Get PDF
    Starting from the superstring amplitude describing interactions among D-branes with a constant world-volume field strength, we present a detailed analysis of how the open string degeneration limits reproduce the corresponding field theory Feynman diagrams. A key ingredient in the string construction is represented by the twisted (Prym) super differentials, as their periods encode the information about the background field. We provide an efficient method to calculate perturbatively the determinant of the twisted period matrix in terms of sets of super-moduli appropriate to the degeneration limits. Using this result we show that there is a precise one-to-one correspondence between the degeneration of different factors in the superstring amplitudes and one-particle irreducible Feynman diagrams capturing the gauge theory effective action at the two-loop level.Comment: 42 pages plus appendices, 10 figure

    A search for the decay modes B+/- to h+/- tau l

    Get PDF
    We present a search for the lepton flavor violating decay modes B+/- to h+/- tau l (h= K,pi; l= e,mu) using the BaBar data sample, which corresponds to 472 million BBbar pairs. The search uses events where one B meson is fully reconstructed in one of several hadronic final states. Using the momenta of the reconstructed B, h, and l candidates, we are able to fully determine the tau four-momentum. The resulting tau candidate mass is our main discriminant against combinatorial background. We see no evidence for B+/- to h+/- tau l decays and set a 90% confidence level upper limit on each branching fraction at the level of a few times 10^-5.Comment: 15 pages, 7 figures, submitted to Phys. Rev.

    Study of the reaction e^{+}e^{-} -->J/psi\pi^{+}\pi^{-} via initial-state radiation at BaBar

    Get PDF
    We study the process e+eJ/ψπ+πe^+e^-\to J/\psi\pi^{+}\pi^{-} with initial-state-radiation events produced at the PEP-II asymmetric-energy collider. The data were recorded with the BaBar detector at center-of-mass energies 10.58 and 10.54 GeV, and correspond to an integrated luminosity of 454 fb1\mathrm{fb^{-1}}. We investigate the J/ψπ+πJ/\psi \pi^{+}\pi^{-} mass distribution in the region from 3.5 to 5.5 GeV/c2\mathrm{GeV/c^{2}}. Below 3.7 GeV/c2\mathrm{GeV/c^{2}} the ψ(2S)\psi(2S) signal dominates, and above 4 GeV/c2\mathrm{GeV/c^{2}} there is a significant peak due to the Y(4260). A fit to the data in the range 3.74 -- 5.50 GeV/c2\mathrm{GeV/c^{2}} yields a mass value 4244±54244 \pm 5 (stat) ±4 \pm 4 (syst)MeV/c2\mathrm{MeV/c^{2}} and a width value 11415+16114 ^{+16}_{-15} (stat)±7 \pm 7(syst)MeV\mathrm{MeV} for this state. We do not confirm the report from the Belle collaboration of a broad structure at 4.01 GeV/c2\mathrm{GeV/c^{2}}. In addition, we investigate the π+π\pi^{+}\pi^{-} system which results from Y(4260) decay

    Evidence for an excess of B -> D(*) Tau Nu decays

    Get PDF
    Based on the full BaBar data sample, we report improved measurements of the ratios R(D(*)) = B(B -> D(*) Tau Nu)/B(B -> D(*) l Nu), where l is either e or mu. These ratios are sensitive to new physics contributions in the form of a charged Higgs boson. We measure R(D) = 0.440 +- 0.058 +- 0.042 and R(D*) = 0.332 +- 0.024 +- 0.018, which exceed the Standard Model expectations by 2.0 sigma and 2.7 sigma, respectively. Taken together, our results disagree with these expectations at the 3.4 sigma level. This excess cannot be explained by a charged Higgs boson in the type II two-Higgs-doublet model. We also report the observation of the decay B -> D Tau Nu, with a significance of 6.8 sigma.Comment: Expanded section on systematics, text corrections, improved the format of Figure 2 and included the effect of the change of the Tau polarization due to the charged Higg
    corecore