11,041 research outputs found

    Probabilistic Graphical Models on Multi-Core CPUs using Java 8

    Get PDF
    In this paper, we discuss software design issues related to the development of parallel computational intelligence algorithms on multi-core CPUs, using the new Java 8 functional programming features. In particular, we focus on probabilistic graphical models (PGMs) and present the parallelisation of a collection of algorithms that deal with inference and learning of PGMs from data. Namely, maximum likelihood estimation, importance sampling, and greedy search for solving combinatorial optimisation problems. Through these concrete examples, we tackle the problem of defining efficient data structures for PGMs and parallel processing of same-size batches of data sets using Java 8 features. We also provide straightforward techniques to code parallel algorithms that seamlessly exploit multi-core processors. The experimental analysis, carried out using our open source AMIDST (Analysis of MassIve Data STreams) Java toolbox, shows the merits of the proposed solutions.Comment: Pre-print version of the paper presented in the special issue on Computational Intelligence Software at IEEE Computational Intelligence Magazine journa

    Phylodynamics of H5N1 Highly Pathogenic Avian Influenza in Europe, 2005-2010: Potential for Molecular Surveillance of New Outbreaks.

    Get PDF
    Previous Bayesian phylogeographic studies of H5N1 highly pathogenic avian influenza viruses (HPAIVs) explored the origin and spread of the epidemic from China into Russia, indicating that HPAIV circulated in Russia prior to its detection there in 2005. In this study, we extend this research to explore the evolution and spread of HPAIV within Europe during the 2005-2010 epidemic, using all available sequences of the hemagglutinin (HA) and neuraminidase (NA) gene regions that were collected in Europe and Russia during the outbreak. We use discrete-trait phylodynamic models within a Bayesian statistical framework to explore the evolution of HPAIV. Our results indicate that the genetic diversity and effective population size of HPAIV peaked between mid-2005 and early 2006, followed by drastic decline in 2007, which coincides with the end of the epidemic in Europe. Our results also suggest that domestic birds were the most likely source of the spread of the virus from Russia into Europe. Additionally, estimates of viral dispersal routes indicate that Russia, Romania, and Germany were key epicenters of these outbreaks. Our study quantifies the dynamics of a major European HPAIV pandemic and substantiates the ability of phylodynamic models to improve molecular surveillance of novel AIVs

    Genomics clarifies taxonomic boundaries in a difficult species complex.

    Get PDF
    Efforts to taxonomically delineate species are often confounded with conflicting information and subjective interpretation. Advances in genomic methods have resulted in a new approach to taxonomic identification that stands to greatly reduce much of this conflict. This approach is ideal for species complexes, where divergence times are recent (evolutionarily) and lineages less well defined. The California Roach/Hitch fish species complex is an excellent example, experiencing a convoluted geologic history, diverse habitats, conflicting species designations and potential admixture between species. Here we use this fish complex to illustrate how genomics can be used to better clarify and assign taxonomic categories. We performed restriction-site associated DNA (RAD) sequencing on 255 Roach and Hitch samples collected throughout California to discover and genotype thousands of single nucleotide polymorphism (SNPs). Data were then used in hierarchical principal component, admixture, and FST analyses to provide results that consistently resolved a number of ambiguities and provided novel insights across a range of taxonomic levels. At the highest level, our results show that the CA Roach/Hitch complex should be considered five species split into two genera (4 + 1) as opposed to two species from distinct genera (1 +1). Subsequent levels revealed multiple subspecies and distinct population segments within identified species. At the lowest level, our results indicate Roach from a large coastal river are not native but instead introduced from a nearby river. Overall, this study provides a clear demonstration of the power of genomic methods for informing taxonomy and serves as a model for future studies wishing to decipher difficult species questions. By allowing for systematic identification across multiple scales, taxonomic structure can then be tied to historical and contemporary ecological, geographic or anthropogenic factors

    Using Machine Learning to Predict Swine Movements within a Regional Program to Improve Control of Infectious Diseases in the US.

    Get PDF
    Between-farm animal movement is one of the most important factors influencing the spread of infectious diseases in food animals, including in the US swine industry. Understanding the structural network of contacts in a food animal industry is prerequisite to planning for efficient production strategies and for effective disease control measures. Unfortunately, data regarding between-farm animal movements in the US are not systematically collected and thus, such information is often unavailable. In this paper, we develop a procedure to replicate the structure of a network, making use of partial data available, and subsequently use the model developed to predict animal movements among sites in 34 Minnesota counties. First, we summarized two networks of swine producing facilities in Minnesota, then we used a machine learning technique referred to as random forest, an ensemble of independent classification trees, to estimate the probability of pig movements between farms and/or markets sites located in two counties in Minnesota. The model was calibrated and tested by comparing predicted data and observed data in those two counties for which data were available. Finally, the model was used to predict animal movements in sites located across 34 Minnesota counties. Variables that were important in predicting pig movements included between-site distance, ownership, and production type of the sending and receiving farms and/or markets. Using a weighted-kernel approach to describe spatial variation in the centrality measures of the predicted network, we showed that the south-central region of the study area exhibited high aggregation of predicted pig movements. Our results show an overlap with the distribution of outbreaks of porcine reproductive and respiratory syndrome, which is believed to be transmitted, at least in part, though animal movements. While the correspondence of movements and disease is not a causal test, it suggests that the predicted network may approximate actual movements. Accordingly, the predictions provided here might help to design and implement control strategies in the region. Additionally, the methodology here may be used to estimate contact networks for other livestock systems when only incomplete information regarding animal movements is available

    Are stealth scalar fields stable?

    Full text link
    Non-gravitating (stealth) scalar fields associated with Minkowski space in scalar-tensor gravity are examined. Analytical solutions for both non-minimally coupled scalar field theory and for Brans-Dicke gravity are studied and their stability with respect to tensor perturbations is assessed using a covariant and gauge-invariant formalism developed for alternative gravity. For Brans-Dicke solutions, the stability with respect to homogeneous perturbations is also studied. There are regions of parameter space corresponding to stability and other regions corresponding to instability.Comment: 10 pages, 1 table, no figures, to appear in Phys. Rev,

    Asterias: a parallelized web-based suite for the analysis of expression and aCGH data

    Get PDF
    Asterias (\url{http://www.asterias.info}) is an integrated collection of freely-accessible web tools for the analysis of gene expression and aCGH data. Most of the tools use parallel computing (via MPI). Most of our applications allow the user to obtain additional information for user-selected genes by using clickable links in tables and/or figures. Our tools include: normalization of expression and aCGH data; converting between different types of gene/clone and protein identifiers; filtering and imputation; finding differentially expressed genes related to patient class and survival data; searching for models of class prediction; using random forests to search for minimal models for class prediction or for large subsets of genes with predictive capacity; searching for molecular signatures and predictive genes with survival data; detecting regions of genomic DNA gain or loss. The capability to send results between different applications, access to additional functional information, and parallelized computation make our suite unique and exploit features only available to web-based applications.Comment: web based application; 3 figure

    Inherent tracers for carbon capture and storage in sedimentary formations: composition and applications

    Get PDF
    Inherent tracers - the “natural” isotopic and trace gas composition of captured CO₂ streams – are potentially powerful tracers for use in CCS technology. This review outlines for the first time the expected carbon isotope and noble gas compositions of captured CO₂ streams from a range of feedstocks, CO₂-generating processes and carbon capture techniques. The C-isotope composition of captured CO₂ will be most strongly controlled by the feedstock, but significant isotope fractionation is possible during capture; noble gas concentrations will be controlled by the capture technique employed. Comparison with likely baseline data suggests that CO₂ generated from fossil fuel feedstocks will often have δ13C distinguishable from storage reservoir CO₂. Noble gases in amine-captured CO₂ streams are likely to be low concentration, with isotopic ratios dependant on the feedstock, but CO₂ captured from oxyfuel plants may be strongly enriched in Kr and Xe which are potentially valuable subsurface tracers. CO₂ streams derived from fossil fuels will have noble gas isotope ratios reflecting a radiogenic component that will be difficult to distinguish in the storage reservoir, but inheritance of radiogenic components will provide an easily recognisable signature in the case of any unplanned migration into shallow aquifers or to the surface

    Percolating through networks of random thresholds: Finite temperature electron tunneling in metal nanocrystal arrays

    Get PDF
    We investigate how temperature affects transport through large networks of nonlinear conductances with distributed thresholds. In monolayers of weakly-coupled gold nanocrystals, quenched charge disorder produces a range of local thresholds for the onset of electron tunneling. Our measurements delineate two regimes separated by a cross-over temperature TT^*. Up to TT^* the nonlinear zero-temperature shape of the current-voltage curves survives, but with a threshold voltage for conduction that decreases linearly with temperature. Above TT^* the threshold vanishes and the low-bias conductance increases rapidly with temperature. We develop a model that accounts for these findings and predicts TT^*.Comment: 5 pages including 3 figures; replaced 3/30/04: minor changes; final versio

    Superconductivity and charge carrier localization in ultrathin La1.85Sr0.15CuO4/La2CuO4\mathbf{{La_{1.85}Sr_{0.15}CuO_4}/{La_2CuO_4}} bilayers

    Get PDF
    La1.85Sr0.15CuO4\mathrm{La_{1.85}Sr_{0.15}CuO_4}/La2CuO4\mathrm{La_2CuO_4} (LSCO15/LCO) bilayers with a precisely controlled thickness of N unit cells (UCs) of the former and M UCs of the latter ([LSCO15\_N/LCO\_M]) were grown on (001)-oriented {\slao} (SLAO) substrates with pulsed laser deposition (PLD). X-ray diffraction and reciprocal space map (RSM) studies confirmed the epitaxial growth of the bilayers and showed that a [LSCO15\_2/LCO\_2] bilayer is fully strained, whereas a [LSCO15\_2/LCO\_7] bilayer is already partially relaxed. The \textit{in situ} monitoring of the growth with reflection high energy electron diffraction (RHEED) revealed that the gas environment during deposition has a surprisingly strong effect on the growth mode and thus on the amount of disorder in the first UC of LSCO15 (or the first two monolayers of LSCO15 containing one CuO2\mathrm{CuO_2} plane each). For samples grown in pure N2O\mathrm{N_2O} gas (growth type-B), the first LSCO15 UC next to the SLAO substrate is strongly disordered. This disorder is strongly reduced if the growth is performed in a mixture of N2O\mathrm{N_2O} and O2\mathrm{O_2} gas (growth type-A). Electric transport measurements confirmed that the first UC of LSCO15 next to the SLAO substrate is highly resistive and shows no sign of superconductivity for growth type-B, whereas it is superconducting for growth type-A. Furthermore, we found, rather surprisingly, that the conductivity of the LSCO15 UC next to the LCO capping layer strongly depends on the thickness of the latter. A LCO capping layer with 7~UCs leads to a strong localization of the charge carriers in the adjacent LSCO15 UC and suppresses superconductivity. The magneto-transport data suggest a similarity with the case of weakly hole doped LSCO single crystals that are in a so-called {"{cluster-spin-glass state}"
    corecore