11,052 research outputs found
Probabilistic Graphical Models on Multi-Core CPUs using Java 8
In this paper, we discuss software design issues related to the development
of parallel computational intelligence algorithms on multi-core CPUs, using the
new Java 8 functional programming features. In particular, we focus on
probabilistic graphical models (PGMs) and present the parallelisation of a
collection of algorithms that deal with inference and learning of PGMs from
data. Namely, maximum likelihood estimation, importance sampling, and greedy
search for solving combinatorial optimisation problems. Through these concrete
examples, we tackle the problem of defining efficient data structures for PGMs
and parallel processing of same-size batches of data sets using Java 8
features. We also provide straightforward techniques to code parallel
algorithms that seamlessly exploit multi-core processors. The experimental
analysis, carried out using our open source AMIDST (Analysis of MassIve Data
STreams) Java toolbox, shows the merits of the proposed solutions.Comment: Pre-print version of the paper presented in the special issue on
Computational Intelligence Software at IEEE Computational Intelligence
Magazine journa
Phylodynamics of H5N1 Highly Pathogenic Avian Influenza in Europe, 2005-2010: Potential for Molecular Surveillance of New Outbreaks.
Previous Bayesian phylogeographic studies of H5N1 highly pathogenic avian influenza viruses (HPAIVs) explored the origin and spread of the epidemic from China into Russia, indicating that HPAIV circulated in Russia prior to its detection there in 2005. In this study, we extend this research to explore the evolution and spread of HPAIV within Europe during the 2005-2010 epidemic, using all available sequences of the hemagglutinin (HA) and neuraminidase (NA) gene regions that were collected in Europe and Russia during the outbreak. We use discrete-trait phylodynamic models within a Bayesian statistical framework to explore the evolution of HPAIV. Our results indicate that the genetic diversity and effective population size of HPAIV peaked between mid-2005 and early 2006, followed by drastic decline in 2007, which coincides with the end of the epidemic in Europe. Our results also suggest that domestic birds were the most likely source of the spread of the virus from Russia into Europe. Additionally, estimates of viral dispersal routes indicate that Russia, Romania, and Germany were key epicenters of these outbreaks. Our study quantifies the dynamics of a major European HPAIV pandemic and substantiates the ability of phylodynamic models to improve molecular surveillance of novel AIVs
Genomics clarifies taxonomic boundaries in a difficult species complex.
Efforts to taxonomically delineate species are often confounded with conflicting information and subjective interpretation. Advances in genomic methods have resulted in a new approach to taxonomic identification that stands to greatly reduce much of this conflict. This approach is ideal for species complexes, where divergence times are recent (evolutionarily) and lineages less well defined. The California Roach/Hitch fish species complex is an excellent example, experiencing a convoluted geologic history, diverse habitats, conflicting species designations and potential admixture between species. Here we use this fish complex to illustrate how genomics can be used to better clarify and assign taxonomic categories. We performed restriction-site associated DNA (RAD) sequencing on 255 Roach and Hitch samples collected throughout California to discover and genotype thousands of single nucleotide polymorphism (SNPs). Data were then used in hierarchical principal component, admixture, and FST analyses to provide results that consistently resolved a number of ambiguities and provided novel insights across a range of taxonomic levels. At the highest level, our results show that the CA Roach/Hitch complex should be considered five species split into two genera (4 + 1) as opposed to two species from distinct genera (1 +1). Subsequent levels revealed multiple subspecies and distinct population segments within identified species. At the lowest level, our results indicate Roach from a large coastal river are not native but instead introduced from a nearby river. Overall, this study provides a clear demonstration of the power of genomic methods for informing taxonomy and serves as a model for future studies wishing to decipher difficult species questions. By allowing for systematic identification across multiple scales, taxonomic structure can then be tied to historical and contemporary ecological, geographic or anthropogenic factors
Using Machine Learning to Predict Swine Movements within a Regional Program to Improve Control of Infectious Diseases in the US.
Between-farm animal movement is one of the most important factors influencing the spread of infectious diseases in food animals, including in the US swine industry. Understanding the structural network of contacts in a food animal industry is prerequisite to planning for efficient production strategies and for effective disease control measures. Unfortunately, data regarding between-farm animal movements in the US are not systematically collected and thus, such information is often unavailable. In this paper, we develop a procedure to replicate the structure of a network, making use of partial data available, and subsequently use the model developed to predict animal movements among sites in 34 Minnesota counties. First, we summarized two networks of swine producing facilities in Minnesota, then we used a machine learning technique referred to as random forest, an ensemble of independent classification trees, to estimate the probability of pig movements between farms and/or markets sites located in two counties in Minnesota. The model was calibrated and tested by comparing predicted data and observed data in those two counties for which data were available. Finally, the model was used to predict animal movements in sites located across 34 Minnesota counties. Variables that were important in predicting pig movements included between-site distance, ownership, and production type of the sending and receiving farms and/or markets. Using a weighted-kernel approach to describe spatial variation in the centrality measures of the predicted network, we showed that the south-central region of the study area exhibited high aggregation of predicted pig movements. Our results show an overlap with the distribution of outbreaks of porcine reproductive and respiratory syndrome, which is believed to be transmitted, at least in part, though animal movements. While the correspondence of movements and disease is not a causal test, it suggests that the predicted network may approximate actual movements. Accordingly, the predictions provided here might help to design and implement control strategies in the region. Additionally, the methodology here may be used to estimate contact networks for other livestock systems when only incomplete information regarding animal movements is available
Are stealth scalar fields stable?
Non-gravitating (stealth) scalar fields associated with Minkowski space in
scalar-tensor gravity are examined. Analytical solutions for both non-minimally
coupled scalar field theory and for Brans-Dicke gravity are studied and their
stability with respect to tensor perturbations is assessed using a covariant
and gauge-invariant formalism developed for alternative gravity. For
Brans-Dicke solutions, the stability with respect to homogeneous perturbations
is also studied. There are regions of parameter space corresponding to
stability and other regions corresponding to instability.Comment: 10 pages, 1 table, no figures, to appear in Phys. Rev,
Asterias: a parallelized web-based suite for the analysis of expression and aCGH data
Asterias (\url{http://www.asterias.info}) is an integrated collection of
freely-accessible web tools for the analysis of gene expression and aCGH data.
Most of the tools use parallel computing (via MPI). Most of our applications
allow the user to obtain additional information for user-selected genes by
using clickable links in tables and/or figures. Our tools include:
normalization of expression and aCGH data; converting between different types
of gene/clone and protein identifiers; filtering and imputation; finding
differentially expressed genes related to patient class and survival data;
searching for models of class prediction; using random forests to search for
minimal models for class prediction or for large subsets of genes with
predictive capacity; searching for molecular signatures and predictive genes
with survival data; detecting regions of genomic DNA gain or loss. The
capability to send results between different applications, access to additional
functional information, and parallelized computation make our suite unique and
exploit features only available to web-based applications.Comment: web based application; 3 figure
Inherent tracers for carbon capture and storage in sedimentary formations: composition and applications
Inherent tracers - the “natural” isotopic and trace gas composition of captured CO₂ streams – are potentially powerful tracers for use in CCS technology. This review outlines for the first time the expected carbon isotope and noble gas compositions of captured CO₂ streams from a range of feedstocks, CO₂-generating processes and carbon capture techniques. The C-isotope composition of captured CO₂ will be most strongly controlled by the feedstock, but significant isotope fractionation is possible during capture; noble gas concentrations will be controlled by the capture technique employed. Comparison with likely baseline data suggests that CO₂ generated from fossil fuel feedstocks will often have δ13C distinguishable from storage reservoir CO₂. Noble gases in amine-captured CO₂ streams are likely to be low concentration, with isotopic ratios dependant on the feedstock, but CO₂ captured from oxyfuel plants may be strongly enriched in Kr and Xe which are potentially valuable subsurface tracers. CO₂ streams derived from fossil fuels will have noble gas isotope ratios reflecting a radiogenic component that will be difficult to distinguish in the storage reservoir, but inheritance of radiogenic components will provide an easily recognisable signature in the case of any unplanned migration into shallow aquifers or to the surface
Percolating through networks of random thresholds: Finite temperature electron tunneling in metal nanocrystal arrays
We investigate how temperature affects transport through large networks of
nonlinear conductances with distributed thresholds. In monolayers of
weakly-coupled gold nanocrystals, quenched charge disorder produces a range of
local thresholds for the onset of electron tunneling. Our measurements
delineate two regimes separated by a cross-over temperature . Up to
the nonlinear zero-temperature shape of the current-voltage curves survives,
but with a threshold voltage for conduction that decreases linearly with
temperature. Above the threshold vanishes and the low-bias conductance
increases rapidly with temperature. We develop a model that accounts for these
findings and predicts .Comment: 5 pages including 3 figures; replaced 3/30/04: minor changes; final
versio
Superconductivity and charge carrier localization in ultrathin bilayers
/ (LSCO15/LCO) bilayers
with a precisely controlled thickness of N unit cells (UCs) of the former and M
UCs of the latter ([LSCO15\_N/LCO\_M]) were grown on (001)-oriented {\slao}
(SLAO) substrates with pulsed laser deposition (PLD). X-ray diffraction and
reciprocal space map (RSM) studies confirmed the epitaxial growth of the
bilayers and showed that a [LSCO15\_2/LCO\_2] bilayer is fully strained,
whereas a [LSCO15\_2/LCO\_7] bilayer is already partially relaxed. The
\textit{in situ} monitoring of the growth with reflection high energy electron
diffraction (RHEED) revealed that the gas environment during deposition has a
surprisingly strong effect on the growth mode and thus on the amount of
disorder in the first UC of LSCO15 (or the first two monolayers of LSCO15
containing one plane each). For samples grown in pure
gas (growth type-B), the first LSCO15 UC next to the SLAO
substrate is strongly disordered. This disorder is strongly reduced if the
growth is performed in a mixture of and gas
(growth type-A). Electric transport measurements confirmed that the first UC of
LSCO15 next to the SLAO substrate is highly resistive and shows no sign of
superconductivity for growth type-B, whereas it is superconducting for growth
type-A. Furthermore, we found, rather surprisingly, that the conductivity of
the LSCO15 UC next to the LCO capping layer strongly depends on the thickness
of the latter. A LCO capping layer with 7~UCs leads to a strong localization of
the charge carriers in the adjacent LSCO15 UC and suppresses superconductivity.
The magneto-transport data suggest a similarity with the case of weakly hole
doped LSCO single crystals that are in a so-called {"{cluster-spin-glass
state}"
- …