962 research outputs found

    The variant call format and VCFtools

    Get PDF
    Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API

    Simulations of energetic beam deposition: from picoseconds to seconds

    Full text link
    We present a new method for simulating crystal growth by energetic beam deposition. The method combines a Kinetic Monte-Carlo simulation for the thermal surface diffusion with a small scale molecular dynamics simulation of every single deposition event. We have implemented the method using the effective medium theory as a model potential for the atomic interactions, and present simulations for Ag/Ag(111) and Pt/Pt(111) for incoming energies up to 35 eV. The method is capable of following the growth of several monolayers at realistic growth rates of 1 monolayer per second, correctly accounting for both energy-induced atomic mobility and thermal surface diffusion. We find that the energy influences island and step densities and can induce layer-by-layer growth. We find an optimal energy for layer-by-layer growth (25 eV for Ag), which correlates with where the net impact-induced downward interlayer transport is at a maximum. A high step density is needed for energy induced layer-by-layer growth, hence the effect dies away at increased temperatures, where thermal surface diffusion reduces the step density. As part of the development of the method, we present molecular dynamics simulations of single atom-surface collisions on flat parts of the surface and near straight steps, we identify microscopic mechanisms by which the energy influences the growth, and we discuss the nature of the energy-induced atomic mobility

    Ab initio molecular dynamics using density based energy functionals: application to ground state geometries of some small clusters

    Get PDF
    The ground state geometries of some small clusters have been obtained via ab initio molecular dynamical simulations by employing density based energy functionals. The approximate kinetic energy functionals that have been employed are the standard Thomas-Fermi (TTF)(T_{TF}) along with the Weizsacker correction TWT_W and a combination F(Ne)TTF+TWF(N_e)T_{TF} + T_W. It is shown that the functional involving F(Ne)F(N_e) gives superior charge densities and bondlengths over the standard functional. Apart from dimers and trimers of Na, Mg, Al, Li, Si, equilibrium geometries for LinAl,n=1,8Li_nAl, n=1,8 and Al13Al_{13} clusters have also been reported. For all the clusters investigated, the method yields the ground state geometries with the correct symmetries with bondlengths within 5\% when compared with the corresponding results obtained via full orbital based Kohn-Sham method. The method is fast and a promising one to study the ground state geometries of large clusters.Comment: 15 pages, 3 PS figure

    Quantifying single nucleotide variant detection sensitivity in exome sequencing

    Get PDF
    BACKGROUND: The targeted capture and sequencing of genomic regions has rapidly demonstrated its utility in genetic studies. Inherent in this technology is considerable heterogeneity of target coverage and this is expected to systematically impact our sensitivity to detect genuine polymorphisms. To fully interpret the polymorphisms identified in a genetic study it is often essential to both detect polymorphisms and to understand where and with what probability real polymorphisms may have been missed. RESULTS: Using down-sampling of 30 deeply sequenced exomes and a set of gold-standard single nucleotide variant (SNV) genotype calls for each sample, we developed an empirical model relating the read depth at a polymorphic site to the probability of calling the correct genotype at that site. We find that measured sensitivity in SNV detection is substantially worse than that predicted from the naive expectation of sampling from a binomial. This calibrated model allows us to produce single nucleotide resolution SNV sensitivity estimates which can be merged to give summary sensitivity measures for any arbitrary partition of the target sequences (nucleotide, exon, gene, pathway, exome). These metrics are directly comparable between platforms and can be combined between samples to give “power estimates” for an entire study. We estimate a local read depth of 13X is required to detect the alleles and genotype of a heterozygous SNV 95% of the time, but only 3X for a homozygous SNV. At a mean on-target read depth of 20X, commonly used for rare disease exome sequencing studies, we predict 5–15% of heterozygous and 1–4% of homozygous SNVs in the targeted regions will be missed. CONCLUSIONS: Non-reference alleles in the heterozygote state have a high chance of being missed when commonly applied read coverage thresholds are used despite the widely held assumption that there is good polymorphism detection at these coverage levels. Such alleles are likely to be of functional importance in population based studies of rare diseases, somatic mutations in cancer and explaining the “missing heritability” of quantitative traits

    Evolution favors protein mutational robustness in sufficiently large populations

    Get PDF
    BACKGROUND: An important question is whether evolution favors properties such as mutational robustness or evolvability that do not directly benefit any individual, but can influence the course of future evolution. Functionally similar proteins can differ substantially in their robustness to mutations and capacity to evolve new functions, but it has remained unclear whether any of these differences might be due to evolutionary selection for these properties. RESULTS: Here we use laboratory experiments to demonstrate that evolution favors protein mutational robustness if the evolving population is sufficiently large. We neutrally evolve cytochrome P450 proteins under identical selection pressures and mutation rates in populations of different sizes, and show that proteins from the larger and thus more polymorphic population tend towards higher mutational robustness. Proteins from the larger population also evolve greater stability, a biophysical property that is known to enhance both mutational robustness and evolvability. The excess mutational robustness and stability is well described by existing mathematical theories, and can be quantitatively related to the way that the proteins occupy their neutral network. CONCLUSIONS: Our work is the first experimental demonstration of the general tendency of evolution to favor mutational robustness and protein stability in highly polymorphic populations. We suggest that this phenomenon may contribute to the mutational robustness and evolvability of viruses and bacteria that exist in large populations
    corecore