100 research outputs found

    A Fast and Specific Alignment Method for Minisatellite Maps

    Get PDF
    Background: Variable minisatellites count among the most polymorphic markers of eukaryotic and prokaryotic genomes. This variability can affect gene coding regions, like in the prion protein gene, or gene regulation regions, like for the cystatin B gene, and be associated or implicated in diseases: the Creutzfeld-Jakob disease and the myoclonus epilepsy type 1, for our examples. When it affects neutrally evolving regions, the polymorphism in length (i.e. in number of copies) of minisatellites proved useful in population genetics. Motivation: In these tandem repeat sequences, different mutational mechanisms let the number of copies, as well as the copies themselves, vary. Especially, the interspersion of events of tandem duplication/contraction and of punctual mutation makes the succession of variant repeat much more informative than the sole allele length. To exploit this information requires the ability to align minisatellite alleles by accounting for both punctual mutations and tandem duplications. Results: We propose a minisatellite maps alignment program that improves on previous solutions. Our new program is faster, simpler, considers an extended evolutionary model, and is available to the community. We test it on the data set of 609 alleles of the MSY1 (DYF155S1) human minisatellite andconfirm its abilityto recover known evolutionary signals. Our experiments highlight that the informativeness of minisatellites resides in their length and composition polymorphisms. Exploiting both simultaneously is critical to unravel the implications of variable minisatellites in the control of gene expression and diseases. Availability: Software is available at http://atgc.lirmm.fr/ms_align/ Keywords: VNTR, tandem repeat, tandem duplication, variable costs, dynamic programming, sequence comparison

    Author Index

    Get PDF

    Multilocus variable number of tandem repeat analysis reveals multiple introductions in Spain of Xanthomonas arboricola pv. Pruni, the causal agent of bacterial spot disease of stone fruits and almond

    Get PDF
    Xanthomonas arboricola pv. pruni is the causal agent of the bacterial spot disease of stone fruits, almond and some ornamental Prunus species. In Spain it was first detected in 2002 and since then, several outbreaks have occurred in different regions affecting mainly Japanese plum, peach and almond, both in commercial orchards and nurseries. As the origin of the introduction(s) was unknown, we have assessed the genetic diversity of 239 X. arboricola pv. pruni strains collected from 11 Spanish provinces from 2002 to 2013 and 25 reference strains from international collections. We have developed an optimized multilocus variable number of tandem repeat analysis (MLVA) scheme targeting 18 microsatellites and five minisatellites. A high discriminatory power was achieved since almost 50% of the Spanish strains were distinguishable, confirming the usefulness of this genotyping technique at small spatio-temporal scales. Spanish strains grouped in 18 genetic clusters (conservatively delineated so that each cluster contained haplotype networks linked by up to quadruple-locus variations). Furthermore, pairwise comparisons among populations from different provinces showed a strong genetic differentiation. Our results suggest multiple introductions of this pathogen in Spain and redistribution through contaminated nursery propagative plant material

    ModuleOrganizer: detecting modules in families of transposable elements

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Most known eukaryotic genomes contain mobile copied elements called transposable elements. In some species, these elements account for the majority of the genome sequence. They have been subject to many mutations and other genomic events (copies, deletions, captures) during transposition. The identification of these transformations remains a difficult issue. The study of families of transposable elements is generally founded on a multiple alignment of their sequences, a critical step that is adapted to transposons containing mostly localized nucleotide mutations. Many transposons that have lost their protein-coding capacity have undergone more complex rearrangements, needing the development of more complex methods in order to characterize the architecture of sequence variations.</p> <p>Results</p> <p>In this study, we introduce the concept of a <it>transposable element module</it>, a flexible motif present in at least two sequences of a family of transposable elements and built on a succession of maximal repeats. The paper proposes an assembly method working on a set of exact maximal repeats of a set of sequences to create such modules. It results in a graphical view of sequences segmented into modules, a representation that allows a flexible analysis of the transformations that have occurred between them. We have chosen as a demonstration data set in depth analysis of the transposable element Foldback in <it>Drosophila melanogaster</it>. Comparison with multiple alignment methods shows that our method is more sensitive for highly variable sequences. The study of this family and the two other families AtREP21 and SIDER2 reveals new copies of very different sizes and various combinations of modules which show the potential of our method.</p> <p>Conclusions</p> <p>ModuleOrganizer is available on the Genouest bioinformatics center at <url>http://moduleorganizer.genouest.org</url></p

    The shape of human gene family phylogenies

    Get PDF
    BACKGROUND: The shape of phylogenetic trees has been used to make inferences about the evolutionary process by comparing the shapes of actual phylogenies with those expected under simple models of the speciation process. Previous studies have focused on speciation events, but gene duplication is another lineage splitting event, analogous to speciation, and gene loss or deletion is analogous to extinction. Measures of the shape of gene family phylogenies can thus be used to investigate the processes of gene duplication and loss. We make the first systematic attempt to use tree shape to study gene duplication using human gene phylogenies. RESULTS: We find that gene duplication has produced gene family trees significantly less balanced than expected from a simple model of the process, and less balanced than species phylogenies: the opposite to what might be expected under the 2R hypothesis. CONCLUSION: While other explanations are plausible, we suggest that the greater imbalance of gene family trees than species trees is due to the prevalence of tandem duplications over regional duplications during the evolution of the human genome
    corecore