31 research outputs found

    An ancient adaptive episode of convergent molecular evolution confounds phylogenetic inference

    Get PDF
    Convergence can mislead phylogenetic inference by mimicking shared ancestry, but has been detected only rarely in molecular evolution. Here, we show that significant convergence occurred in snake and agamid lizard mitochondrial genomes. Most evidence, and most of the mitochondrial genome, supports one phylogenetic tree, but a subset of mostly amino acid-altering mitochondrial sites strongly support a radically different phylogeny. These sites are convergent, probably selected, and overwhelm the signal from other sites. This suggests that convergent molecular evolution can seriously mislead phylogenetics, even with large data sets. Radical phylogenies inconsistent with previous evidence should be treated cautiously

    Rapid Microsatellite Identification from Illumina Paired-End Genomic Sequencing in Two Birds and a Snake

    Get PDF
    Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct “Seq-to-SSR” approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample – a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable

    Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

    Get PDF
    Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed

    Seasonality of MRSA Infections

    Get PDF
    Using MRSA isolates submitted to our hospital microbiology laboratory January 2001–March 2010 and the number of our emergency department (ED) visits, quarterly community-associated (CA) and hospital-associated (HA) MRSA infections were modeled using Poisson regressions. For pediatric patients, approximately 1.85x (95% CI 1.45x–2.36x, adj. p<0.0001) as many CA-MRSA infections per ED visit occurred in the second two quarters as occurred in the first two quarters. For adult patients, 1.14x (95% CI 1.01x–1.29x, adj.p = 0.03) as many infections per ED visit occurred in the second two quarters as in the first two quarters. Approximately 2.94x (95% CI 1.39x–6.21x, adj.p = 0.015) as many HA-MRSA infections per hospital admission occurred in the second two quarters as occurred in the first two quarters for pediatric patients. No seasonal variation was observed among adult HA-MRSA infections per hospital admission. We demonstrated seasonality of MRSA infections and provide a summary table of similar observations in other studies

    Crystal Structures of T. b. rhodesiense Adenosine Kinase Complexed with Inhibitor and Activator: Implications for Catalysis and Hyperactivation

    Get PDF
    Recently, we discovered that 4-[5-(4-phenoxyphenyl)-2H-pyrazol-3-yl]morpholine (compound 1) and its derivatives exhibit specific antitrypanosomal activity toward T. b. rhodesiense, the causative agent of the acute form of HAT. We found that compound 1 would target the parasite adenosine kinase (TbrAK), an important enzyme of the purine salvage pathway, by acting via hyperactivation of the enzyme. This represents a novel and hitherto unexplored strategy for the development of trypanocides. These findings prompted us to investigate the mechanism of action at the molecular level. The present study reports the first three-dimensional crystal structures of TbrAK in complex with the bisubstrate inhibitor AP5A, and in complex with the activator (compound 1). The subsequent structural analysis sheds light on substrate and activator binding, and gives insight into the possible mechanism leading to hyperactivation. Further structure-activity relationships in terms of TbrAK activation properties support the observed binding mode of compound 1 in the crystal structure and may open the field for subsequent optimization of this compound series

    Molecular Adaptations for Sensing and Securing Prey and Insight into Amniote Genome Diversity from the Garter Snake Genome

    Get PDF
    Colubridae represents the most phenotypically diverse and speciose family of snakes, yet no well-assembled and annotated genome exists for this lineage. Here, we report and analyze the genome of the garter snake, Thamnophis sirtalis, a colubrid snake that is an important model species for research in evolutionary biology, physiology, genomics, behavior, and the evolution of toxin resistance. Using the garter snake genome, we show how snakes have evolved numerous adaptations for sensing and securing prey, and identify features of snake genome structure that provide insight into the evolution of amniote genomes. Analyses of the garter snake and other squamate reptile genomes highlight shifts in repeat element abundance and expansion within snakes, uncover evidence of genes under positive selection, and provide revised neutral substitution rate estimates for squamates. Our identification of Z and W sex chromosome-specific scaffolds provides evidence for multiple origins of sex chromosome systems in snakes and demonstrates the value of this genome for studying sex chromosome evolution. Analysis of gene duplication and loss in visual and olfactory gene families supports a dim-light ancestral condition in snakes and indicates that olfactory receptor repertoires underwent an expansion early in snake evolution. Additionally, we provide some of the first links between secreted venom proteins, the genes that encode them, and their evolutionary origins in a rear-fanged colubrid snake, together with new genomic insight into the coevolutionary arms race between garter snakes and highly toxic newt prey that led to toxin resistance in garter snakes

    Sequencing three crocodilian genomes to illuminate the evolution of archosaurs and amniotes

    Get PDF
    The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described

    Association of the PHACTR1/EDN1 genetic locus with spontaneous coronary artery dissection

    Get PDF
    Background: Spontaneous coronary artery dissection (SCAD) is an increasingly recognized cause of acute coronary syndromes (ACS) afflicting predominantly younger to middle-aged women. Observational studies have reported a high prevalence of extracoronary vascular anomalies, especially fibromuscular dysplasia (FMD) and a low prevalence of coincidental cases of atherosclerosis. PHACTR1/EDN1 is a genetic risk locus for several vascular diseases, including FMD and coronary artery disease, with the putative causal noncoding variant at the rs9349379 locus acting as a potential enhancer for the endothelin-1 (EDN1) gene. Objectives: This study sought to test the association between the rs9349379 genotype and SCAD. Methods: Results from case control studies from France, United Kingdom, United States, and Australia were analyzed to test the association with SCAD risk, including age at first event, pregnancy-associated SCAD (P-SCAD), and recurrent SCAD. Results: The previously reported risk allele for FMD (rs9349379-A) was associated with a higher risk of SCAD in all studies. In a meta-analysis of 1,055 SCAD patients and 7,190 controls, the odds ratio (OR) was 1.67 (95% confidence interval [CI]: 1.50 to 1.86) per copy of rs9349379-A. In a subset of 491 SCAD patients, the OR estimate was found to be higher for the association with SCAD in patients without FMD (OR: 1.89; 95% CI: 1.53 to 2.33) than in SCAD cases with FMD (OR: 1.60; 95% CI: 1.28 to 1.99). There was no effect of genotype on age at first event, P-SCAD, or recurrence. Conclusions: The first genetic risk factor for SCAD was identified in the largest study conducted to date for this condition. This genetic link may contribute to the clinical overlap between SCAD and FMD
    corecore