56 research outputs found
Multiple Mechanisms Promote the Retained Expression of Gene Duplicates in the Tetraploid Frog Xenopus laevis
Gene duplication provides a window of opportunity for biological variants to persist under the protection of a co-expressed copy with similar or redundant function. Duplication catalyzes innovation (neofunctionalization), subfunction degeneration (subfunctionalization), and genetic buffering (redundancy), and the genetic survival of each paralog is triggered by mechanisms that add, compromise, or do not alter protein function. We tested the applicability of three types of mechanisms for promoting the retained expression of duplicated genes in 290 expressed paralogs of the tetraploid clawed frog, Xenopus laevis. Tests were based on explicit expectations concerning the ka/ks ratio, and the number and location of nonsynonymous substitutions after duplication. Functional constraints on the majority of paralogs are not significantly different from a singleton ortholog. However, we recover strong support that some of them have an asymmetric rate of nonsynonymous substitution: 6% match predictions of the neofunctionalization hypothesis in that (1) each paralog accumulated nonsynonymous substitutions at a significantly different rate and (2) the one that evolves faster has a higher ka/ks ratio than the other paralog and than a singleton ortholog. Fewer paralogs (3%) exhibit a complementary pattern of substitution at the protein level that is predicted by enhancement or degradation of different functional domains, and the remaining 13% have a higher average ka/ks ratio in both paralogs that is consistent with altered functional constraints, diversifying selection, or activity-reducing mutations after duplication. We estimate that these paralogs have been retained since they originated by genome duplication between 21 and 41 million years ago. Multiple mechanisms operate to promote the retained expression of duplicates in the same genome, in genes in the same functional class, over the same period of time following duplication, and sometimes in the same pair of paralogs. None of these paralogs are superfluous; degradation or enhancement of different protein subfunctions and neofunctionalization are plausible hypotheses for the retained expression of some of them. Evolution of most X. laevis paralogs, however, is consistent with retained expression via mechanisms that do not radically alter functional constraints, such as selection to preserve post-duplication stoichiometry or temporal, quantitative, or spatial subfunctionalization
Optimization and performance testing of a sequence processing pipeline applied to detection of nonindigenous species
Genetic taxonomic assignment can be more sensitive than morphological taxonomic assignment, particularly for small, cryptic or rare species. Sequence processing is essential to taxonomic assignment, but can also produce errors because optimal parameters are not known a priori. Here, we explored how sequence processing parameters influence taxonomic assignment of 18S sequences from bulk zooplankton samples produced by 454 pyrosequencing. We optimized a sequence processing pipeline for two common research goals, estimation of species richness and early detection of aquatic invasive species (AIS), and then tested most optimal models’ performances through simulations. We tested 1,050 parameter sets on 18S sequences from 20 AIS to determine optimal parameters for each research goal. We tested optimized pipelines’ performances (detectability and sensitivity) by computationally inoculating sequences of 20 AIS into ten bulk zooplankton samples from ports across Canada. We found that optimal parameter selection generally depends on the research goal. However, regardless of research goal, we found that metazoan 18S sequences produced by 454 pyrosequencing should be trimmed to 375–400 bp and sequence quality filtering should be relaxed (1.5 ≤ maximum expected error ≤ 3.0, Phred score = 10). Clustering and denoising were only viable for estimating species richness, because these processing steps made some species undetectable at low sequence abundances which would not be useful for early detection of AIS. With parameter sets optimized for early detection of AIS, 90% of AIS were detected with fewer than 11 target sequences, regardless of whether clustering or denoising was used. Despite developments in next-generation sequencing, sequence processing remains an important issue owing to difficulties in balancing false-positive and false-negative errors in metabarcoding data
Genomics of Divergence along a Continuum of Parapatric Population Differentiation
MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1)
Trends in DNA barcoding and metabarcoding
This open-access special issue features 12 full articles representing emerging trends from the international DNAbarcoding community. Several articles highlight how DNA-based techniques are elucidating the species diversity,biogeography, and conservation status of Africas biodiversity. Another prominent theme is the movementtowards big biodiversity data using high-throughput, individual-based DNA barcoding methods, which preservevoucher specimens and abundance data, as well as bulk sample-based metabarcoding. Methodological developments are enhancing the detection of specific species and whole communities using environmental DNA(eDNA) barcoding and metabarcoding. Data are also expanding in terms of genetic coverage; in this issue, a newdatabase is established for a secondary fungalDNAbarcode marker, and multi-kingdom, multi-marker biodiversitysurveys are gaining traction. DNA barcode sequence data, often combined with complementary markers or taxonomic information, are increasingly contributing to large-scale phylogenetic projects, with implications for understanding evolutionary history, community structure, and conservation priorities.Fil: Adamowicz, Sarah J.. University of Guelph; CanadáFil: Boatwright, James S.. University of The Western Cape; SudáfricaFil: Chain, Frédéric. University of Massachusetts; Estados UnidosFil: Fisher, Brian L.. California Academy Of Sciences.; Estados UnidosFil: Hogg, Ian D.. Polar Knowledge Canada; CanadáFil: Leese, Florian. Universitat Essen; AlemaniaFil: Lijtmaer, Dario Alejandro. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Museo Argentino de Ciencias Naturales "Bernardino Rivadavia"; ArgentinaFil: Mwale, Monica. South African National Biodiversity Institute; SudáfricaFil: Naaum, Amanda M.. The Queens University of Belfast; IrlandaFil: Pochon, Xavier. University of Auckland; Nueva ZelandaFil: Schubert, Dirk W.. University of Guelph; CanadáFil: Wilson, John James. National Museums Liverpool; Reino UnidoFil: Wood, Susanna. Cawthron Institute; Nueva ZelandaFil: Xu, Jianping. Mcmaster University; CanadáFil: Xu, Sen. University of Texas at Arlington; Estados UnidosFil: Zhou, Xin. China Agricultural University; ChinaFil: Van Der Bank, Michelle. University of Johannesburg; Sudáfric
Single-Species Microarrays and Comparative Transcriptomics
BACKGROUND: Prefabricated expression microarrays are currently available for only a few species but methods have been proposed to extend their application to comparisons between divergent genomes. METHODOLOGY/PRINCIPAL FINDINGS: Here we demonstrate that the hybridization intensity of genomic DNA is a poor basis on which to select unbiased probes on Affymetrix expression arrays for studies of comparative transcriptomics, and that doing so produces spurious results. We used the Affymetrix Xenopus laevis microarray to evaluate expression divergence between X. laevis, X. borealis, and their F1 hybrids. When data are analyzed with probes that interrogate only sequences with confirmed identity in both species, we recover results that differ substantially analyses that use genomic DNA hybridizations to select probes. CONCLUSIONS/SIGNIFICANCE: Our findings have implications for the experimental design of comparative expression studies that use single-species microarrays, and for our understanding of divergent expression in hybrid clawed frogs. These findings also highlight important limitations of single-species microarrays for studies of comparative transcriptomics of polyploid species
The odds of duplicate gene persistence after polyploidization
Background: Gene duplication is an important biological phenomenon associated with genomic redundancy,degeneration, specialization, innovation, and speciation. After duplication, both copies continue functioning when natural selection favors duplicated protein function or expression, or when mutations make them functionally distinct before one copy is silenced. Results: Here we quantify the degree to which genetic parameters related to gene expression, molecular evolution, and gene structure in a diploid frog - Silurana tropicalis - influence the odds of functional persistence of orthologous duplicate genes in a closely related tetraploid species - Xenopus laevis. Using public databases and 454 pyrosequencing, we obtained genetic and expression data from S. tropicalis orthologs of 3,387 X. laevis paralogs and 4,746 X. laevis singletons - the most comprehensive dataset for African clawed frogs yet analyzed. Using logistic regression, we demonstrate that the most important predictors of the odds of duplicate gene persistence in the tetraploid species are the total gene expression level and evenness of expression across tissues and development in the diploid species. Slow protein evolution and information density (fewer exons, shorter introns) in the diploid are also positively correlated with duplicate gene persistence in the tetraploid. Conclusions: Our findings suggest that a combination of factors contribute to duplicate gene persistence following whole genome duplication, but that the total expression level and evenness of expression across tissues and through development before duplication are most important. We speculate that these parameters are useful predictors of duplicate gene longevity after whole genome duplication in other taxa
Extensive Copy-Number Variation of Young Genes across Stickleback Populations
MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript
Genome-Wide Genotype-Expression Relationships Reveal Both Copy Number and Single Nucleotide Differentiation Contribute to Differential Gene Expression between Stickleback Ecotypes
Repeated and independent emergence of trait divergence that matches habitat differences is a sign of parallel evolution by natural selection. Yet, the molecular underpinnings that are targeted by adaptive evolution often remain elusive. We investigate this question by combining genome-wide analyses of copy number variants (CNVs), single nucleotide polymorphisms (SNPs), and gene expression across four pairs of lake and river populations of the three-spined stickleback (Gasterosteus aculeatus). We tested whether CNVs that span entire genes and SNPs occurring in putative cis-regulatory regions contribute to gene expression differences between sticklebacks from lake and river origins. We found 135 gene CNVs that showed a significant positive association between gene copy number and gene expression, suggesting that CNVs result in dosage effects that can fuel phenotypic variation and serve as substrates for habitat-specific selection. Copy number differentiation between lake and river sticklebacks also contributed to expression differences of two immune-related genes in immune tissues, cathepsin A and GIMAP7. In addition, we identified SNPs in cis-regulatory regions (eSNPs) associated with the expression of 1,865 genes, including one eSNP upstream of a carboxypeptidase gene where both the SNP alleles differentiated and the gene was differentially expressed between lake and river populations. Our study highlights two types of mutations as important sources of genetic variation involved in the evolution of gene expression and in potentially facilitating repeated adaptation to novel environments
Genome sequencing of the lizard parasite Leishmania tarentolae reveals loss of genes associated to the intracellular stage of human pathogenic species
The Leishmania tarentolae Parrot-TarII strain genome sequence was resolved to an average 16-fold mean coverage by next-generation DNA sequencing technologies. This is the first non-pathogenic to humans kinetoplastid protozoan genome to be described thus providing an opportunity for comparison with the completed genomes of pathogenic Leishmania species. A high synteny was observed between all sequenced Leishmania species. A limited number of chromosomal regions diverged between L. tarentolae and L. infantum, while remaining syntenic to L. major. Globally, >90% of the L. tarentolae gene content was shared with the other Leishmania species. We identified 95 predicted coding sequences unique to L. tarentolae and 250 genes that were absent from L. tarentolae. Interestingly, many of the latter genes were expressed in the intracellular amastigote stage of pathogenic species. In addition, genes coding for products involved in antioxidant defence or participating in vesicular-mediated protein transport were underrepresented in L. tarentolae. In contrast to other Leishmania genomes, two gene families were expanded in L. tarentolae, namely the zinc metallo-peptidase surface glycoprotein GP63 and the promastigote surface antigen PSA31C. Overall, L. tarentolae's gene content appears better adapted to the promastigote insect stage rather than the amastigote mammalian stage
- …