87 research outputs found

    Controlling for contamination in re-sequencing studies with a reproducible web-based phylogenetic approach

    Get PDF
    Polymorphism discovery is a routine application of next-generation sequencing technology where multiple samples are sent to a service provider for library preparation, subsequent sequencing, and bioinformatic analyses. The decreasing cost and advances in multiplexing approaches have made it possible to analyze hundreds of samples at a reasonable cost. However, because of the manual steps involved in the initial processing of samples and handling of sequencing equipment, cross-contamination remains a significant challenge. It is especially problematic in cases where polymorphism frequencies do not adhere to diploid expectation, for example, heterogeneous tumor samples, organellar genomes, as well as during bacterial and viral sequencing. In these instances, low levels of contamination may be readily mistaken for polymorphisms, leading to false results. Here we describe practical steps designed to reliably detect contamination and uncover its origin, and also provide new, Galaxy-based, readily accessible computational tools and workflows for quality control. All results described in this report can be reproduced interactively on the web as described at http://usegalaxy.org/contamination

    Caracterización Clínica e Histopatológica del Carcinoma Baso celular

    Get PDF
    Introduction: The basaloma also known as basal cell carcinoma is the most frequent case of skin cancer. It can be seen mostly in face, nose and forehead.Objective: To identify the clinical and histopathological characteristics of basal cell carcinoma in patients who underwent surgery at the Maxillo Facial Service of the Celia Sánchez Manduley University Hospital in Manzanillo, during the period from September to December 2016.Methodological Design: It was carried out a descriptive and observant study to identify the clinical and histopathological characteristics of the basal cell carcinoma in patients that were treated in the Maxillofacial Service of the Clinical-Surgical Hospital ´´ Celia Sánchez Manduley´´, between the months of September and December of 2016. The sample had 60 patients.Results: The basal cell carcinoma predominated in males (55%) mostly between the ages of 60 and 80 years (68,67%). This disease was most frequently presented in white people (75%); located mainly in the cheeks (43%), with a size between one and two centimeters. These lesions presented a low diagnosis error with only a two percentage.Conclusions: The research reveal a high frequency of this disease in elderly people mostly men, with a bigger tendency to be located in the facial regions sun exposed. It was also confirmed that these lesions were detected and treated in early stages. Their diagnosis and treatment are performed with a small margin of error.Introducción: el basaloma, también llamado carcinoma de células basales y carcinoma basocelular, es la forma más frecuente de cáncer de piel y se puede encontrar principalmente en cara, nariz y frente.Objetivo: identificar las características clínicas e histopatológicas del carcinoma basocelular en pacientes que fueron operados en el Servicio de Máxilo Facial del Hospital Universitario Celia Sánchez Manduley de Manzanillo, en el período comprendido entre los meses de septiembre a diciembre de 2016.Material y Métodos: se realizó un estudio observacional, descriptivo y de corte transversal para identificar las características clínicas e histopatológicas del carcinoma basocelular en pacientes que fueron operados en el Servicio de Máxilo Facial del Hospital Clínico-quirúrgico Celia Sánchez Manduley de Manzanillo, en el período comprendido entre los meses de septiembre a diciembre de 2016. La muestra estuvo constituida por 60 pacientes.Resultados:  el carcinoma basocelular predominó en el sexo masculino (55%) sobre todo en las edades de 60 a 80 años (68,67%). Esta lesión se presentó con mayor frecuencia en personas de piel blanca (75%); localizándose principalmente en las mejillas (43%) con un tamaño predominante de 1 a 2 cm. Esta lesión presentó bajo error diagnóstico con solo el 2%. Conclusiones: la investigación reveló una alta frecuencia de la enfermedad en personas de edad avanzada sobre todo en hombres, con mayor tendencia a presentarse en regiones faciales expuestas al sol. También se constató que las lesiones son detectadas y tratadas en un estado incipiente. Su diagnóstico y tratamiento se realzan con un margen de error muy pequeño

    Evolution and Survival on Eutherian Sex Chromosomes

    Get PDF
    Since the two eutherian sex chromosomes diverged from an ancestral autosomal pair, the X has remained relatively gene-rich, while the Y has lost most of its genes through the accumulation of deleterious mutations in nonrecombining regions. Presently, it is unclear what is distinctive about genes that remain on the Y chromosome, when the sex chromosomes acquired their unique evolutionary rates, and whether X-Y gene divergence paralleled that of paralogs located on autosomes. To tackle these questions, here we juxtaposed the evolution of X and Y homologous genes (gametologs) in eutherian mammals with their autosomal orthologs in marsupial and monotreme mammals. We discovered that genes on the X and Y acquired distinct evolutionary rates immediately following the suppression of recombination between the two sex chromosomes. The Y-linked genes evolved at higher rates, while the X-linked genes maintained the lower evolutionary rates of the ancestral autosomal genes. These distinct rates have been maintained throughout the evolution of X and Y. Specifically, in humans, most X gametologs and, curiously, also most Y gametologs evolved under stronger purifying selection than similarly aged autosomal paralogs. Finally, after evaluating the current experimental data from the literature, we concluded that unique mRNA/protein expression patterns and functions acquired by Y (versus X) gametologs likely contributed to their retention. Our results also suggest that either the boundary between sex chromosome strata 3 and 4 should be shifted or that stratum 3 should be divided into two strata

    Translog, a web browser for studying the expression divergence of homologous genes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Increasing amount of data from comparative genomics, and newly developed technologies producing accurate gene expression data facilitate the study of the expression divergence of homologous genes. Previous studies have individually highlighted factors that contribute to the expression divergence of duplicate genes, e.g. promoter changes, exon structure heterogeneity, asymmetric histone modifications and genomic neighborhood conservation. However, there is a lack of a tool to integrate multiple factors and visualize their variety among homologous genes in a straightforward way.</p> <p>Results</p> <p>We introduce Translog (a web-based tool for Transcriptome comparison of homologous genes) that assists in the comparison of homologous genes by displaying the loci in three different views: promoter view for studying the sharing/turnover of transcription initiations, exon structure for displaying the exon-intron structure changes, and genomic neighborhood to show the macro-synteny conservation in a larger scale. CAGE data for transcription initiation are mapped for each transcript and can be used to study transcription turnover and expression changes. Alignment anchors between homologous loci can be used to define the precise homologous transcripts. We demonstrate how these views can be used to visualize the changes of homologous genes during evolution, particularly after the 2R and 3R whole genome duplication.</p> <p>Conclusion</p> <p>We have developed a web-based tool for assisting in the transcriptome comparison of homologous genes, facilitating the study of expression divergence.</p

    Role of Duplicate Genes in Robustness against Deleterious Human Mutations

    Get PDF
    It is now widely recognized that robustness is an inherent property of biological systems [1],[2],[3]. The contribution of close sequence homologs to genetic robustness against null mutations has been previously demonstrated in simple organisms [4],[5]. In this paper we investigate in detail the contribution of gene duplicates to back-up against deleterious human mutations. Our analysis demonstrates that the functional compensation by close homologs may play an important role in human genetic disease. Genes with a 90% sequence identity homolog are about 3 times less likely to harbor known disease mutations compared to genes with remote homologs. Moreover, close duplicates affect the phenotypic consequences of deleterious mutations by making a decrease in life expectancy significantly less likely. We also demonstrate that similarity of expression profiles across tissues significantly increases the likelihood of functional compensation by homologs

    Functional Diversification of Paralogous Transcription Factors via Divergence in DNA Binding Site Motif and in Expression

    Get PDF
    BACKGROUND: Gene duplication is a major driver of evolutionary innovation as it allows for an organism to elaborate its existing biological functions via specialization or diversification of initially redundant gene paralogs. Gene function can diversify in several ways. Transcription factor gene paralogs in particular, can diversify either by changes in their tissue-specific expression pattern or by changes in the DNA binding site motif recognized by their protein product, which in turn alters their gene targets. The relationship between these two modes of functional diversification of transcription factor paralogs has not been previously investigated, and is essential for understanding adaptive evolution of transcription factor gene families. FINDINGS: Based on a large set of human paralogous transcription factor pairs, we show that when the DNA binding site motifs of transcription factor paralogs are similar, the expressions of the genes that encode the paralogs have diverged, so in general, at most one of the paralogs is highly expressed in a tissue. Moreover, paralogs with diverged DNA binding site motifs tend to be diverged in their function. Conversely, two paralogs that are highly expressed in a tissue tend to have dissimilar DNA binding site motifs. We have also found that in general, within a paralogous family, tissue-specific decrease in gene expression is more frequent than what is expected by chance. CONCLUSIONS: While previous investigations of paralogous gene diversification have only considered coding sequence divergence, by explicitly quantifying divergence in DNA binding site motif, our work presents a new paradigm for investigating functional diversification. Consistent with evolutionary expectation, our quantitative analysis suggests that paralogous transcription factors have survived extinction in part, either through diversification of their DNA binding site motifs or through alterations in their tissue-specific expression levels

    Gene Expression Divergence is Coupled to Evolution of DNA Structure in Coding Regions

    Get PDF
    Sequence changes in coding region and regulatory region of the gene itself (cis) determine most of gene expression divergence between closely related species. But gene expression divergence between yeast species is not correlated with evolution of primary nucleotide sequence. This indicates that other factors in cis direct gene expression divergence. Here, we studied the contribution of DNA three-dimensional structural evolution as cis to gene expression divergence. We found that the evolution of DNA structure in coding regions and gene expression divergence are correlated in yeast. Similar result was also observed between Drosophila species. DNA structure is associated with the binding of chromatin remodelers and histone modifiers to DNA sequences in coding regions, which influence RNA polymerase II occupancy that controls gene expression level. We also found that genes with similar DNA structures are involved in the same biological process and function. These results reveal the previously unappreciated roles of DNA structure as cis-effects in gene expression

    Testing the Ortholog Conjecture with Comparative Functional Genomic Data from Mammals

    Get PDF
    A common assumption in comparative genomics is that orthologous genes share greater functional similarity than do paralogous genes (the “ortholog conjecture”). Many methods used to computationally predict protein function are based on this assumption, even though it is largely untested. Here we present the first large-scale test of the ortholog conjecture using comparative functional genomic data from human and mouse. We use the experimentally derived functions of more than 8,900 genes, as well as an independent microarray dataset, to directly assess our ability to predict function using both orthologs and paralogs. Both datasets show that paralogs are often a much better predictor of function than are orthologs, even at lower sequence identities. Among paralogs, those found within the same species are consistently more functionally similar than those found in a different species. We also find that paralogous pairs residing on the same chromosome are more functionally similar than those on different chromosomes, perhaps due to higher levels of interlocus gene conversion between these pairs. In addition to offering implications for the computational prediction of protein function, our results shed light on the relationship between sequence divergence and functional divergence. We conclude that the most important factor in the evolution of function is not amino acid sequence, but rather the cellular context in which proteins act

    Late Replicating Domains Are Highly Recombining in Females but Have Low Male Recombination Rates: Implications for Isochore Evolution

    Get PDF
    In mammals sequences that are either late replicating or highly recombining have high rates of evolution at putatively neutral sites. As early replicating domains and highly recombining domains both tend to be GC rich we a priori expect these two variables to covary. If so, the relative contribution of either of these variables to the local neutral substitution rate might have been wrongly estimated owing to covariance with the other. Against our expectations, we find that sex-averaged recombination rates show little or no correlation with replication timing, suggesting that they are independent determinants of substitution rates. However, this result masks significant sex-specific complexity: late replicating domains tend to have high recombination rates in females but low recombination rates in males. That these trends are antagonistic explains why sex-averaged recombination is not correlated with replication timing. This unexpected result has several important implications. First, although both male and female recombination rates covary significantly with intronic substitution rates, the magnitude of this correlation is moderately underestimated for male recombination and slightly overestimated for female recombination, owing to covariance with replicating timing. Second, the result could explain why male recombination is strongly correlated with GC content but female recombination is not. If to explain the correlation between GC content and replication timing we suppose that late replication forces reduced GC content, then GC promotion by biased gene conversion during female recombination is partly countered by the antagonistic effect of later replicating sequence tending increase AT content. Indeed, the strength of the correlation between female recombination rate and local GC content is more than doubled by control for replication timing. Our results underpin the need to consider sex-specific recombination rates and potential covariates in analysis of GC content and rates of evolution

    Contrasting Patterns of Sequence Evolution at the Functionally Redundant bric à brac Paralogs in Drosophila melanogaster

    Get PDF
    Genes with overlapping expression and function may gradually diverge despite retaining some common functions. To test whether such genes show distinct patterns of molecular evolution within species, we examined sequence variation at the bric à brac (bab) locus of Drosophila melanogaster. This locus is composed of two anciently duplicated paralogs, bab1 and bab2, which are involved in patterning the adult abdomen, legs, and ovaries. We have sequenced the 148 kb genomic region spanning the bab1 and bab2 genes from 94 inbred lines of D. melanogaster sampled from a single location. Two non-coding regions, one in each paralog, appear to be under selection. The strongest evidence of directional selection is found in a region of bab2 that has no known functional role. The other region is located in the bab1 paralog and is known to contain a cis-regulatory element that controls sex-specific abdominal pigmentation. The coding region of bab1 appears to be under stronger functional constraint than the bab2 coding sequences. Thus, the two paralogs are evolving under different selective regimes in the same natural population, illuminating the different evolutionary trajectories of partially redundant duplicate genes
    corecore