317 research outputs found

    Benchmarking ortholog identification methods using functional genomics data

    BACKGROUND: The transfer of functional annotations from model organism proteins to human proteins is one of the main applications of comparative genomics. Various methods are used to analyze cross-species orthologous relationships according to an operational definition of orthology. Often the definition of orthology is incorrectly interpreted as a prediction of proteins that are functionally equivalent across species, while in fact it only defines the existence of a common ancestor for a gene in different species. However, it has been demonstrated that orthologs often reveal significant functional similarity. Therefore, the quality of the orthology prediction is an important factor in the transfer of functional annotations (and other related information). To identify protein pairs with the highest possible functional similarity, it is important to qualify ortholog identification methods. RESULTS: To measure the similarity in function of proteins from different species we used functional genomics data, such as expression data and protein interaction data. We tested several of the most popular ortholog identification methods. In general, we observed a sensitivity/selectivity trade-off: the functional similarity scores per orthologous pair of sequences become higher when the number of proteins included in the ortholog groups decreases. CONCLUSION: By combining the sensitivity and the selectivity into an overall score, we show that the InParanoid program is the best ortholog identification method in terms of identifying functionally equivalent proteins

    Microarray data mining using Bioconductor packages

    BACKGROUND: This paper describes the results of a Gene Ontology (GO) term enrichment analysis of chicken microarray data using Bioconductor packages. By checking the enriched GO terms in three contrasts (MM8-PM8, MM8-MA8, and MM8-MM24) of the microarray data provided during this workshop, the analysis aimed to investigate the host reactions occurring in chickens shortly after a secondary challenge with either a homologous or a heterologous species of Eimeria. The results of GO enrichment analysis using GO terms annotated to chicken genes and GO terms annotated to chicken-human orthologous genes were also compared. Furthermore, a locally adaptive statistical procedure (LAP) was performed to test for differentially expressed chromosomal regions, rather than individual genes, in the chicken genome after Eimeria challenge. RESULTS: GO enrichment analysis identified significant (raw p-value < 0.05) GO terms for all three contrasts included in the analysis. Some of these GO terms were linked, broadly, to primary or secondary immune responses, indicating that GO enrichment analysis is a useful approach for analyzing microarray data. Comparing the GO enrichment results obtained with chicken gene information and with chicken-human orthologous gene information showed more refined GO terms related to immune responses in the latter case; this suggests that using chicken-human orthologous gene information gives higher power to detect significant GO terms with more refined functionality. Furthermore, three chromosome regions were identified as significantly up-regulated in contrast MM8-PM8 (q-value < 0.01). CONCLUSION: Overall, this paper describes a practical approach to analyzing microarray data in farm animals for which the genome information is still incomplete. For farm animals with currently limited gene annotation, such as chicken, borrowing gene annotation information from orthologous genes in well-annotated species, such as human, will substantially improve pathway analysis results. Furthermore, the LAP approach is a relatively new and very useful method for microarray analysis.

    The use of microsatellite polymorphism in genetic mapping of the ostrich (Struthio camelus)

    The aim of this study was to determine microsatellite polymorphism in ostriches and to use it in creating a genetic map of the ostrich. The polymorphism analysis covered 30 ostrich-specific microsatellite markers of the CAU (China Agricultural University) group. The material consisted of 150 ostriches (Struthio camelus). The 30 microsatellite loci were examined and a total of 343 alleles were identified. The number of alleles at a single locus ranged from 5 at locus CAU78 to 34 at locus CAU85. The observed heterozygosity Ho ranged from 0.467 (locus CAU78) to 0.993 (locus CAU16), whereas the expected heterozygosity He ranged from 0.510 (locus CAU78) to 0.953 (locus CAU85). Among the individual loci, the highest PIC values, all above 0.7, were observed for loci CAU85 (0.932), CAU64 (0.861) and CAU32, 75 (0.852). The microsatellite markers used in our study were highly polymorphic, as evidenced by the large number of detected alleles and the high values of heterozygosity, PIC and PE. The analysed microsatellite markers may be used in genetic linkage mapping of the ostrich, in the construction of a comparative genetic map with other ratites, such as emu and rhea, and in population genetic or phylogenetic studies of these birds.
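The per-locus statistics quoted in the abstract above (Ho, He and PIC) can be illustrated with a minimal Python sketch. It assumes the textbook definitions: He = 1 − Σp², and PIC following Botstein et al. (1980). The abstract does not say which estimator the authors used (for example, whether He was corrected for sample size), and the function names and toy genotypes are hypothetical.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(genotypes):
    """Map each allele to its frequency, given diploid genotypes as (a, b) pairs."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def observed_heterozygosity(genotypes):
    """Ho: fraction of individuals carrying two different alleles."""
    return sum(1 for a, b in genotypes if a != b) / len(genotypes)


def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2), without small-sample correction (an assumption)."""
    return 1.0 - sum(p * p for p in freqs.values())


def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2."""
    ps = list(freqs.values())
    homozygosity = sum(p * p for p in ps)
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(ps, 2))
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # Toy genotypes at one hypothetical locus; not data from the study.
    toy = [("A", "A"), ("A", "B"), ("B", "B"), ("A", "B")]
    f = allele_frequencies(toy)
    print(observed_heterozygosity(toy), expected_heterozygosity(f), pic(f))
```

PIC is never larger than He for the same allele frequencies, which matches the ordering of the values reported for locus CAU85 (He 0.953, PIC 0.932).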

    Electrochemical synthesis of peroxomonophosphate using boron-doped diamond anodes

    A new method for the synthesis of peroxomonophosphate, based on the use of boron-doped diamond electrodes, is described. The amount of oxidant electrogenerated depends on the characteristics of the supporting media (pH and solute concentration) and on the operating conditions (temperature and current density). Results show that the pH, between values of 1 and 5, does not influence either the electrosynthesis of peroxomonophosphate or the chemical stability of the oxidant generated. Conversely, low temperatures are required during the electrosynthesis process to minimize the thermal decomposition of peroxomonophosphate and to guarantee significant oxidant concentration. In addition, a marked influence of both the current density and the initial substrate is observed. This observation can be explained in terms of the contribution of hydroxyl radicals in the oxidation mechanisms that occur on diamond surfaces. In the assays carried out below the water oxidation potential, the generation of hydroxyl radicals did not take place. In these cases, peroxomonophosphate generation occurs through a direct electron transfer and, therefore, at these low current densities lower concentrations are obtained. On the other hand, at higher potentials both direct and hydroxyl radical-mediated mechanisms contribute to the oxidant generation and the process is more efficient. In the same way, the contribution of hydroxyl radicals may also help to explain the significant influence of the substrate concentration. Thus, the coexistence of both phosphate and hydroxyl radicals is required to ensure the generation of significant amounts of peroxomonophosphoric acid

    Strong signatures of selection in the domestic pig genome

    Domestication of wild boar (Sus scrofa) and subsequent selection have resulted in dramatic phenotypic changes in domestic pigs for a number of traits, including behavior, body composition, reproduction, and coat color. Here we have used whole-genome resequencing to reveal some of the loci that underlie phenotypic evolution in European domestic pigs. Selective sweep analyses revealed strong signatures of selection at three loci harboring quantitative trait loci that explain a considerable part of one of the most characteristic morphological changes in the domestic pig—the elongation of the back and an increased number of vertebrae. The three loci were associated with the NR6A1, PLAG1, and LCORL genes. The latter two have repeatedly been associated with loci controlling stature in other domestic animals and in humans. Most European domestic pigs are homozygous for the same haplotype at these three loci. We found an excess of derived nonsynonymous substitutions in domestic pigs, most likely reflecting both positive selection and relaxed purifying selection after domestication. Our analysis of structural variation revealed four duplications at the KIT locus that were exclusively present in white or white-spotted pigs, carrying the Dominant white, Patch, or Belt alleles. This discovery illustrates how structural changes have contributed to rapid phenotypic evolution in domestic animals and how alleles in domestic animals may evolve by the accumulation of multiple causative mutations as a response to strong directional selection

    Comparison of linkage disequilibrium and haplotype diversity on macro- and microchromosomes in chicken

    BACKGROUND: The chicken (Gallus gallus), like most avian species, has a very distinct karyotype consisting of many micro- and a few macrochromosomes. While it is known that recombination frequencies are much higher for micro- as compared to macrochromosomes, there is limited information on differences in linkage disequilibrium (LD) and haplotype diversity between these two classes of chromosomes. In this study, LD and haplotype diversity were systematically characterized in 371 birds from eight chicken populations (commercial lines, fancy breeds, and red jungle fowl) across macro- and microchromosomes. To this end we sampled four regions of ~1 cM each on macrochromosomes (GGA1 and GGA2), and four 1.5-2 cM regions on microchromosomes (GGA26 and GGA27) at a high density of 1 SNP every 2 kb (total of 889 SNPs). RESULTS: At a similar physical distance, LD, haplotype homozygosity, haploblock structure, and haplotype sharing were all lower for the micro- as compared to the macrochromosomes. These differences were consistent across populations. Heterozygosity, genetic differentiation, and derived allele frequencies were also higher for the microchromosomes. Differences in LD, haplotype variation, and haplotype sharing between populations were largely in line with known demographic history of the commercial chicken. Despite very low levels of LD, as measured by r² for most populations, some haploblock structure was observed, particularly in the macrochromosomes, but the haploblock sizes were typically less than 10 kb. CONCLUSION: Differences in LD between micro- and macrochromosomes were almost completely explained by differences in recombination rate. Differences in haplotype diversity and haplotype sharing between micro- and macrochromosomes were explained by differences in recombination rate and genotype variation. Haploblock structure was consistent with demography of the chicken populations, and differences in recombination rates between micro- and macrochromosomes. The limited haploblock structure and LD suggest that future whole-genome marker assays will need 100+K SNPs to exploit haplotype information. Interpretation and transferability of genetic parameters will need to take into account the size of chromosomes in chicken, and, since most birds have microchromosomes, in other avian species as well.
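The LD measure named in the abstract above, r², can be sketched for a single pair of biallelic SNPs as follows. This is a minimal illustration that assumes phased haplotypes coded 0/1; the abstract does not describe how the authors phased their data or which software they used, and the function name is hypothetical.

```python
def ld_r2(haplotypes):
    """r^2 between two biallelic SNPs.

    haplotypes: sequence of (allele_at_snp1, allele_at_snp2) pairs coded 0/1,
    one pair per phased chromosome (an assumption of this sketch).
    """
    n = len(haplotypes)
    p_a = sum(h[0] for h in haplotypes) / n  # frequency of allele 1 at SNP 1
    p_b = sum(h[1] for h in haplotypes) / n  # frequency of allele 1 at SNP 2
    p_ab = sum(1 for h in haplotypes if h[0] == 1 and h[1] == 1) / n
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when either SNP is monomorphic")
    return d * d / denom


if __name__ == "__main__":
    # Toy haplotypes; not data from the study.
    print(ld_r2([(1, 1), (1, 1), (0, 0), (0, 0)]))  # complete LD
    print(ld_r2([(1, 1), (1, 0), (0, 1), (0, 0)]))  # no LD
```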

    Signatures of Selection in the Genomes of Commercial and Non-Commercial Chicken Breeds

    Identifying genomic regions that are affected by selection is important for understanding the domestication and selection history of the domesticated chicken, as well as the molecular pathways underlying phenotypic traits and breeding goals. While whole-genome approaches, either high-density SNP chips or massively parallel sequencing, have been successfully applied to identify evidence for selective sweeps in chicken, it has been difficult to distinguish patterns of selection from stochastic and breed-specific effects. Here we present a study to identify selective sweeps in a large number of chicken breeds (67 in total) using a high-density (58 K) SNP chip. We analyzed commercial chickens representing all major breeding goals. In addition, we analyzed non-commercial chicken diversity for almost all recognized traditional Dutch breeds and a selection of representative breeds from China. Based on their shared history or breeding goal, we grouped the breeds in silico into 14 breed groups. We identified 396 chromosomal regions that show suggestive evidence of selection in at least one breed group, with 26 of these regions showing strong evidence of selection. Of these 26 regions, 13 were previously described and 13 yield new candidate genes for performance traits in chicken. Our approach demonstrates the strength of including many different populations with similar selection histories, and breed groups with different selection histories, to reduce stochastic effects based on single populations.

    Genome sequencing of the extinct Eurasian wild aurochs, Bos primigenius, illuminates the phylogeography and evolution of cattle

    Background Domestication of the now-extinct wild aurochs, Bos primigenius, gave rise to the two major domestic extant cattle taxa, B. taurus and B. indicus. While previous genetic studies have shed some light on the evolutionary relationships between European aurochs and modern cattle, important questions remain unanswered, including the phylogenetic status of aurochs, whether gene flow from aurochs into early domestic populations occurred, and which genomic regions were subject to selection processes during and after domestication. Here, we address these questions using whole-genome sequencing data generated from an approximately 6,750-year-old British aurochs bone and genome sequence data from 81 additional cattle plus genome-wide single nucleotide polymorphism data from a diverse panel of 1,225 modern animals. Results Phylogenomic analyses place the aurochs as a distinct outgroup to the domestic B. taurus lineage, supporting the predominant Near Eastern origin of European cattle. Conversely, traditional British and Irish breeds share more genetic variants with this aurochs specimen than other European populations, supporting localized gene flow from aurochs into the ancestors of modern British and Irish cattle, perhaps through purposeful restocking by early herders in Britain. Finally, the functions of genes showing evidence for positive selection in B. taurus are enriched for neurobiology, growth, metabolism and immunobiology, suggesting that these biological processes have been important in the domestication of cattle. Conclusions This work provides important new information regarding the origins and functional evolution of modern cattle, revealing that the interface between early European domestic populations and wild aurochs was significantly more complex than previously thought

    PhyloPat: phylogenetic pattern analysis of eukaryotic genes

    BACKGROUND: Phylogenetic patterns show the presence or absence of certain genes or proteins in a set of species. They can also be used to determine sets of genes or proteins that occur only in certain evolutionary branches. Phylogenetic pattern analysis has routinely been applied to protein databases such as COG and OrthoMCL, but not to gene databases. Here we present a tool named PhyloPat which allows the complete Ensembl gene database to be queried using phylogenetic patterns. DESCRIPTION: PhyloPat is an easy-to-use webserver, which can be used to query the orthologies of all complete genomes within the EnsMart database using phylogenetic patterns. This enables the determination of sets of genes that occur only in certain evolutionary branches or even single species. We found in total 446,825 genes and 3,164,088 orthologous relationships within the EnsMart v40 database. We used a single linkage clustering algorithm to create 147,922 phylogenetic lineages, using every one of the orthologies provided by Ensembl. PhyloPat provides the possibility of querying with either binary phylogenetic patterns (created by checkboxes) or regular expressions. Specific branches of a phylogenetic tree of the 21 included species can be selected to create a branch-specific phylogenetic pattern. Users can also input a list of Ensembl or EMBL IDs to check which phylogenetic lineage any gene belongs to. The output can be saved in HTML, Excel or plain text format for further analysis. A link to the FatiGO web interface has been incorporated in the HTML output, creating easy access to functional information. Finally, lists of omnipresent, polypresent and oligopresent genes have been included. CONCLUSION: PhyloPat is the first tool to combine complete genome information with phylogenetic pattern querying. Since we used the orthologies generated by the accurate pipeline of Ensembl, the obtained phylogenetic lineages are reliable. The completeness and reliability of these phylogenetic lineages will further increase with the addition of newly found orthologous relationships within each new Ensembl release.
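The two ideas in the abstract above, single linkage clustering of orthologous pairs into lineages and querying binary presence/absence patterns with regular expressions, can be sketched roughly as follows. This is not PhyloPat's code: the union-find implementation, the function names, the gene identifiers and the three-species ordering are all assumptions made for illustration.

```python
import re


def single_linkage(pairs):
    """Cluster genes so that any two genes joined by a chain of pairs share a cluster."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    clusters = {}
    for gene in list(parent):
        clusters.setdefault(find(gene), set()).add(gene)
    return list(clusters.values())


def binary_pattern(cluster, species_order, species_of):
    """Return a string with '1' for each species that has a gene in the cluster."""
    present = {species_of[gene] for gene in cluster}
    return "".join("1" if s in present else "0" for s in species_order)


def query(patterns, regex):
    """Return the indices of patterns that match the regular expression in full."""
    rx = re.compile(regex)
    return [i for i, p in enumerate(patterns) if rx.fullmatch(p)]


if __name__ == "__main__":
    # Hypothetical gene IDs and orthologous pairs, not Ensembl data.
    species_of = {"h1": "human", "m1": "mouse", "c1": "chicken",
                  "h2": "human", "m2": "mouse"}
    pairs = [("h1", "m1"), ("m1", "c1"), ("h2", "m2")]
    order = ["human", "mouse", "chicken"]
    lineages = single_linkage(pairs)
    patterns = [binary_pattern(c, order, species_of) for c in lineages]
    # Lineages present in human and mouse but absent from chicken.
    print([sorted(lineages[i]) for i in query(patterns, "110")])
```

With single linkage, one orthologous pair is enough to merge two groups, so `h1` and `c1` end up in the same lineage even though no direct pair links them.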

    A Simple Model for the Influence of Meiotic Conversion Tracts on GC Content

    A strong correlation between GC content and recombination rate is observed in many eukaryotes, which is thought to be due to conversion events linked to the repair of meiotic double-strand breaks. In several organisms, the length of conversion tracts has been shown to decrease exponentially with increasing distance from the sites of meiotic double-strand breaks. I show here that this behavior leads to a simple analytical model for the evolution and the equilibrium state of the GC content of sequences devoid of meiotic double-strand break sites. In the yeast Saccharomyces cerevisiae, meiotic double-strand breaks are practically excluded from protein-coding sequences. A good fit was observed between the predictions of the model and the variations of the average GC content of the third codon position (GC3) of S. cerevisiae genes. Moreover, recombination parameters that can be extracted by fitting the data to the model coincide with experimentally determined values. These results thus indicate that meiotic recombination plays an important part in determining the fluctuations of GC content in yeast coding sequences. The model also accounted for the different patterns of GC variations observed in the genes of Candida species that exhibit a variety of sexual lifestyles, and hence a wide range of meiotic recombination rates. Finally, the variations of the average GC3 content of human and chicken coding sequences could also be fitted by the model. These results suggest the existence of a widespread pattern of GC variation in eukaryotic genes due to meiotic recombination, which would imply the generality of two features of meiotic recombination: its association with GC-biased gene conversion and the quasi-exclusion of meiotic double-strand breaks from coding sequences. Moreover, the model points to specific constraints on protein fragments encoded by exon terminal sequences, which are the most affected by the GC bias.
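The quantity fitted in the abstract above, GC3, is the GC content at third codon positions. A minimal Python sketch of that measurement alone follows; it does not reproduce the paper's analytical model, whose equations are not given in the abstract. The function name and the toy sequence are hypothetical, and the input is assumed to be an in-frame coding sequence.

```python
def gc3(cds):
    """Fraction of G or C bases at third codon positions of an in-frame coding sequence."""
    seq = cds.upper()
    if len(seq) == 0 or len(seq) % 3 != 0:
        raise ValueError("expected a non-empty sequence whose length is a multiple of 3")
    third_positions = seq[2::3]  # bases at indices 2, 5, 8, ...
    return sum(1 for base in third_positions if base in "GC") / len(third_positions)


if __name__ == "__main__":
    # Toy sequence with codons ATG, GCC, TAA; third positions are G, C, A.
    print(gc3("ATGGCCTAA"))
```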