280 research outputs found

    Population genomics of domestic and wild yeasts

    Get PDF
    The natural genetics of an organism is determined by the distribution of sequences of its genome. Here we present one- to four-fold, with some deeper, coverage of the genome sequences of over seventy isolates of the domesticated baker's yeast, _Saccharomyces cerevisiae_, and its closest relative, the wild _S. paradoxus_, which has never been associated with human activity. These were collected from numerous geographic locations and sources (including wild, clinical, baking, wine, laboratory and food spoilage). These sequences provide an unprecedented view of the population structure, natural (and artificial) selection and genome evolution in these species. Variation in gene content, SNPs, indels, copy numbers and transposable elements provide insights into the evolution of different lineages. Phenotypic variation broadly correlates with global genome-wide phylogenetic relationships however there is no correlation with source. _S. paradoxus_ populations are well delineated along geographic boundaries while the variation among worldwide _S. cerevisiae_ isolates show less differentiation and is comparable to a single _S. paradoxus_ population. Rather than one or two domestication events leading to the extant baker's yeasts, the population structure of _S. cerevisiae_ shows a few well defined geographically isolated lineages and many different mosaics of these lineages, supporting the notion that human influence provided the opportunity for outbreeding and production of new combinations of pre-existing variation

    Probe set algorithms: is there a rational best bet?

    Get PDF
    Affymetrix microarrays have become a standard experimental platform for studies of mRNA expression profiling. Their success is due, in part, to the multiple oligonucleotide features (probes) against each transcript (probe set). This multiple testing allows for more robust background assessments and gene expression measures, and has permitted the development of many computational methods to translate image data into a single normalized "signal" for mRNA transcript abundance. There are now many probe set algorithms that have been developed, with a gradual movement away from chip-by-chip methods (MAS5), to project-based model-fitting methods (dCHIP, RMA, others). Data interpretation is often profoundly changed by choice of algorithm, with disoriented biologists questioning what the "accurate" interpretation of their experiment is. Here, we summarize the debate concerning probe set algorithms. We provide examples of how changes in mismatch weight, normalizations, and construction of expression ratios each dramatically change data interpretation. All interpretations can be considered as computationally appropriate, but with varying biological credibility. We also illustrate the performance of two new hybrid algorithms (PLIER, GC-RMA) relative to more traditional algorithms (dCHIP, MAS5, Probe Profiler PCA, RMA) using an interactive power analysis tool. PLIER appears superior to other algorithms in avoiding false positives with poorly performing probe sets. Based on our interpretation of the literature, and examples presented here, we suggest that the variability in performance of probe set algorithms is more dependent upon assumptions regarding "background", than on calculations of "signal". We argue that "background" is an enormously complex variable that can only be vaguely quantified, and thus the "best" probe set algorithm will vary from project to project

    Empirical Bayes models for multiple probe type microarrays at the probe level

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>When analyzing microarray data a primary objective is often to find differentially expressed genes. With empirical Bayes and penalized t-tests the sample variances are adjusted towards a global estimate, producing more stable results compared to ordinary t-tests. However, for Affymetrix type data a clear dependency between variability and intensity-level generally exists, even for logged intensities, most clearly for data at the probe level but also for probe-set summarizes such as the MAS5 expression index. As a consequence, adjustment towards a global estimate results in an intensity-level dependent false positive rate.</p> <p>Results</p> <p>We propose two new methods for finding differentially expressed genes, Probe level Locally moderated Weighted median-t (PLW) and Locally Moderated Weighted-t (LMW). Both methods use an empirical Bayes model taking the dependency between variability and intensity-level into account. A global covariance matrix is also used allowing for differing variances between arrays as well as array-to-array correlations. PLW is specially designed for Affymetrix type arrays (or other multiple-probe arrays). Instead of making inference on probe-set summaries, comparisons are made separately for each perfect-match probe and are then summarized into one score for the probe-set.</p> <p>Conclusion</p> <p>The proposed methods are compared to 14 existing methods using five spike-in data sets. For RMA and GCRMA processed data, PLW has the most accurate ranking of regulated genes in four out of the five data sets, and LMW consistently performs better than all examined moderated t-tests when used on RMA, GCRMA, and MAS5 expression indexes.</p

    SARS-CoV Pathogenesis Is Regulated by a STAT1 Dependent but a Type I, II and III Interferon Receptor Independent Mechanism

    Get PDF
    Severe acute respiratory syndrome coronavirus (SARS-CoV) infection often caused severe end stage lung disease and organizing phase diffuse alveolar damage, especially in the elderly. The virus-host interactions that governed development of these acute end stage lung diseases and death are unknown. To address this question, we evaluated the role of innate immune signaling in protection from human (Urbani) and a recombinant mouse adapted SARS-CoV, designated rMA15. In contrast to most models of viral pathogenesis, infection of type I, type II or type III interferon knockout mice (129 background) with either Urbani or MA15 viruses resulted in clinical disease outcomes, including transient weight loss, denuding bronchiolitis and alveolar inflammation and recovery, identical to that seen in infection of wildtype mice. This suggests that type I, II and III interferon signaling play minor roles in regulating SARS pathogenesis in mouse models. In contrast, infection of STAT1βˆ’/βˆ’ mice resulted in severe disease, high virus titer, extensive pulmonary lesions and 100% mortality by day 9 and 30 post-infection with rMA15 or Urbani viruses, respectively. Non-lethal in BALB/c mice, Urbani SARS-CoV infection in STAT1βˆ’/βˆ’ mice caused disseminated infection involving the liver, spleen and other tissues after day 9. These findings demonstrated that SARS-CoV pathogenesis is regulated by a STAT1 dependent but type I, II and III interferon receptor independent, mechanism. In contrast to a well documented role in innate immunity, we propose that STAT1 also protects mice via its role as an antagonist of unrestrained cell proliferation

    Reconstructing Druze population history

    Get PDF
    The Druze are an aggregate of communities in the Levant and Near East living almost exclusively in the mountains of Syria, Lebanon and Israel whose ~1000 year old religion formally opposes mixed marriages and conversions. Despite increasing interest in genetics of the population structure of the Druze, their population history remains unknown. We investigated the genetic relationships between Israeli Druze and both modern and ancient populations. We evaluated our findings in light of three hypotheses purporting to explain Druze history that posit Arabian, Persian or mixed Near Eastern-Levantine roots. The biogeographical analysis localised proto-Druze to the mountainous regions of southeastern Turkey, northern Iraq and southeast Syria and their descendants clustered along a trajectory between these two regions. The mixed Near Eastern-Middle Eastern localisation of the Druze, shown using both modern and ancient DNA data, is distinct from that of neighbouring Syrians, Palestinians and most of the Lebanese, who exhibit a high affinity to the Levant. Druze biogeographic affinity, migration patterns, time of emergence and genetic similarity to Near Eastern populations are highly suggestive of Armenian-Turkish ancestries for the proto-Druze

    Vaccine candidates derived from a novel infectious cDNA clone of an American genotype dengue virus type 2

    Get PDF
    BACKGROUND: A dengue virus type 2 (DEN-2 Tonga/74) isolated from a 1974 epidemic was characterized by mild illness and belongs to the American genotype of DEN-2 viruses. To prepare a vaccine candidate, a previously described 30 nucleotide deletion (Ξ”30) in the 3' untranslated region of DEN-4 has been engineered into the DEN-2 isolate. METHODS: A full-length cDNA clone was generated from the DEN-2 virus and used to produce recombinant DEN-2 (rDEN-2) and rDEN2Ξ”30. Viruses were evaluated for replication in SCID mice transplanted with human hepatoma cells (SCID-HuH-7 mice), in mosquitoes, and in rhesus monkeys. Neutralizing antibody induction and protective efficacy were also assessed in rhesus monkeys. RESULTS: The rDEN2Ξ”30 virus was ten-fold reduced in replication in SCID-HuH-7 mice when compared to the parent virus. The rDEN-2 viruses were not infectious for Aedes mosquitoes, but both readily infected Toxorynchites mosquitoes. In rhesus monkeys, rDEN2Ξ”30 appeared to be slightly attenuated when compared to the parent virus as measured by duration and peak of viremia and neutralizing antibody induction. A derivative of rDEN2Ξ”30, designated rDEN2Ξ”30-4995, was generated by incorporation of a point mutation previously identified in the NS3 gene of DEN-4 and was found to be more attenuated than rDEN2Ξ”30 in SCID-HuH-7 mice. CONCLUSIONS: The rDEN2Ξ”30 and rDEN2Ξ”30-4995 viruses can be considered for evaluation in humans and for inclusion in a tetravalent dengue vaccine

    Empirical Distributions of F-ST from Large-Scale Human Polymorphism Data

    Get PDF
    Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s FST that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-FST may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically FST analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global FST distribution closely follows an exponential distribution. Third, although the overall FST distribution is similarly shaped (inverse J), FST distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-FST of these groups is linear in allele frequency. These results suggest that investigating the extremes of the FST distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection

    Exome sequencing identifies NBEAL2 as the causative gene for gray platelet syndrome.

    Get PDF
    Gray platelet syndrome (GPS) is a predominantly recessive platelet disorder that is characterized by mild thrombocytopenia with large platelets and a paucity of Ξ±-granules; these abnormalities cause mostly moderate but in rare cases severe bleeding. We sequenced the exomes of four unrelated individuals and identified NBEAL2 as the causative gene; it has no previously known function but is a member of a gene family that is involved in granule development. Silencing of nbeal2 in zebrafish abrogated thrombocyte formation
    • …
    corecore