287 research outputs found

    The scale of population structure in Arabidopsis thaliana

    The population structure of an organism reflects its evolutionary history and influences its evolutionary trajectory. It constrains the combination of genetic diversity and reveals patterns of past gene flow. Understanding it is a prerequisite for detecting genomic regions under selection, predicting the effect of population disturbances, or modeling gene flow. This paper examines the detailed global population structure of Arabidopsis thaliana. Using a set of 5,707 plants collected from around the globe and genotyped at 149 SNPs, we show that while A. thaliana as a species self-fertilizes 97% of the time, there is considerable variation among local groups. This level of outcrossing greatly limits observed heterozygosity but is sufficient to generate considerable local haplotypic diversity. We also find that in its native Eurasian range A. thaliana exhibits continuous isolation by distance at every geographic scale without natural breaks corresponding to classical notions of populations. By contrast, in North America, where it exists as an exotic species, A. thaliana exhibits little or no population structure at a continental scale but local isolation by distance that extends hundreds of km. This suggests a pattern for the development of isolation by distance that can establish itself shortly after an organism fills a new habitat range. It also raises questions about the general applicability of many standard population genetics models. Any model based on discrete clusters of interchangeable individuals will be an uneasy fit to organisms like A. thaliana which exhibit continuous isolation by distance on many scales.

    LOCAS – A Low Coverage Assembly Tool for Resequencing Projects

    Motivation: Next Generation Sequencing (NGS) is a frequently applied approach to detect sequence variations between highly related genomes. Recent large-scale re-sequencing studies such as the Human 1000 Genomes Project utilize NGS data of low coverage to afford sequencing of hundreds of individuals. Here, SNPs and micro-indels can be detected by applying an alignment-consensus approach. However, computational methods capable of discovering other variations such as novel insertions or highly diverged sequence from low coverage NGS data are still lacking. Results: We present LOCAS, a new NGS assembler particularly designed for low coverage assembly of eukaryotic genomes using a mismatch sensitive overlap-layout-consensus approach. LOCAS assembles homologous regions in a homology-guided manner while it performs de novo assemblies of insertions and highly polymorphic target regions subsequent to an alignment-consensus approach. LOCAS has been evaluated in homology-guided assembly scenarios with low sequence coverage of Arabidopsis thaliana strains sequenced as part of the Arabidopsis 1001 Genomes Project. While assembling the same amount of long insertions as state-of-the-art NGS assemblers, LOCAS showed the best results regarding contig size, error rate and runtime. Conclusion: LOCAS produces excellent results for homology-guided assembly of eukaryotic genomes with short reads and low sequencing depth, and therefore appears to be the assembly tool of choice for the detection of novel sequenc

    Toxoplasma seroprevalence in a rural population in France: detection of a household effect

    Background: Toxoplasma gondii, the agent of toxoplasmosis, has a complex life cycle. In humans, the parasite may be acquired either through ingestion of contaminated meat or through oocysts present in the environment. The importance of each source of contamination varies locally according to the environment characteristics and to differences concerning human eating habits and the presence of cats; thus, the risk factors may be determined through fine-scale studies. Here, we searched for factors associated with seropositivity in the population of two adjacent villages in Lorraine region, France. Methods: All voluntary inhabitants filled out a questionnaire and gave a blood sample. The seroprevalence was estimated globally and according to the inhabitants' ages using a cubic spline regression. A mixed logistic regression model was used to quantify the effect of individual and household factors on the probability of seropositivity. Results: Based on serological results from 273 persons, we estimated seroprevalence to be 47% (95% confidence interval: 41 to 53%). That seroprevalence increased with age: the slope was the steepest up to the age of 40 years (OR = 2.48 per 10-year increment, 95% credibility interval: [1.29 to 5.09]), but that increase was not significant afterwards. The probability of seropositivity tended to be higher in men than in women (OR = 2.01, 95% credibility interval: [0.92 to 4.72]) and in subjects eating raw vegetables at least once a week than in the others (OR = 8.4, 95% credibility interval: [0.93 to 72.1]). These effects were close to statistical significance. The multivariable analysis highlighted a significant seroprevalence heterogeneity among households. That seroprevalence varied between 6 and 91% (5th and 95th percentiles of the household seropositivity distribution). Conclusion: The major finding is the household effect, with a strong heterogeneity of seroprevalence among households. This effect may be explained by common exposures of household members to local risk factors. Future work will quantify the link between the presence of oocysts in the soil and the seroprevalence of exposed households using a spatial analysis.
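As a rough arithmetic check of the overall seroprevalence figure quoted in this abstract (47% of 273 persons, 95% confidence interval 41 to 53%), the sketch below computes a normal-approximation (Wald) interval. The abstract does not say which interval method the authors used, so the Wald formula, the function name `wald_interval`, and the use of the rounded 47% as the point estimate are all assumptions made for illustration.

```python
import math


def wald_interval(p_hat, n, z=1.96):
    """Normal-approximation (Wald) interval for a proportion.

    Hypothetical helper, not taken from the paper: p_hat is the observed
    proportion, n the sample size, z the normal quantile (1.96 for 95%).
    """
    se = math.sqrt(p_hat * (1.0 - p_hat) / n)  # standard error of p_hat
    return p_hat - z * se, p_hat + z * se


# Figures quoted in the abstract: 47% seropositive among 273 persons.
low, high = wald_interval(0.47, 273)
print(f"{low:.1%} to {high:.1%}")  # prints "41.1% to 52.9%"
```

Under these assumptions the result rounds to the reported 41 to 53%, which suggests the published interval is at least consistent with a simple binomial approximation.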

    diArk 2.0 provides detailed analyses of the ever increasing eukaryotic genome sequencing data

    Background: Nowadays, the sequencing of even the largest mammalian genomes has become a question of days with current next-generation sequencing methods. It comes as no surprise that dozens of genome assemblies are now released per month. Since the number of next-generation sequencing machines increases worldwide and new major sequencing plans are announced, a further increase in the speed of releasing genome assemblies is expected. Thus it becomes increasingly important to get an overview as well as detailed information about available sequenced genomes. The different sequencing and assembly methods have specific characteristics that need to be known to evaluate the various genome assemblies before performing subsequent analyses. Results: diArk has been developed to provide fast and easy access to all sequenced eukaryotic genomes worldwide. Currently, diArk 2.0 contains information about more than 880 species and more than 2350 genome assembly files. Metadata such as sequencing and read-assembly methods, sequencing coverage, GC-content, extended lists of alternatively used scientific names and common species names, and various kinds of statistics are provided. To make the data intuitive to approach, the web interface makes extensive use of modern web techniques. A number of search modules and result views facilitate finding and judging the data of interest. Subscribing to the RSS feed is the easiest way to stay up-to-date with the latest genome data. Conclusions: diArk 2.0 is the most up-to-date database of sequenced eukaryotic genomes compared to databases like GOLD, NCBI Genome, NHGRI, and ISC. It is different in that only those projects are stored for which genome assembly data or considerable amounts of cDNA data are available. Projects in planning stage or in the process of being sequenced are not included. The user can easily search through the provided data and directly access the genome assembly files of the sequenced genome of interest. diArk 2.0 is available at http://www.diark.org.

    The evolution of plasmid-carried antibiotic resistance

    BACKGROUND: Antibiotic resistance represents a significant public health problem. When resistance genes are mobile, being carried on plasmids or phages, their spread can be greatly accelerated. Plasmids in particular have been implicated in the spread of antibiotic resistance genes. However, the selective pressures which favour plasmid-carried resistance genes have not been fully established. Here we address this issue with mathematical models of plasmid dynamics in response to different antibiotic treatment regimes. RESULTS: We show that transmission of plasmids is a key factor influencing plasmid-borne antibiotic resistance, but the dosage and interval between treatments are also important. Our results also hold when plasmids carrying the resistance gene are in competition with other plasmids that do not carry the resistance gene. By altering the interval between antibiotic treatments, and the dosage of antibiotic, we show that different treatment regimes can select for either plasmid-carried, or chromosome-carried, resistance. CONCLUSIONS: Our research addresses the effect of environmental variation on the evolution of plasmid-carried antibiotic resistance.

    Seroprevalence of Toxoplasma gondii infection in arthritis patients in eastern China

    Background: There is accumulating evidence for an increased susceptibility to infection in patients with arthritis. We sought to understand the epidemiology of Toxoplasma gondii infection in arthritis patients in eastern China, given the paucity of data on the magnitude of T. gondii infection in these patients. Methods: Seroprevalence of T. gondii infection was assessed by enzyme-linked immunosorbent assay using a crude antigen of the parasite in 820 arthritic patients, and an equal number of healthy controls, from Qingdao and Weihai cities, eastern China. Sociodemographic, clinical and lifestyle information on the study participants was also obtained. Results: The prevalence of anti-T. gondii IgG was significantly higher in arthritic patients (18.8%) than in healthy controls (12%) (P < 0.001). Twelve patients with arthritis had anti-T. gondii IgM antibodies, comparable with 10 control subjects (1.5% vs 1.2%). Demographic factors did not significantly influence these seroprevalence frequencies. The highest T. gondii infection seropositivity rate was detected in patients with rheumatoid arthritis (24.8%), followed by reactive arthritis (23.8%), osteoarthritis (19%), infectious arthritis (18.4%) and gouty arthritis (14.8%). Seroprevalence rates in rheumatoid arthritis and reactive arthritis were significantly higher than in controls (P < 0.001 and P = 0.002, respectively). A significant association was detected between T. gondii infection and cats being present in the home in arthritic patients (odds ratio [OR], 1.68; 95% confidence interval [CI]: 1.24 – 2.28; P = 0.001). Conclusions: These findings are consistent with and extend previous results, providing further evidence to support a link between contact with cats and an increased risk of T. gondii infection. Our study is also the first to confirm an association between T. gondii infection and arthritis patients in China. Implications for better prevention and control of T. gondii infection in arthritis patients are discussed. Trial registration: This is an epidemiological survey, therefore trial registration was not required.

    Interactions between the night time valley-wind system and a developing cold-air pool

    This is a pre-copyedited, author-produced PDF of an article accepted for publication in Boundary-Layer Meteorology following peer review. The version of record [Arduini, G., Staquet, C. & Chemel, C., ‘Interactions between the night time valley-wind system and a developing cold-air pool’, Boundary-Layer Meteorol (2016) 161:1 (49-72), first published online June 2, 2016] is available at Springer online at doi: 10.1007/s10546-016-0155-8. The Weather Research and Forecast (WRF) numerical model is used to characterize the influence of a thermally-driven down-valley flow on a developing cold-air pool in an idealized alpine valley decoupled from the atmosphere above. Results for a three-dimensional (3D) valley, which allows for the formation of a down-valley flow, and for a two-dimensional (2D) valley, where the formation of a down-valley flow is inhibited, are analyzed and compared. A key result is that advection leads to a net cooling in the 2D valley and to a warming in the 3D valley, once the down-valley flow is fully developed. This difference stems from the suppression of the slope-flow induced upward motions over the valley centre in the 3D valley. As a result, the downslope flows develop a cross-valley circulation within the cold-air pool, the growth of the cold-air pool is reduced and the valley atmosphere is generally warmer than in the 2D valley. A quasi-steady state is reached for which the divergence of the down-valley flow along the valley is balanced by the convergence of the downslope flows at the top of the cold-air pool, with no net contribution of subsiding motions far from the slope layer. More precisely, the inflow of air at the top of the cold-air pool is found to be driven by an interplay between the return flow from the plain region and subsidence over the plateaux. Finally, the mechanisms that control the structure of the cold-air pool and its evolution are found to be independent of the valley length as soon as the quasi-steady state is reached and the down-valley flow is fully developed.