13 research outputs found

    Whole genome sequencing of an ethnic Pathan (Pakhtun) from the north-west of Pakistan.

    Get PDF
    BACKGROUND: Pakistan covers a key geographic area in human history, being both part of the Indus River region that acted as one of the cradles of civilization and as a link between Western Eurasia and Eastern Asia. This region is inhabited by a number of distinct ethnic groups, the largest being the Punjabi, Pathan (Pakhtuns), Sindhi, and Baloch. RESULTS: We analyzed the first ethnic male Pathan genome by sequencing it to 29.7-fold coverage using the Illumina HiSeq2000 platform. A total of 3.8 million single nucleotide variations (SNVs) and 0.5 million small indels were identified by comparing with the human reference genome. Among the SNVs, 129,441 were novel, and 10,315 nonsynonymous SNVs were found in 5,344 genes. SNVs were annotated for health consequences and high risk diseases, as well as possible influences on drug efficacy. We confirmed that the Pathan genome presented here is representative of this ethnic group by comparing it to a panel of Central Asians from the HGDP-CEPH panels typed for ~650 k SNPs. The mtDNA (H2) and Y haplogroup (L1) of this individual were also typical of his geographic region of origin. Finally, we reconstruct the demographic history by PSMC, which highlights a recent increase in effective population size compatible with admixture between European and Asian lineages expected in this geographic region. CONCLUSIONS: We present a whole-genome sequence and analyses of an ethnic Pathan from the north-west province of Pakistan. It is a useful resource to understand genetic variation and human migration across the whole Asian continent

    Phaeodactylum tricornutum génome et épigénome : caractérisation des variantes naturelles

    No full text
    Since the discovery of Phaeodactylum tricornutum by Bohlin in 1897, its classification within the tree of life has been controversial. It was in 1958 when Lewin, using oval and fusiform morphotypes, described multiple characteristic features of this species that resemble diatoms structure, the debate to whether classify P. tricornutum as a member of Bacillariophyceae was ended. To this point three morphotypes (oval, fusiform and triradiate) of Phaeodactylum tricornutum have been observed. Over the course of approximately 100 years, from 1908 till 2000, 10 strains of Phaeodactylum tricornutum (referred to asecotypes) have been collected and stored axenically as cryopreserved stocks at various stock centers. Various cellular and molecular tools have been established to dissect and understand the physiology and evolution of P. tricornutum, and/or diatoms in general. It is because of decades of research and efforts by many laboratories that now P. tricornutum is considered to be a model diatom species. My thesis majorly focuses in understanding the genetic and epigenetic makeup of P. tricornutum genome to decipher the underlying morphological and physiological diversity within different ecotype populations. To do so, I established the epigenetic landscape within P. tricornutum genome using various histone post-translational modification marks (chapter 1 and chapter 2) and also compared the natural variation in the distribution of some key histone PTMs between two ecotype populations (chapter 4). We also generated a genome-wide genetic diversity map across 10 ecotypes of P. tricornutum revealing the presence of a species-complex within the genus Phaeodactylum as aconsequence of ancient hybridization (Chapter 3). Based on the evidences from many previous reports and similar observations within P. tricornutum, we propose natural hybridization as a strong and potential foundation for explaining unprecedented species diversity within the diatom clade. Moreover, we updated the functional and structural annotations of P. tricornutum genome (Phatr3, chapter 2) and developed a user-friendly software algorithm to fetch CRISPR/Cas9 targets, which is a basis to perform knockout studies using CRISPR/Cas9 genome editing protocol, in 13 phytoplankton genomes including P. tricornutum (chapter 5). To accomplish all this, I used various state-of-the-art technologies like Mass-Spectrometry, ChIPsequencing, Whole genome sequencing, RNA sequencing, CRISPR genome editing protocols and several computational softwares/pipelines. In brief, the thesis work provides a comprehensive platform for future epigenetic, genetic and functional molecular studies in diatoms using Phaeodactylum tricornutum as a model. The work is an addon value to the current state of diatom research by answering questions that have never been asked before and opens a completely new horizon and demand of epigenetics research that underlie the ecological success of diatoms in modern-day ocean.Depuis la dĂ©couverte de Phaeodactylum tricornutum par Bohlin en 1897, sa classification au sein de l'arbre de la vie a Ă©tĂ© controversĂ©e. En utilisant des morphotypes ovales et fusiformes Lewin a dĂ©crit en 1958 plusieurs traits caractĂ©ristiques de cette espĂšce rappelant la structure des diatomĂ©es mettant ainsi fin Ă  la controverse sur la classification de P. tricornutum au sein des BacillariophycĂ©es. Pour se faire, trois morphotypes (ovale, fusiforme et triradiĂ©) de Phaeodactylum tricornutum ont Ă©tĂ© observĂ©s. Au cours d’une centaine d’annĂ©es environ, de 1908 Ă  2000, 10 souches de Phaeodactylum tricornutum (appelĂ©es Ă©cotypes) ont Ă©tĂ© collectĂ©es et stockĂ©es soit de maniĂšre axĂ©nique ou en l’état avec leur populations naturelles de bactĂ©ries dans les centres des ressources gĂ©nĂ©tiques pour algues, cryo-prĂ©servĂ©es quand cela est possible. Divers outils cellulaires et molĂ©culaires ont Ă©tĂ© Ă©tablis pour dissĂ©quer et comprendre la physiologie et l'Ă©volution de P. tricornutum, et/ou les diatomĂ©es en gĂ©nĂ©ral. GrĂące Ă  des dĂ©cennies de recherche et les efforts dĂ©ployĂ©s par de nombreux laboratoires que P. tricornutum est aujourd’hui considĂ©rĂ©e comme une espĂšce modĂšle des diatomĂ©es. Le sujet de ma thĂšse traite majoritairement de la composition gĂ©nĂ©tique et Ă©pigĂ©nĂ©tique du gĂ©nome de P. tricornutum ainsi que de la diversitĂ© morphologique et physiologique sousjacente au sein des populations naturelles prospectĂ©es Ă  diffĂ©rents endroits du globe. Pour se faire, j’ai gĂ©nĂ©rĂ© les profils chromatiniens en utilisant diffĂ©rentes marques des modifications post-traductionnelles des histones (chapitres 1 et 2) et a Ă©galement comparĂ© la variation naturelle dans la distribution de certaines marques clĂ©s entre deux populations d’écotypes (chapitre 4). Nous avons Ă©galement gĂ©nĂ©rĂ© une carte de la diversitĂ© gĂ©nĂ©tique Ă  l’échelle du gĂ©nome chez 10 Ă©cotypes de P. tricornutum rĂ©vĂ©lant ainsi la prĂ©sence d'un complexe d'espĂšces dans le genre Phaeodactylum comme la consĂ©quence d’une hybridation ancienne (chapitre 3). Sur la base de nombreux rapports antĂ©rieurs et des observations similaires au sein de P. tricornutum, nous proposons l’hybridation naturelle comme une base solide et une possibilitĂ© plausible pour expliquer la diversitĂ© des espĂšces chez lest diatomĂ©es. De plus, nous avons mis Ă  jour les annotations fonctionnelles et structurelles du gĂ©nome de P. tricornutum (Phatr3, chapitre 2) et mis au point un algorithme de logiciel convivial pour aller chercher les cibles CRISPR du systĂšme d’édition du gĂ©nome CRISPR / cas9 chez 13 gĂ©nomes de phytoplancton incluant P. tricornutum (chapitre 5). Pour accomplir tout cela, j'ai utilisĂ© diverses mĂ©thodes Ă  la pointe de l’état de l’art comme la spectromĂ©trie de masse, l’immunoprĂ©cipitation de la chromatine suivie de sĂ©quençage Ă  haut dĂ©bit ainsi que les sĂ©quençages du gĂ©nome entier, de l'ARN et des protocoles d'Ă©dition du gĂ©nome CRISPR et plusieurs logiciels / pipelines de calcul. Ainsi, le travail de thĂšse fournit une plate-forme complĂšte qui pourra ĂȘtre utilisĂ©e Ă  l’avenir pour des Ă©tudes Ă©pigĂ©nĂ©tiques, de gĂ©nĂ©tiques molĂ©culaires et fonctionnelles chez les diatomĂ©es en utilisant comme espĂšce modĂšle Phaeodactylum tricornutum. Ce travail est pionnier et reprĂ©sente une valeur ajoutĂ©e importante dans le domaine de la recherche sur les diatomĂ©es en rĂ©pondant Ă  des questions nouvelles ouvrant ainsi de nouveaux horizons Ă  la recherche en particulier en Ă©pigĂ©nĂ©tique qui joue un rĂŽle important mais pas encore assez apprĂ©ciĂ© dans le succĂšs Ă©cologique des diatomĂ©es dans les ocĂ©ans actuels

    PhytoCRISP-Ex: a web-based and stand-alone application to find specific target sequences for CRISPR/CAS editing

    No full text
    International audienceBackground: With the emerging interest in phytoplankton research, the need to establish genetic tools for the functional characterization of genes is indispensable. The CRISPR/Cas9 system is now well recognized as an efficient and accurate reverse genetic tool for genome editing. Several computational tools have been published allowing researchers to find candidate target sequences for the engineering of the CRISPR vectors, while searching possible off-targets for the predicted candidates. These tools provide built-in genome databases of common model organisms that are used for CRISPR target prediction. Although their predictions are highly sensitive, the applicability to non-model genomes, most notably protists, makes their design inadequate. This motivated us to design a new CRISPR target finding tool, PhytoCRISP-Ex. Our software offers CRIPSR target predictions using an extended list of phytoplankton genomes and also delivers a user-friendly standalone application that can be used for any genome. Results: The software attempts to integrate, for the first time, most available phytoplankton genomes information and provide a web-based platform for Cas9 target prediction within them with high sensitivity. By offering a standalone version, PhytoCRISP-Ex maintains an independence to be used with any organism and widens its applicability in high throughput pipelines. PhytoCRISP-Ex out pars all the existing tools by computing the availability of restriction sites over the most probable Cas9 cleavage sites, which can be ideal for mutant screens. Conclusions: PhytoCRISP-Ex is a simple, fast and accurate web interface with 13 pre-indexed and presently updating phytoplankton genomes. The software was also designed as a UNIX-based standalone application that allows the user to search for target sequences in the genomes of a variety of other species

    Probing the evolutionary history of epigenetic mechanisms: what can we learn from marine diatoms

    No full text
    Recent progress made on epigenetic studies revealed the conservation of epigenetic features in deep diverse branching species including Stramenopiles, plants and animals. This suggests their fundamental role in shaping species genomes across different evolutionary time scales. Diatoms are a highly successful and diverse group of phytoplankton with a fossil record of about 190 million years ago. They are distantly related from other super-groups of Eukaryotes and have retained some of the epigenetic features found in mammals and plants suggesting their ancient origin. Phaeodactylum tricornutum and Thalassiosira pseudonana, pennate and centric diatoms, respectively, emerged as model species to address questions on the evolution of epigenetic phenomena such as what has been lost, retained or has evolved in contemporary species. In the present work, we will discuss how the study of non-model or emerging model organisms, such as diatoms, helps understand the evolutionary history of epigenetic mechanisms with a particular focus on DNA methylation and histone modifications

    Additional file 1: Figure S1. of PhytoCRISP-Ex: a web-based and stand-alone application to find specific target sequences for CRISPR/CAS editing

    No full text
    PhytoCRISP-Ex efficacy. The scatter plot depicts that most of the predicted Cas9 targets per gene in Thalassiosira pseudonana and Phaeodactylum tricornutum, respectively, are potential candidates (passing both PhytoCRISP-Ex filter). X-axis represents the percent efficiency of each gene in terms of having high number of potential Cas9 targets compared to the total number of targets. Y-axis represents the number of all potential targets per gene. (PDF 182 kb

    A genomics approach reveals the global genetic polymorphism, structure, and functional diversity of ten accessions of the marine model diatom Phaeodactylum tricornutum

    No full text
    Diatoms emerged in the Mesozoic period and presently constitute one of the main primary producers in the world's ocean and are of a major economic importance. In the current study, using whole genome sequencing of ten accessions of the model diatom Phaeodactylum tricornutum, sampled at broad geospatial and temporal scales, we draw a comprehensive landscape of the genomic diversity within the species. We describe strong genetic subdivisions of the accessions into four genetic clades (A-D) with constituent populations of each clade possessing a conserved genetic and functional makeup, likely a consequence of the limited dispersal of P. tricornutum in the open ocean. We further suggest dominance of asexual reproduction across all the populations, as implied by high linkage disequilibrium. Finally, we show limited yet compelling signatures of genetic and functional convergence inducing changes in the selection pressure on many genes and metabolic pathways. We propose these findings to have significant implications for understanding the genetic structure of diatom populations in nature and provide a framework to assess the genomic underpinnings of their ecological success and impact on aquatic ecosystems where they play a major role. Our work provides valuable resources for functional genomics and for exploiting the biotechnological potential of this model diatom species

    Integrative analysis of large scale transcriptome data draws a comprehensive landscape of Phaeodactylum tricornutum genome and evolutionary origin of diatoms

    No full text
    Abstract Diatoms are one of the most successful and ecologically important groups of eukaryotic phytoplankton in the modern ocean. Deciphering their genomes is a key step towards better understanding of their biological innovations, evolutionary origins, and ecological underpinnings. Here, we have used 90 RNA-Seq datasets from different growth conditions combined with published expressed sequence tags and protein sequences from multiple taxa to explore the genome of the model diatom Phaeodactylum tricornutum, and introduce 1,489 novel genes. The new annotation additionally permitted the discovery of extensive alternative splicing in diatoms, including intron retention and exon skipping, which increase the diversity of transcripts generated in changing environments. In addition, we have used up-to-date reference sequence libraries to dissect the taxonomic origins of diatom genes. We show that the P. tricornutum genome is enriched in lineage-specific genes, with up to 47% of the gene models present only possessing orthologues in other stramenopile groups. Finally, we have performed a comprehensive de novo annotation of repetitive elements showing novel classes of transposable elements such as SINE, MITE and TRIM/LARD. This work provides a solid foundation for future studies of diatom gene function, evolution and ecology

    Changes in functional composition and gene expression in eukaryotic plankton at the Atlantic-Arctic Polar front

    No full text
    Abstract Planktonic communities are shaped by ocean currents and participate in multiple global biogeochemical cycles. Unraveling how the functions of communities respond to strong environmental gradients will improve our understanding of the interactions between climate change and marine ecosystems. Here, we investigate changes in functions and gene expression of eukaryotic plankton transiting between the North Atlantic Ocean (NAO) in winter and the Arctic Ocean (AO) in spring/summer using metatranscriptomes and metagenomes from Tara Oceans. In Arctic communities, functions involved in maintaining the protein pool and translation machinery appear to be more active. Four major phylogenetically distant algal groups are abundant in both basins and show similar strategies at transcriptional level, including increased expression of some functions related to cold acclimation. These results shade lights on gene expression strategies shared by cosmopolitan phototrophs of widely separated lineages
    corecore