2 research outputs found

    4Pipe4-A 454 data analysis pipeline for SNP detection in datasets with no reference sequence or strain information

    Get PDF
    This work was fully supported by projects SOBREIRO/0036/2009 (under the framework of the Cork Oak ESTs Consortium), PTDC/BIA-BEC/098783/2008 and PTDC/AGR-GPL/119943/2010 from Fundação para a CiĂȘncia e Tecnologia (FCT) – Portugal. F. Pina-Martins was funded by FCT grant SFRH/BD/51411/2011, under the PhD program “Biology and Ecology of Global Changes”, Univ. Aveiro & Univ. Lisbon, Portugal. D. Batista was funded by FCT grant SFRH/BPD/104629/2014

    Analyses of the genomic variation to study cork oak evolution and adaptation : from past to future climatic changes

    Get PDF
    Tese de doutoramento, Biologia e Ecologia das AlteraçÔes Globais (Biologia do Genoma e Evolução), Universidade de Lisboa, Faculdade de CiĂȘncias, 2018Current scientific literature indicates that climate change will cause an average world temperature increase between 1 and 4ÂșC, along with changes in precipitation patterns and extreme weather events in the next 50 years. These are likely to have a negative impact for biodiversity in general, and forest ecosystems should be particularly affected, especially those in Mediterranean areas, like the cork oak (Quercus suber L.) “montados”. In order to understand how species can respond to such alterations, it is important to know their evolutionary history and genetic architecture of adaptive traits. Advances in sequencing technologies have relatively recently brought down the cost of sequencing per base pair to a point where even small research facilities can obtain genomic information of non-model organisms. These advances made SNP markers become the most abundant type of genetic variation in eukaryotic genomes, especially with the advent of Reduced Representation libraries such as RAD-Seq and GBS. Yet, despite their widespread use, SNP data analyses still bore its own set of bioinformatics challenges. While most of these are related with the practical aspects of the process, such as being able to handle very large datasets, or discriminate between neutral and non-neutral markers, some fundamental problems, like reproducibility are also important issues affecting research in this area. In this thesis, genomic and transcriptomic data from Q. suber was used to assess the evolutionary history of the species, detect the effects of natural selection across the cork oak’s distribution range and find any associations between the obtained markers and environmental variables. The main methodological contributions of this thesis are in the form of three software suites: (1) 4Pipe4, a software for automatically mining SNP markers from NGS data when no reference genome nor strain information is present, (2) NCBI Mass Sequence Downloader, a program to automate the downloading of large datasets from the NCBI databases, and (3) Structure_threader, a software to automate and parallelize analyses using several popular clustering analyses programs. All of these programs were developed with the intent to improve the automation and reproducibility value of the analysis processes they are meant to be part of. The main findings of this thesis are that (1) the evolutionary history and population structure of Q. suber is not as neatly structured as chloroplastidial markers indicate, (2) local adaptation plays and important role in the distribution of the species’ genetic variability, and (3) the cork oak may be better equipped, from a genetic point of view, to adapt to climate change than what previous studies based solely on ecological modelling indicated
    corecore