Analyses of the genomic variation to study cork oak evolution and adaptation : from past to future climatic changes

Abstract

Tese de doutoramento, Biologia e Ecologia das Alterações Globais (Biologia do Genoma e Evolução), Universidade de Lisboa, Faculdade de Ciências, 2018Current scientific literature indicates that climate change will cause an average world temperature increase between 1 and 4ºC, along with changes in precipitation patterns and extreme weather events in the next 50 years. These are likely to have a negative impact for biodiversity in general, and forest ecosystems should be particularly affected, especially those in Mediterranean areas, like the cork oak (Quercus suber L.) “montados”. In order to understand how species can respond to such alterations, it is important to know their evolutionary history and genetic architecture of adaptive traits. Advances in sequencing technologies have relatively recently brought down the cost of sequencing per base pair to a point where even small research facilities can obtain genomic information of non-model organisms. These advances made SNP markers become the most abundant type of genetic variation in eukaryotic genomes, especially with the advent of Reduced Representation libraries such as RAD-Seq and GBS. Yet, despite their widespread use, SNP data analyses still bore its own set of bioinformatics challenges. While most of these are related with the practical aspects of the process, such as being able to handle very large datasets, or discriminate between neutral and non-neutral markers, some fundamental problems, like reproducibility are also important issues affecting research in this area. In this thesis, genomic and transcriptomic data from Q. suber was used to assess the evolutionary history of the species, detect the effects of natural selection across the cork oak’s distribution range and find any associations between the obtained markers and environmental variables. The main methodological contributions of this thesis are in the form of three software suites: (1) 4Pipe4, a software for automatically mining SNP markers from NGS data when no reference genome nor strain information is present, (2) NCBI Mass Sequence Downloader, a program to automate the downloading of large datasets from the NCBI databases, and (3) Structure_threader, a software to automate and parallelize analyses using several popular clustering analyses programs. All of these programs were developed with the intent to improve the automation and reproducibility value of the analysis processes they are meant to be part of. The main findings of this thesis are that (1) the evolutionary history and population structure of Q. suber is not as neatly structured as chloroplastidial markers indicate, (2) local adaptation plays and important role in the distribution of the species’ genetic variability, and (3) the cork oak may be better equipped, from a genetic point of view, to adapt to climate change than what previous studies based solely on ecological modelling indicated

    Similar works