40 research outputs found

    Sequences after initial processing in QIIME (seqs.fna, library 1)

    No full text
    Raw sequence data from the Roche 454 GS FLX sequencer, region 1 (split_library_output_1). These data are the output of the command: split_libraries.py -m 454_Map.txt -f 1.TCA.454Reads.fna -q 1.TCA.454Reads.qual -o split_library_output_1/ -l 100 -L 700 -H 9 -M 2 -b 1

    Lab protocol and sequencing report

    No full text
    Contains: i) a summary of the ddRAD labwork, description of final libraries and sequencing output, ii) modified ddRAD sequencing protocol and iii) sequencing quality control reports for each lane

    Figures

    No full text
    Contains the R markdown files that were used to generate the Figures from the main text (Figures.Rmd) and the figures from the supporting information (SupportingInfomation_*). Needs the data and scripts from the *2.R analyses of Stacks outputs* section of this repositor

    Datasets

    No full text
    This .zip files contains two folders. The folder "Control" contains the 15 raw 454 sequence files generated from the "Test Pools" analysis detailed in Table 2 of the manuscript, and a file of 27 Sanger sequences listed in Table S1. The folder "Tenerife" contains the 6 raw 454 sequence files associated generated from the "Tenerife Forest Samples" analysis detailed in Table 3 of the manuscript

    0.Demultiplexing and dropbase

    No full text
    Contains the custom Perl scripts of the pipeline used to demultiplex raw reads and drop a base position that was causing a lane effect

    2.R analyses of Stacks outputs

    No full text
    Contains the R scripts, input-output data and metainformation used to perform the analyses described in the *General processing of Stacks outputs* and *Error rates* of the manuscript

    Sampling localities

    No full text
    Cointains and the geographic information of sampling sites. See tab "B.alpina B.moranensis" for Berberis alpina and B. moranensis populations and "Berberis outgroups" for B. trifolia and B. pallid

    Primers, MIDs, sequence formats and consensus reference sequence used in this study.

    No full text
    <p>ColFol-for ColFol-rev are the Sanger primers. 454-ColFol-for and 454-Col307-rev are the primers for mass amplification. Adaptors A and B are used by the ‘454’ sequencer to attach individual DNA molecules to microscopic beads, for subsequent sequencing. MIDs (Multiplex Identifiers) are 7 bp sequences that allow different samples to be sequenced together on a single ‘454’ plate and then separated bioinformatically for downstream analysis. There is no MID with 454-Col307-rev because we only pyrosequenced from the forward direction. Row 5 is an example of a 454 read after demultiplexing with the Roche tools. The forward primer is underlined, and the reverse primer dashed underlined. The MID tag is removed during demultiplexing. Row 6 is an example of a sequence after processing of sequences to produce a file of unique sequences.</p

    1.Running Stacks

    No full text
    Contains scripts and part of the output data of running *Stacks* with the demultiplexed-lane-effect-corrected data (see section **Demultiplexing and dropbase/0.1DropBase** of this repository) available at the Sequence Read Archive (SRA), accession SRP035472; to subsequently produce SNP (*.SNP) and coverage (*.COV) tsv matrices and to run the *populations* program of Stacks with the selected loci of the downstream analyses. The Stacks parameter values corresponds to the experiments defined as *1) Exploratory analysis of Stacks assembly key parameters and SNP calling model using replicates* and *2) Effect of using different parameters on the output information content and on the detection of genetic structuring
    corecore