4 research outputs found

    Alpine reference database EMBL (gh)

    No full text
    This file contains dereplicated sequences of alpine taxa extracted from the EMBL nucleic database release 117 using the ecopcr program (http://metabarcoding.org/obitools). Sequences have been extracted using the primers “g” (5’-GGGCAATCCTGAGCCAA-3’) and “h” (5’-CCATTGAGTCTCTGCACCTATC-3’) (Taberlet et al. 2007). Then, only alpine taxa were selected

    Unfiltered sequencing data (fasta format)

    No full text
    This fasta file contains unfiltered sequencing data (i.e. merged reads assigned to their original sample). Amplicons were amplified using the universal primers “g” (5’-GGGCAATCCTGAGCCAA-3’) and “h” (5’-CCATTGAGTCTCTGCACCTATC-3’) (Taberlet et al. 2007). Sequences were obtained by a 2 x 100 bp paired-end sequencing on Illumina HiSeq 2500 platform. First processing steps were performed using the OBITOOLS software (http://metabarcoding.org/obitools) as follows: (i) Direct and reverse reads corresponding to the same sequence were aligned and merged thanks to the IlluminaPairEnd program. Only merged sequences with a high alignment quality score were retained (>=40) (ii) Each merged sequence was assigned to its original sample using the tags information previously added to primers thanks to the ngsfilter program. For this step, only sequences containing both primers (with a maximum of 3 mismatches per primer) and exact tag sequences were selected. (iii) To reduce the file size, strictly identical sequences were merged together while keeping information about the origin of sequences

    Unfiltered sequencing data (table format)

    No full text
    This table contains pre-filtered sequencing data (i.e. usable merged reads assigned to their original sample). Amplicons were amplified using the universal primers “g” (5’-GGGCAATCCTGAGCCAA-3’) and “h” (5’-CCATTGAGTCTCTGCACCTATC-3’) (Taberlet et al. 2007). Sequences were obtained by a 2 x 100 bp paired-end sequencing on Illumina HiSeq 2500 platform. First processing steps were performed using the OBITOOLS software (http://metabarcoding.org/obitools) as follows: (i) Direct and reverse reads corresponding to the same sequence were aligned and merged thanks to the IlluminaPairEnd program. Only merged sequences with a high alignment quality score were retained (>=40). (ii) Each merged sequence was assigned to its original sample using the tags information previously added to primers thanks to the ngsfilter program. For this step, only sequences containing both primers (with a maximum of 3 mismatches per primer) and exact tag sequences were selected. (iii) To reduce the file size, strictly identical sequences were merged together while keeping information about the origin of sequences. (iv) Sequences containing ambiguous nucleotides or shorter than 7 bp were discarded. (v) Pure singleton (sequences with 1 read among the whole dataset) were removed

    Global reference database EMBL (gh)

    No full text
    This file contains all dereplicated sequences of alpine taxa extracted from the EMBL nucleic database release 117 using the ecopcr program (http://metabarcoding.org/obitools). Sequences have been extracted using the primers “g” (5’-GGGCAATCCTGAGCCAA-3’) and “h” (5’-CCATTGAGTCTCTGCACCTATC-3’) (Taberlet et al. 2007)
    corecore