11 research outputs found

    CMU Common Lisp user's manual Mach IBM RT edition

    No full text
    Computer Science Departmen

    ShizukaMcDonald_data

    No full text
    Compressed file containing 172 data matrices. The cell values represent the number of times the row individual 'beat' the column individual

    Metadata

    No full text
    This .csv file includes citation information and some basic data for each data matrix

    Assemblies

    No full text
    This is a tarred (compressed) file that contains assemblies to an artificial reference genome. When untarred, this file will contain >800 files in .bam format, one for each individual fish. These files were produced using bwa (Burrows-Wheeler Aligner, Li and Durbin 2009)

    lib2_sucker_barcode_key

    No full text
    Barcode-ID pairs for individual fish for lane 2 of sequencing. (Please note that there are individuals included in this barcode file that belong to a different project, and are not discussed in the related paper.

    Locations

    No full text
    Approximate latitude and longitude of samped populations. NOTE: these are estimated from maps, and do not represent exact sampling locations, as GPS coordinates were not taken in the field

    artificial reference genome

    No full text
    This artificial reference genome was constructed via a de novo assembly (using DNAstar's smng) of a subset of 25 million short sequence reads, distributed across all taxa. We then assembled short reads to thi

    names_common_variants_0.5

    No full text
    This file is the formatted input file for entropy, the hierarchical Bayesian model used in this study. The file contains data for a subset of SNPs, in a simplified genotype likelihood format. SNPs in this file have a minor allele frequency >0.05, and a maximum of 1 alternate allele. This file also contains only one locus per contig

    lib1_sucker_barcode_key

    No full text
    Barcode-ID pairs for individual fish for lane 1 of sequencing. (Please note that there are individuals included in this barcode file that belong to a different project, and are not discussed in the related paper.

    VCF file

    No full text
    This is the raw VCF file containing all 32,978 variant genetic sites initially identified from assembled reads
    corecore