
    SNP discovery in swine by reduced representation and high throughput pyrosequencing

    Background: Relatively little information is available for sequence variation in the pig. We previously used a combination of short read (25 base pair) high-throughput sequencing and reduced genomic representation to discover > 60,000 single nucleotide polymorphisms (SNP) in cattle, but the current lack of complete genome sequence limits this approach in swine. Longer-read pyrosequencing-based technologies have the potential to overcome this limitation by providing sufficient flanking sequence information for assay design. Swine SNP were discovered in the present study using a reduced representation of 450 base pair (bp) porcine genomic fragments (approximately 4% of the swine genome) prepared from a pool of 26 animals relevant to current pork production, and a GS-FLX instrument producing 240 bp reads. Results: Approximately 5 million sequence reads were collected and assembled into contigs having an overall observed depth of 7.65-fold coverage. The approximate minor allele frequency was estimated from the number of observations of the alternate alleles. The average coverage at the SNPs was 12.6-fold. This approach identified 115,572 SNPs in 47,830 contigs. Comparison to partial swine genome draft sequence indicated 49,879 SNP (43%) and 22,045 contigs (46%) mapped to a position on a sequenced pig chromosome and the distribution was essentially random. A sample of 176 putative SNPs was examined and 168 (95.5%) were confirmed to have segregating alleles; the correlation of the observed minor allele frequency (MAF) to that predicted from the sequence data was 0.58. Conclusion: The process was an efficient means to identify a large number of porcine SNP having high validation rate to be used in an ongoing international collaboration to produce a highly parallel genotyping assay for swine. By using a conservative approach, a robust group of SNPs were detected with greater confidence and relatively high MAF that should be suitable for genotyping in a wide variety of commercial populations.
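As an illustration of the allele-frequency estimate mentioned in the abstract above, here is a minimal sketch. It assumes the estimate is simply the share of reads carrying the less common allele at a site; the abstract does not spell out the authors' exact procedure. The function name and the read counts are hypothetical.

```python
def estimate_maf(ref_reads: int, alt_reads: int) -> float:
    """Approximate minor allele frequency from read counts at one SNP site.

    Assumption: each read is an independent observation of one allele
    drawn from the pooled DNA sample.
    """
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads observed at this site")
    # The minor allele is whichever allele was observed less often.
    return min(ref_reads, alt_reads) / total


# Hypothetical site covered by 12 reads: 9 support one allele, 3 the other.
print(estimate_maf(9, 3))
```

With only about a dozen reads per site, an estimate of this kind is coarse, which is consistent with the moderate correlation (0.58) the abstract reports between predicted and observed MAF.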

    Novel porcine repetitive elements

    BACKGROUND: Repetitive elements comprise ~45% of mammalian genomes and are increasingly known to impact genomic function by contributing to the genomic architecture, by direct regulation of gene expression and by affecting genomic size, diversity and evolution. The ubiquity and increasingly understood importance of repetitive elements contribute to the need to identify and annotate them. We set out to identify previously uncharacterized repetitive DNA in the porcine genome. Once found, we characterized the prevalence of these repeats in other mammals. RESULTS: We discovered 27 repetitive elements in 220 BACs covering 1% of the porcine genome (Comparative Vertebrate Sequencing Initiative; CVSI). These repeats varied in length from 55 to 1059 nucleotides. To estimate copy numbers, we went to an independent source of data, the BAC-end sequences (Wellcome Trust Sanger Institute), covering approximately 15% of the porcine genome. Copy numbers in BAC-ends were less than one hundred for 6 repeat elements, between 100 and 1000 for 16 and between 1,000 and 10,000 for 5. Several of the repeat elements were found in the bovine genome and we have identified two with orthologous sites, indicating that these elements were present in their common ancestor. None of the repeat elements were found in primate, rodent or dog genomes. We were unable to identify any of the replication machinery common to active transposable elements in these newly identified repeats. CONCLUSION: The presence of both orthologous and non-orthologous sites indicates that some sites existed prior to speciation and some were generated later. The identification of low to moderate copy number repetitive DNA that is specific to artiodactyls will be critical in the assembly of livestock genomes and studies of comparative genomics

    Single-photon threshold photoionization of NO

    Single‐photon threshold photoionization spectra for jet‐cooled NO have been measured for the v^+=0 and 1 vibrational levels of the X ^1Σ^+ ground state of NO^+. The NO^+ rotational state distribution for the v^+=0 level is shown to be perturbed by nearby autoionizing levels, whereas the v^+=1 level exhibits a cation rotational distribution which is in near quantitative agreement with calculated spectra near threshold. Only small changes in total angular momentum are observed (‖ΔJ‖=‖J^+ − J‘‖≤5/2) even though a wide range of photoelectron angular momenta (l=0–3) are predicted to contribute to the near‐threshold photoelectron continua. The present results are also discussed in light of recently published two‐photon threshold photoionization spectra of NO which exhibit nearly identical NO^+ rotational state distributions

    A high density recombination map of the pig reveals a correlation between sex-specific recombination and GC content

    Background: The availability of a high-density SNP genotyping chip and a reference genome sequence of the pig (Sus scrofa) enabled the construction of a high-density linkage map. A high-density linkage map is an essential tool for further fine-mapping of quantitative trait loci (QTL) for a variety of traits in the pig and for a better understanding of mechanisms underlying genome evolution. Results: Four different pig pedigrees were genotyped using the Illumina PorcineSNP60 BeadChip. Recombination maps for the autosomes were computed for each individual pedigree using a common set of markers. The resulting genetic maps comprised 38,599 SNPs, including 928 SNPs not positioned on a chromosome in the current assembly of the pig genome (build 10.2). The total genetic length varied according to the pedigree, from 1797 to 2149 cM. Female maps were longer than male maps, with a notable exception for SSC1 where male maps are characterized by a higher recombination rate than females in the region between 91–250 Mb. The recombination rates varied among chromosomes and along individual chromosomes, regions with high recombination rates tending to cluster close to the chromosome ends, irrespective of the position of the centromere. Correlations between main sequence features and recombination rates were investigated and significant correlations were obtained for all the studied motifs. Regions characterized by high recombination rates were enriched for specific GC-rich sequence motifs as compared to low recombinant regions. These correlations were higher in females than in males, and females were found to be more recombinant than males at regions where the GC content was greater than 0.4. Conclusions: The analysis of the recombination rate along the pig genome highlighted that the regions exhibiting higher levels of recombination tend to cluster around the ends of the chromosomes irrespective of the location of the centromere. Major sex-differences in recombination were observed: females had a higher recombination rate within GC-rich regions and exhibited a stronger correlation between recombination rates and specific sequence features.

    MicroRNA transcriptome profiles during swine skeletal muscle development

    Background: MicroRNA (miR) are a class of small RNAs that regulate gene expression by inhibiting translation of protein encoding transcripts. To evaluate the role of miR in skeletal muscle of swine, global microRNA abundance was measured at specific developmental stages including proliferating satellite cells, three stages of fetal growth, day-old neonate, and the adult. Results: Twelve potential novel miR were detected that did not match previously reported sequences. In addition, a number of miR previously reported to be expressed in mammalian muscle were detected, having a variety of abundance patterns through muscle development. Muscle-specific miR-206 was nearly absent in proliferating satellite cells in culture, but was the highest abundant miR at other time points evaluated. In addition, miR-1 was moderately abundant throughout developmental stages with highest abundance in the adult. In contrast, miR-133 was moderately abundant in adult muscle and either not detectable or lowly abundant throughout fetal and neonate development. Changes in abundance of ubiquitously expressed miR were also observed. MiR-432 abundance was highest at the earliest stage of fetal development tested (60 day-old fetus) and decreased throughout development to the adult. Conversely, miR-24 and miR-27 exhibited greatest abundance in proliferating satellite cells and the adult, while abundance of miR-368, miR-376, and miR-423-5p was greatest in the neonate. Conclusion: These data present a complete set of transcriptome profiles to evaluate miR abundance at specific stages of skeletal muscle growth in swine. Identification of these miR provides an initial group of miR that may play a vital role in muscle development and growth.

    Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    Background: The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings: A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance: Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pig
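The conversion success rate quoted above can be checked directly from the two locus counts given in the abstract. This is a quick arithmetic sketch; the variable names are illustrative only.

```python
scorable_loci = 62_621     # loci that could be reliably scored
polymorphic_loci = 58_994  # scorable loci that turned out to be polymorphic

# Conversion success rate: share of scorable loci that are polymorphic.
conversion_rate = polymorphic_loci / scorable_loci
print(f"{conversion_rate:.1%}")
```

The ratio comes to roughly 0.942, which rounds to the 94% reported.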

    Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

    Background: To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time-consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. Results: The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses, were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. Conclusion: The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionarily recent genome duplication would be desirable.

    Spectroscopy and dynamics of highly excited molecules

    This dissertation considers the high-lying electronic structure and fragmentation dynamics of the triatomics NO₂, N₂O, H₂O and the radical OH. The experimental techniques used to determine the high-lying electronic structure of N₂O include mass-selected resonantly-enhanced multi-photon ionization (REMPI) using coherent ultraviolet light. Dissociation dynamics are probed indirectly in N₂O through peak width analysis and directly in NO₂ by an additional laser which scanned the NO product. Photoionization dynamics and, in particular, the partitioning of angular momentum between the ionic core and the ejected electron, are studied with the aid of two powerful and practical techniques: (1) production of coherent vacuum ultraviolet radiation by frequency tripling the doubled output of a YAG-pumped dye laser in a free-jet expansion and (2) threshold photoionization by detecting electrons released via the application of a pulsed field. The major results include (1) newly assigned Rydberg states in N₂O where a valence picture was expected by some theoreticians to be appropriate, (2) the conclusion that rotational distributions of product NO from photodissociating NO₂ are independent of Rydberg structure in the dissociation continuum, (3) ionization potentials of significantly greater accuracy for N₂O, H₂O, and OH, and, finally, (4) significant evidence to support a theory of Rydberg state interactions which asymmetrically affect the photoionization rotational branch intensities

    Next-Generation Sequencing of the Porcine Skeletal Muscle Transcriptome for Computational Prediction of MicroRNA Gene Targets

    Background: MicroRNA are a class of small RNAs that regulate gene expression by inhibiting translation of protein encoding transcripts through targeting of a microRNA-protein complex by base-pairing of the microRNA sequence to cognate recognition sequences in the 3′ untranslated region (UTR) of the mRNA. Target identification for a given microRNA sequence is generally accomplished by informatics analysis of predicted mRNA sequences present in the genome or in databases of transcript sequence for the tissue of interest. However, gene models for porcine skeletal muscle transcripts in current databases, specifically complete sequence of the 3′ UTR, are inadequate for this exercise. Methodology/Principal Findings: To provide data necessary to identify gene targets for microRNA in porcine skeletal muscle, normalized cDNA libraries were sequenced using Roche 454 GS-FLX pyrosequencing and de novo assembly of transcripts enriched in the 3′ UTR was performed using the MIRA sequence assembly program. Over 725 million bases of sequence were generated, which assembled into 18,202 contigs. Sequence reads were mapped to a 3′ UTR database containing porcine sequences. The 3′ UTR that mapped to the database were examined to predict targets for previously identified microRNA that had been separately sequenced from the same porcine muscle sample used to generate the cDNA libraries. For genes with microRNA-targeted 3′ UTR, KEGG pathways were computationally determined in order to identify potential functional effects of these microRNA-targeted transcripts. Conclusions: Through next-generation sequencing of transcripts expressed in skeletal muscle, mapping reads to a 3′ UTR database, and prediction of microRNA target sites in the 3′ UTR, our results identified genes expressed in porcine skeletal muscle and predicted the microRNA that target these genes. 
Additionally, identification of pathways regulated by these microRNA-targeted genes provides us with a set of genes that can be further evaluated for their potential role in skeletal muscle development and growth