48 research outputs found

    Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population.</p> <p>Results</p> <p>The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme <it>Hae</it>III; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts.</p> <p>Conclusion</p> <p>The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionary recent genome duplication would be desirable.</p

    New Alzheimer Amyloid β Responsive Genes Identified in Human Neuroblastoma Cells by Hierarchical Clustering

    Get PDF
    Alzheimer's disease (AD) is characterized by neuronal degeneration and cell loss. Aβ42, in contrast to Aβ40, is thought to be the pathogenic form triggering the pathological cascade in AD. In order to unravel overall gene regulation we monitored the transcriptomic responses to increased or decreased Aβ40 and Aβ42 levels, generated and derived from its precursor C99 (C-terminal fragment of APP comprising 99 amino acids) in human neuroblastoma cells. We identified fourteen differentially expressed transcripts by hierarchical clustering and discussed their involvement in AD. These fourteen transcripts were grouped into two main clusters each showing distinct differential expression patterns depending on Aβ40 and Aβ42 levels. Among these transcripts we discovered an unexpected inverse and strong differential expression of neurogenin 2 (NEUROG2) and KIAA0125 in all examined cell clones. C99-overexpression had a similar effect on NEUROG2 and KIAA0125 expression as a decreased Aβ42/Aβ40 ratio. Importantly however, an increased Aβ42/Aβ40 ratio, which is typical of AD, had an inverse expression pattern of NEUROG2 and KIAA0125: An increased Aβ42/Aβ40 ratio up-regulated NEUROG2, but down-regulated KIAA0125, whereas the opposite regulation pattern was observed for a decreased Aβ42/Aβ40 ratio. We discuss the possibilities that the so far uncharacterized KIAA0125 might be a counter player of NEUROG2 and that KIAA0125 could be involved in neurogenesis, due to the involvement of NEUROG2 in developmental neural processes

    Discovery of Pod Shatter-Resistant Associated SNPs by Deep Sequencing of a Representative Library Followed by Bulk Segregant Analysis in Rapeseed

    Get PDF
    Background: Single nucleotide polymorphisms (SNPs) are an important class of genetic marker for target gene mapping. As of yet, there is no rapid and effective method to identify SNPs linked with agronomic traits in rapeseed and other crop species. Methodology/Principal Findings: We demonstrate a novel method for identifying SNP markers in rapeseed by deep sequencing a representative library and performing bulk segregant analysis. With this method, SNPs associated with rapeseed pod shatter-resistance were discovered. Firstly, a reduced representation of the rapeseed genome was used. Genomic fragments ranging from 450–550 bp were prepared from the susceptible bulk (ten F2 plants with the silique shattering resistance index, SSRI,0.10) and the resistance bulk (ten F2 plants with SSRI.0.90), and also Solexa sequencingproduced 90 bp reads. Approximately 50 million of these sequence reads were assembled into contigs to a depth of 20-fold coverage. Secondly, 60,396 ‘simple SNPs ’ were identified, and the statistical significance was evaluated using Fisher’s exact test. There were 70 associated SNPs whose –log10p value over 16 were selected to be further analyzed. The distribution of these SNPs appeared a tight cluster, which consisted of 14 associated SNPs within a 396 kb region on chromosome A09. Our evidence indicates that this region contains a major quantitative trait locus (QTL). Finally, two associated SNPs from this region were mapped on a major QTL region

    Genome wide SNP discovery, analysis and evaluation in mallard (Anas platyrhynchos)

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Next generation sequencing technologies allow to obtain at low cost the genomic sequence information that currently lacks for most economically and ecologically important organisms. For the mallard duck genomic data is limited. The mallard is, besides a species of large agricultural and societal importance, also the focal species when it comes to long distance dispersal of Avian Influenza. For large scale identification of SNPs we performed Illumina sequencing of wild mallard DNA and compared our data with ongoing genome and EST sequencing of domesticated conspecifics. This is the first study of its kind for waterfowl.</p> <p>Results</p> <p>More than one billion base pairs of sequence information were generated resulting in a 16× coverage of a reduced representation library of the mallard genome. Sequence reads were aligned to a draft domesticated duck reference genome and allowed for the detection of over 122,000 SNPs within our mallard sequence dataset. In addition, almost 62,000 nucleotide positions on the domesticated duck reference showed a different nucleotide compared to wild mallard. Approximately 20,000 SNPs identified within our data were shared with SNPs identified in the sequenced domestic duck or in EST sequencing projects. The shared SNPs were considered to be highly reliable and were used to benchmark non-shared SNPs for quality. Genotyping of a representative sample of 364 SNPs resulted in a SNP conversion rate of 99.7%. The correlation of the minor allele count and observed minor allele frequency in the SNP discovery pool was 0.72.</p> <p>Conclusion</p> <p>We identified almost 150,000 SNPs in wild mallards that will likely yield good results in genotyping. Of these, ~101,000 SNPs were detected within our wild mallard sequences and ~49,000 were detected between wild and domesticated duck data. In the ~101,000 SNPs we found a subset of ~20,000 SNPs shared between wild mallards and the sequenced domesticated duck suggesting a low genetic divergence. Comparison of quality metrics between the total SNP set (122,000 + 62,000 = 184,000 SNPs) and the validated subset shows similar characteristics for both sets. This indicates that we have detected a large amount (~150,000) of accurately inferred mallard SNPs, which will benefit bird evolutionary studies, ecological studies (e.g. disentangling migratory connectivity) and industrial breeding programs.</p

    Single-nucleotide polymorphism discovery by high-throughput sequencing in sorghum

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Eight diverse sorghum (<it>Sorghum bicolor </it>L. Moench) accessions were subjected to short-read genome sequencing to characterize the distribution of single-nucleotide polymorphisms (SNPs). Two strategies were used for DNA library preparation. Missing SNP genotype data were imputed by local haplotype comparison. The effect of library type and genomic diversity on SNP discovery and imputation are evaluated.</p> <p>Results</p> <p>Alignment of eight genome equivalents (6 Gb) to the public reference genome revealed 283,000 SNPs at ≥82% confirmation probability. Sequencing from libraries constructed to limit sequencing to start at defined restriction sites led to genotyping 10-fold more SNPs in all 8 accessions, and correctly imputing 11% more missing data, than from semirandom libraries. The SNP yield advantage of the reduced-representation method was less than expected, since up to one fifth of reads started at noncanonical restriction sites and up to one third of restriction sites predicted <it>in silico </it>to yield unique alignments were not sampled at near-saturation. For imputation accuracy, the availability of a genomically similar accession in the germplasm panel was more important than panel size or sequencing coverage.</p> <p>Conclusions</p> <p>A sequence quantity of 3 million 50-base reads per accession using a <it>Bsr</it>FI library would conservatively provide satisfactory genotyping of 96,000 sorghum SNPs. For most reliable SNP-genotype imputation in shallowly sequenced genomes, germplasm panels should consist of pairs or groups of genomically similar entries. These results may help in designing strategies for economical genotyping-by-sequencing of large numbers of plant accessions.</p

    Genome-Wide Polymorphism and Comparative Analyses in the White-Tailed Deer (Odocoileus virginianus): A Model for Conservation Genomics

    Get PDF
    The white-tailed deer (Odocoileus virginianus) represents one of the most successful and widely distributed large mammal species within North America, yet very little nucleotide sequence information is available. We utilized massively parallel pyrosequencing of a reduced representation library (RRL) and a random shotgun library (RSL) to generate a complete mitochondrial genome sequence and identify a large number of putative single nucleotide polymorphisms (SNPs) distributed throughout the white-tailed deer nuclear and mitochondrial genomes. A SNP validation study designed to test specific classes of putative SNPs provides evidence for as many as 10,476 genome-wide SNPs in the current dataset. Based on cytogenetic evidence for homology between cow (Bos taurus) and white-tailed deer chromosomes, we demonstrate that a divergent genome may be used for estimating the relative distribution and density of de novo sequence contigs as well as putative SNPs for species without draft genome assemblies. Our approach demonstrates that bioinformatic tools developed for model or agriculturally important species may be leveraged to support next-generation research programs for species of biological, ecological and evolutionary importance. We also provide a functional annotation analysis for the de novo sequence contigs assembled from white-tailed deer pyrosequencing reads, a mitochondrial phylogeny involving 13,722 nucleotide positions for 10 unique species of Cervidae, and a median joining haplotype network as a putative representation of mitochondrial evolution in O. virginianus. The results of this study are expected to provide a detailed template enabling genome-wide sequence-based studies of threatened, endangered or conservationally important non-model organisms

    Fosmid library end sequencing reveals a rarely known genome structure of marine shrimp Penaeus monodon

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The black tiger shrimp (<it>Penaeus monodon</it>) is one of the most important aquaculture species in the world, representing the crustacean lineage which possesses the greatest species diversity among marine invertebrates. Yet, we barely know anything about their genomic structure. To understand the organization and evolution of the <it>P. monodon </it>genome, a fosmid library consisting of 288,000 colonies and was constructed, equivalent to 5.3-fold coverage of the 2.17 Gb genome. Approximately 11.1 Mb of fosmid end sequences (FESs) from 20,926 non-redundant reads representing 0.45% of the <it>P. monodon </it>genome were obtained for repetitive and protein-coding sequence analyses.</p> <p>Results</p> <p>We found that microsatellite sequences were highly abundant in the <it>P. monodon </it>genome, comprising 8.3% of the total length. The density and the average length of microsatellites were evidently higher in comparison to those of other taxa. AT-rich microsatellite motifs, especially poly (AT) and poly (AAT), were the most abundant. High abundance of microsatellite sequences were also found in the transcribed regions. Furthermore, <it>via </it>self-BlastN analysis we identified 103 novel repetitive element families which were categorized into four groups, <it>i.e</it>., 33 WSSV-like repeats, 14 retrotransposons, 5 gene-like repeats, and 51 unannotated repeats. Overall, various types of repeats comprise 51.18% of the <it>P. monodon </it>genome in length. Approximately 7.4% of the FESs contained protein-coding sequences, and the Inhibitor of Apoptosis Protein (IAP) gene and the Innexin 3 gene homologues appear to be present in high abundance in the <it>P. monodon </it>genome.</p> <p>Conclusions</p> <p>The redundancy of various repeat types in the <it>P. monodon </it>genome illustrates its highly repetitive nature. In particular, long and dense microsatellite sequences as well as abundant WSSV-like sequences highlight the uniqueness of genome organization of penaeid shrimp from those of other taxa. These results provide substantial improvement to our current knowledge not only for shrimp but also for marine crustaceans of large genome size.</p

    Sequencing technologies and genome sequencing

    Get PDF
    The high-throughput - next generation sequencing (HT-NGS) technologies are currently the hottest topic in the field of human and animals genomics researches, which can produce over 100 times more data compared to the most sophisticated capillary sequencers based on the Sanger method. With the ongoing developments of high throughput sequencing machines and advancement of modern bioinformatics tools at unprecedented pace, the target goal of sequencing individual genomes of living organism at a cost of $1,000 each is seemed to be realistically feasible in the near future. In the relatively short time frame since 2005, the HT-NGS technologies are revolutionizing the human and animal genome researches by analysis of chromatin immunoprecipitation coupled to DNA microarray (ChIP-chip) or sequencing (ChIP-seq), RNA sequencing (RNA-seq), whole genome genotyping, genome wide structural variation, de novo assembling and re-assembling of genome, mutation detection and carrier screening, detection of inherited disorders and complex human diseases, DNA library preparation, paired ends and genomic captures, sequencing of mitochondrial genome and personal genomics. In this review, we addressed the important features of HT-NGS like, first generation DNA sequencers, birth of HT-NGS, second generation HT-NGS platforms, third generation HT-NGS platforms: including single molecule Heliscope™, SMRT™ and RNAP sequencers, Nanopore, Archon Genomics X PRIZE foundation, comparison of second and third HT-NGS platforms, applications, advances and future perspectives of sequencing technologies on human and animal genome research

    Protective Role of False Tendon in Subjects with Left Bundle Branch Block: A Virtual Population Study.

    Get PDF
    False tendons (FTs) are fibrous or fibromuscular bands that can be found in both the normal and abnormal human heart in various anatomical forms depending on their attachment points, tissue types, and geometrical properties. While FTs are widely considered to affect the function of the heart, their specific roles remain largely unclear and unexplored. In this paper, we present an in silico study of the ventricular activation time of the human heart in the presence of FTs. This study presents the first computational model of the human heart that includes a FT, Purkinje network, and papillary muscles. Based on this model, we perform simulations to investigate the effect of different types of FTs on hearts with the electrical conduction abnormality of a left bundle branch block (LBBB). We employ a virtual population of 70 human hearts derived from a statistical atlas, and run a total of 560 simulations to assess ventricular activation time with different FT configurations. The obtained results indicate that, in the presence of a LBBB, the FT reduces the total activation time that is abnormally augmented due to a branch block, to such an extent that surgical implant of cardiac resynchronisation devices might not be recommended by international guidelines. Specifically, the simulation results show that FTs reduce the QRS duration at least 10 ms in 80% of hearts, and up to 45 ms for FTs connecting to the ventricular free wall, suggesting a significant reduction of cardiovascular mortality risk. In further simulation studies we show the reduction in the QRS duration is more sensitive to the shape of the heart then the size of the heart or the exact location of the FT. Finally, the model suggests that FTs may contribute to reducing the activation time difference between the left and right ventricles from 12 ms to 4 ms. We conclude that FTs may provide an alternative conduction pathway that compensates for the propagation delay caused by the LBBB. Further investigation is needed to quantify the clinical impact of FTs on cardiovascular mortality risk
    corecore