313,848 research outputs found

    GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data

    Get PDF
    Background: With its simple library preparation and robust approach to genome reduction, genotyping-by-sequencing (GBS) is a flexible and cost-effective strategy for SNP discovery and genotyping, provided an appropriate reference genome is available. For resource-limited curation, research, and breeding programs of underutilized plant genetic resources, however, even low-depth references may not be within reach, despite declining sequencing costs. Such programs would find value in an open-source bioinformatics pipeline that can maximize GBS data usage and perform high-density SNP genotyping in the absence of a reference. Results: The GBS SNP-Calling Reference Optional Pipeline (GBS-SNP-CROP) developed and presented here adopts a clustering strategy to build a population-tailored “Mock Reference” from the same GBS data used for downstream SNP calling and genotyping. Designed for libraries of paired-end (PE) reads, GBS-SNP-CROP maximizes data usage by eliminating unnecessary data culling due to imposed read-length uniformity requirements. Using 150 bp PE reads from a GBS library of 48 accessions of tetraploid kiwiberry (Actinidia arguta), GBS-SNP-CROP yielded on average three times as many SNPs as TASSEL-GBS analyses (32 and 64 bp tag lengths) and over 18 times as many as TASSEL-UNEAK, with fewer genotyping errors in all cases, as evidenced by comparing the genotypic characterizations of biological replicates. Using the published reference genome of a related diploid species (A. chinensis), the reference-based version of GBS-SNP-CROP behaved similarly to TASSEL-GBS in terms of the number of SNPs called but had an improved read depth distribution and fewer genotyping errors. Our results also indicate that the sets of SNPs detected by the different pipelines above are largely orthogonal to one another; thus GBS-SNP-CROP may be used to augment the results of alternative analyses, whether or not a reference is available. Conclusions: By achieving high-density SNP genotyping in populations for which no reference genome is available, GBS-SNP-CROP is worth consideration by curators, researchers, and breeders of under-researched plant genetic resources. In cases where a reference is available, especially if from a related species or when the target population is particularly diverse, GBS-SNP-CROP may complement other reference-based pipelines by extracting more information per sequencing dollar spent. The current version of GBS-SNP-CROP is available at https://github.com/halelab/GBS-SNP-CROP.gi

    Time After Time: Notes on Delays In Spiking Neural P Systems

    Full text link
    Spiking Neural P systems, SNP systems for short, are biologically inspired computing devices based on how neurons perform computations. SNP systems use only one type of symbol, the spike, in the computations. Information is encoded in the time differences of spikes or the multiplicity of spikes produced at certain times. SNP systems with delays (associated with rules) and those without delays are two of several Turing complete SNP system variants in literature. In this work we investigate how restricted forms of SNP systems with delays can be simulated by SNP systems without delays. We show the simulations for the following spike routing constructs: sequential, iteration, join, and split.Comment: 11 pages, 9 figures, 4 lemmas, 1 theorem, preprint of Workshop on Computation: Theory and Practice 2012 at DLSU, Manila together with UP Diliman, DLSU, Tokyo Institute of Technology, and Osaka universit

    Does replication groups scoring reduce false positive rate in SNP interaction discovery?

    Get PDF
    BACKGROUNG. Computational methods that infer single nucleotide polymorphism (SNP) interactions from phenotype data may uncover new biological mechanisms in non-Mendelian diseases. However, practical aspects of such analysis face many problems. Present experimental studies typically use SNP arrays with hundreds of thousands of SNPs but record only hundreds of samples. Candidate SNP pairs inferred by interaction analysis may include a high proportion of false positives. Recently, Gayan et al. (2008) proposed to reduce the number of false positives by combining results of interaction analysis performed on subsets of data (replication groups), rather than analyzing the entire data set directly. If performing as hypothesized, replication groups scoring could improve interaction analysis and also any type of feature ranking and selection procedure in systems biology. Because Gayan et al. do not compare their approach to the standard interaction analysis techniques, we here investigate if replication groups indeed reduce the number of reported false positive interactions. RESULTS. A set of simulated and false interaction-imputed experimental SNP data sets were used to compare the inference of SNP-SNP interactions by means of replication groups to the standard approach where the entire data set was directly used to score all candidate SNP pairs. In all our experiments, the inference of interactions from the entire data set (e.g. without using the replication groups) reported fewer false positives. CONCLUSIONS. With respect to the direct scoring approach the utility of replication groups does not reduce false positive rates, and may, depending on the data set, often perform worse

    Non-stationary self-similar Gaussian processes as scaling limits of power law shot noise processes and generalizations of fractional Brownian motion

    Full text link
    We study shot noise processes with Poisson arrivals and non-stationary noises. The noises are conditionally independent given the arrival times, but the distribution of each noise does depend on its arrival time. We establish scaling limits for such shot noise processes in two situations: 1) the conditional variance functions of the noises have a power law and 2) the conditional noise distributions are piecewise. In both cases, the limit processes are self-similar Gaussian with nonstationary increments. Motivated by these processes, we introduce new classes of self-similar Gaussian processes with non-stationary increments, via the time-domain integral representation, which are natural generalizations of fractional Brownian motions.Published versio

    PTPN2 gene variants are associated with susceptibility to both Crohn's disease and ulcerative colitis supporting a common genetic disease background.

    Get PDF
    Genome-wide association studies identified PTPN2 (protein tyrosine phosphatase, non-receptor type 2) as susceptibility gene for inflammatory bowel diseases (IBD). However, the exact role of PTPN2 in Crohn's disease (CD) and ulcerative colitis (UC) and its phenotypic effect are unclear. We therefore performed a detailed genotype-phenotype and epistasis analysis of PTPN2 gene variants. Genomic DNA from 2131 individuals of Caucasian origin (905 patients with CD, 318 patients with UC, and 908 healthy, unrelated controls) was analyzed for two SNPs in the PTPN2 region (rs2542151, rs7234029) for which associations with IBD were found in previous studies in other cohorts. Our analysis revealed a significant association of PTPN2 SNP rs2542151 with both susceptibility to CD (p = 1.95×10⁻⁵; OR 1.49 [1.34-1.79]) and UC (p = 3.87×10⁻², OR 1.31 [1.02-1.68]). Moreover, PTPN2 SNP rs7234029 demonstrated a significant association with susceptibility to CD (p = 1.30×10⁻³; OR 1.35 [1.13-1.62]) and a trend towards association with UC (p = 7.53×10⁻²; OR 1.26 [0.98-1.62]). Genotype-phenotype analysis revealed an association of PTPN2 SNP rs7234029 with a stricturing disease phenotype (B2) in CD patients (p = 6.62×10⁻³). Epistasis analysis showed weak epistasis between the ATG16L1 SNP rs2241879 and PTPN2 SNP rs2542151 (p = 0.024) in CD and between ATG16L1 SNP rs4663396 and PTPN2 SNP rs7234029 (p = 4.68×10⁻³) in UC. There was no evidence of epistasis between PTPN2 and NOD2 and PTPN2 and IL23R. In silico analysis revealed that the SNP rs7234029 modulates potentially the binding sites of several transcription factors involved in inflammation including GATA-3, NF-κB, C/EBP, and E4BP4. Our data confirm the association of PTPN2 variants with susceptibility to both CD and UC, suggesting a common disease pathomechanism for these diseases. Given recent evidence that PTPN2 regulates autophagosome formation in intestinal epithelial cells, the potential link between PTPN2 and ATG16L1 should be further investigated

    Identification of SNPs in TG and EDG1 genes and their relationships with carcass traits in Korean cattle (Hanwoo)

    Get PDF
    Thyroglobulin (TG) gene was known to be regulated fat cell growth and differentiation and the endothelial differentiation sphingolipid G-protein-coupled receptor 1 (EDG1) gene involves blood vessel formation and known to be affecting carcass traits in beef cattle. The aim of this study was to identify the single nucleotide polymorphisms (SNPs) in both TG and EDG1 genes and to analyze the association with carcass traits in Korean cattle (Hanwoo). The T354C SNP in TG gene located at the 3’ flanking region and c.-312A>G SNP located at 3’-UTR of EDG1 gene were used for genotyping the animals using PCR-RFLP method. Three genotypes were identified in T354C SNP in TG gene and only two AA and AG genotypes were observed for the c.-312A>G SNP in EDG1 gene. The results indicated that T354C SNP in TG gene was not significantly associated with carcass traits. However, the c.-312A>G SNP in EDG1 gene had significant effects on backfat thickness (BF) and yield index (YI). These results may provide valuable information for further candidate gene studies affecting carcass traits in Korean cattle and may use as marker assisted selection for improving the quality of meat in Hanwoo. Key words : TG, EDG1, Carcass traits, Hanwo

    Investigations into the molecular effects of single nucleotide polymorphism

    Get PDF
    Objectives: DNA sequences are very rich in short repeats and their pattern can be altered by point mutations. We wanted to investigate the effect of single nucleotide polymorphism (SNP) on the pattern of short DNA repeats and its biological consequences. Methods: Analysis of the pattern of short DNA repeats of the Thy-1 sequence with and without SNP. Searching for DNA-binding factors in any region of significance. Results: Comparing the pattern of short repeats in the Thy-1 gene sequences of Turkish patients with ataxia telangiectasia (AT) with the `wild type' sequence from the DNA database, we identified a missing 8-bp repeat element due to an SNP in position 1271 (intron II) in AT-DNA sequences. Only the mutated sequence had the potential for the formation of a stem loop in DNA or pre-mRNA. In super-shift experiments we found that DNA oligomers covering the area of this SNP formed a complex with proteins amongst which we identified the proliferating cell nuclear antigen (PCNA) protein. Conclusion: SNPs have the potential to alter DNA or pre-mRNA conformation. Although no SNP-depeding formation of the DNA-protein complex was evident, future investigations could reveal differential molecular mechanisms of cellular regulation. Copyright (C) 2001 S. Karger AG, Basel

    Modeling Fault Propagation Paths in Power Systems: A New Framework Based on Event SNP Systems With Neurotransmitter Concentration

    Get PDF
    To reveal fault propagation paths is one of the most critical studies for the analysis of power system security; however, it is rather dif cult. This paper proposes a new framework for the fault propagation path modeling method of power systems based on membrane computing.We rst model the fault propagation paths by proposing the event spiking neural P systems (Ev-SNP systems) with neurotransmitter concentration, which can intuitively reveal the fault propagation path due to the ability of its graphics models and parallel knowledge reasoning. The neurotransmitter concentration is used to represent the probability and gravity degree of fault propagation among synapses. Then, to reduce the dimension of the Ev-SNP system and make them suitable for large-scale power systems, we propose a model reduction method for the Ev-SNP system and devise its simpli ed model by constructing single-input and single-output neurons, called reduction-SNP system (RSNP system). Moreover, we apply the RSNP system to the IEEE 14- and 118-bus systems to study their fault propagation paths. The proposed approach rst extends the SNP systems to a large-scaled application in critical infrastructures from a single element to a system-wise investigation as well as from the post-ante fault diagnosis to a new ex-ante fault propagation path prediction, and the simulation results show a new success and promising approach to the engineering domain
    corecore