98 research outputs found

    Computing Local Sensitivities of Counting Queries with Joins

    Full text link
    Local sensitivity of a query Q given a database instance D, i.e. how much the output Q(D) changes when a tuple is added to D or deleted from D, has many applications including query analysis, outlier detection, and in differential privacy. However, it is NP-hard to find local sensitivity of a conjunctive query in terms of the size of the query, even for the class of acyclic queries. Although the complexity is polynomial when the query size is fixed, the naive algorithms are not efficient for large databases and queries involving multiple joins. In this paper, we present a novel approach to compute local sensitivity of counting queries involving join operations by tracking and summarizing tuple sensitivities -- the maximum change a tuple can cause in the query result when it is added or removed. We give algorithms for the sensitivity problem for full acyclic join queries using join trees, that run in polynomial time in both the size of the database and query for an interesting sub-class of queries, which we call 'doubly acyclic queries' that include path queries, and in polynomial time in combined complexity when the maximum degree in the join tree is bounded. Our algorithms can be extended to certain non-acyclic queries using generalized hypertree decompositions. We evaluate our approach experimentally, and show applications of our algorithms to obtain better results for differential privacy by orders of magnitude.Comment: To be published in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Dat

    Reconsideration of In-Silico siRNA Design Based on Feature Selection: A Cross-Platform Data Integration Perspective

    Get PDF
    RNA interference via exogenous short interference RNAs (siRNA) is increasingly more widely employed as a tool in gene function studies, drug target discovery and disease treatment. Currently there is a strong need for rational siRNA design to achieve more reliable and specific gene silencing; and to keep up with the increasing needs for a wider range of applications. While progress has been made in the ability to design siRNAs with specific targets, we are clearly at an infancy stage towards achieving rational design of siRNAs with high efficacy. Among the many obstacles to overcome, lack of general understanding of what sequence features of siRNAs may affect their silencing efficacy and of large-scale homogeneous data needed to carry out such association analyses represents two challenges. To address these issues, we investigated a feature-selection based in-silico siRNA design from a novel cross-platform data integration perspective. An integration analysis of 4,482 siRNAs from ten meta-datasets was conducted for ranking siRNA features, according to their possible importance to the silencing efficacy of siRNAs across heterogeneous data sources. Our ranking analysis revealed for the first time the most relevant features based on cross-platform experiments, which compares favorably with the traditional in-silico siRNA feature screening based on the small samples of individual platform data. We believe that our feature ranking analysis can offer more creditable suggestions to help improving the design of siRNA with specific silencing targets. Data and scripts are available at http://csbl.bmb.uga.edu/publications/materials/qiliu/siRNA.html

    On the concentrations of magnetically ordered clusters and paramagnetic centers per magnetic cluster in Ag/AgI-As2S3 glasses

    Get PDF
    In the present work the magnetic properties of Ag/AgI-As2S3 glasses are studied using magnetic susceptibility (MS) measurement by the Faraday method at the temperatures of 293 and 77 K. On the basis of the MS experimental data and the model based on the Langevin function, the concentrations of magnetically ordered clusters and paramagnetic centers per magnetic cluster for the investigated glasses are evaluated and the results obtained are discussed

    Discretization Provides a Conceptually Simple Tool to Build Expression Networks

    Get PDF
    Biomarker identification, using network methods, depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients on how relevant a gene-set is. For both of these requirements discretized data helps to first identify gene cliques, and then to stratify patients

    Identification of superior reference genes for data normalisation of expression studies via quantitative PCR in hybrid roses (Rosa hybrida)

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene expression studies are a prerequisite for understanding the biological function of genes. Because of its high sensitivity and easy use, quantitative PCR (qPCR) has become the gold standard for gene expression quantification. To normalise qPCR measurements between samples, the most prominent technique is the use of stably expressed endogenous control genes, the so called reference genes. However, recent studies show there is no universal reference gene for all biological questions. Roses are important ornamental plants for which there has been no evaluation of useful reference genes for gene expression studies.</p> <p>Results</p> <p>We used three different algorithms (BestKeeper, geNorm and NormFinder) to validate the expression stability of nine candidate reference genes in different rose tissues from three different genotypes of <it>Rosa hybrida </it>and in leaves treated with various stress factors. The candidate genes comprised the classical "housekeeping genes" (<it>Actin, EF-1Ξ±, GAPDH</it>, <it>Tubulin </it>and <it>Ubiquitin</it>), and genes showing stable expression in studies in <it>Arabidopsis </it>(<it>PP2A, SAND, TIP </it>and <it>UBC</it>). The programs identified no single gene that showed stable expression under all of the conditions tested, and the individual rankings of the genes differed between the algorithms. Nevertheless the new candidate genes, specifically, <it>PP2A </it>and <it>UBC</it>, were ranked higher as compared to the other traditional reference genes. In general, <it>Tubulin </it>showed the most variable expression and should be avoided as a reference gene.</p> <p>Conclusions</p> <p>Reference genes evaluated as suitable in experiments with <it>Arabidopsis thaliana </it>were stably expressed in roses under various experimental conditions. In most cases, these genes outperformed conventional reference genes, such as <it>EF1-Ξ± </it>and <it>Tubulin</it>. We identified <it>PP2A</it>, <it>SAND </it>and <it>UBC </it>as suitable reference genes, which in different combinations may be used for normalisation in expression analyses via qPCR for different rose tissues and stress treatments. However, the vast genetic variation found within the genus <it>Rosa</it>, including differences in ploidy levels, might also influence expression stability of reference genes, so that future research should also consider different genotypes and ploidy levels.</p

    MGEx-Udb: A Mammalian Uterus Database for Expression-Based Cataloguing of Genes across Conditions, Including Endometriosis and Cervical Cancer

    Get PDF
    Gene expression profiling of uterus tissue has been performed in various contexts, but a significant amount of the data remains underutilized as it is not covered by the existing general resources.). The database can be queried with gene names/IDs, sub-tissue locations, as well as various conditions such as the cervical cancer, endometrial cycles and disorders, and experimental treatments. Accordingly, the output would be a) transcribed and dormant genes listed for the queried condition/location, or b) expression profile of the gene of interest in various uterine conditions. The results also include the reliability score for the expression status of each gene. MGEx-Udb also provides information related to Gene Ontology annotations, protein-protein interactions, transcripts, promoters, and expression status by other sequencing techniques, and facilitates various other types of analysis of the individual genes or co-expressed gene clusters.In brief, MGEx-Udb enables easy cataloguing of co-expressed genes and also facilitates bio-marker discovery for various uterine conditions

    Reference gene validation for quantitative RT-PCR during biotic and abiotic stresses in Vitis vinifera

    Get PDF
    Grapevine is one of the most cultivated fruit crop worldwide with Vitis vinifera being the species with the highest economical importance. Being highly susceptible to fungal pathogens and increasingly affected by environmental factors, it has become an important agricultural research area, where gene expression analysis plays a fundamental role. Quantitative reverse transcription polymerase chain reaction (qRT-PCR) is currently amongst the most powerful techniques to perform gene expression studies. Nevertheless, accurate gene expression quantification strongly relies on appropriate reference gene selection for sample normalization. Concerning V. vinifera, limited information still exists as for which genes are the most suitable to be used as reference under particular experimental conditions. In this work, seven candidate genes were investigated for their stability in grapevine samples referring to four distinct stresses (Erysiphe necator, wounding and UV-C irradiation in leaves and Phaeomoniella chlamydospora colonization in wood). The expression stability was evaluated using geNorm, NormFinder and BestKeeper. In all cases, full agreement was not observed for the three methods. To provide comprehensive rankings integrating the three different programs, for each treatment, a consensus ranking was created using a non-weighted unsupervised rank aggregation method. According to the last, the three most suitable reference genes to be used in grapevine leaves, regardless of the stress, are UBC, VAG and PEP. For the P. chlamydospora treatment, EF1, CYP and UBC were the best scoring genes. Acquaintance of the most suitable reference genes to be used in grapevine samples can contribute for accurate gene expression quantification in forthcoming studiesinfo:eu-repo/semantics/publishedVersio

    Systems Biology of the qa Gene Cluster in Neurospora crassa

    Get PDF
    An ensemble of genetic networks that describe how the model fungal system, Neurospora crassa, utilizes quinic acid (QA) as a sole carbon source has been identified previously. A genetic network for QA metabolism involves the genes, qa-1F and qa-1S, that encode a transcriptional activator and repressor, respectively and structural genes, qa-2, qa-3, qa-4, qa-x, and qa-y. By a series of 4 separate and independent, model-guided, microarray experiments a total of 50 genes are identified as QA-responsive and hypothesized to be under QA-1F control and/or the control of a second QA-responsive transcription factor (NCU03643) both in the fungal binuclear Zn(II)2Cys6 cluster family. QA-1F regulation is not sufficient to explain the quantitative variation in expression profiles of the 50 QA-responsive genes. QA-responsive genes include genes with products in 8 mutually connected metabolic pathways with 7 of them one step removed from the tricarboxylic (TCA) Cycle and with 7 of them one step removed from glycolysis: (1) starch and sucrose metabolism; (2) glycolysis/glucanogenesis; (3) TCA Cycle; (4) butanoate metabolism; (5) pyruvate metabolism; (6) aromatic amino acid and QA metabolism; (7) valine, leucine, and isoleucine degradation; and (8) transport of sugars and amino acids. Gene products both in aromatic amino acid and QA metabolism and transport show an immediate response to shift to QA, while genes with products in the remaining 7 metabolic modules generally show a delayed response to shift to QA. The additional QA-responsive cutinase transcription factor-1Ξ² (NCU03643) is found to have a delayed response to shift to QA. The series of microarray experiments are used to expand the previously identified genetic network describing the qa gene cluster to include all 50 QA-responsive genes including the second transcription factor (NCU03643). These studies illustrate new methodologies from systems biology to guide model-driven discoveries about a core metabolic network involving carbon and amino acid metabolism in N. crassa
    • …
    corecore