Computing Local Sensitivities of Counting Queries with Joins
Local sensitivity of a query Q given a database instance D, i.e. how much the
output Q(D) changes when a tuple is added to D or deleted from D, has many
applications including query analysis, outlier detection, and in differential
privacy. However, it is NP-hard to find local sensitivity of a conjunctive
query in terms of the size of the query, even for the class of acyclic queries.
Although the complexity is polynomial when the query size is fixed, the naive
algorithms are not efficient for large databases and queries involving multiple
joins. In this paper, we present a novel approach to compute local sensitivity
of counting queries involving join operations by tracking and summarizing tuple
sensitivities -- the maximum change a tuple can cause in the query result when
it is added or removed. We give algorithms for the sensitivity problem for full
acyclic join queries using join trees, that run in polynomial time in both the
size of the database and query for an interesting sub-class of queries, which
we call 'doubly acyclic queries' that include path queries, and in polynomial
time in combined complexity when the maximum degree in the join tree is
bounded. Our algorithms can be extended to certain non-acyclic queries using
generalized hypertree decompositions. We evaluate our approach experimentally,
and show applications of our algorithms to obtain better results for
differential privacy by orders of magnitude. Comment: To be published in Proceedings of the 2020 ACM SIGMOD International
Conference on Management of Data.
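The definition of local sensitivity given in this abstract can be illustrated with a brute-force sketch. This is not the paper's join-tree algorithm; it is the naive baseline the abstract calls inefficient, shown here for a two-relation counting join. The relation shapes R(a, b) and S(b, c), the bag semantics, the finite `domain` of candidate values, and all function names are assumptions made for illustration.

```python
from itertools import product


def join_count(R, S):
    """Count the tuples in the natural join R(a, b) JOIN S(b, c)."""
    return sum(1 for (_, b1) in R for (b2, _) in S if b1 == b2)


def local_sensitivity(R, S, domain):
    """Brute-force local sensitivity of the join count.

    Tries every single-tuple deletion and every single-tuple insertion
    (with attribute values drawn from the hypothetical finite `domain`)
    and returns the largest absolute change in the count.
    """
    base = join_count(R, S)
    best = 0
    # Deleting one existing tuple from R or from S.
    for i in range(len(R)):
        best = max(best, abs(base - join_count(R[:i] + R[i + 1:], S)))
    for i in range(len(S)):
        best = max(best, abs(base - join_count(R, S[:i] + S[i + 1:])))
    # Inserting one new tuple into R or into S.
    for t in product(domain, repeat=2):
        best = max(best, abs(join_count(R + [t], S) - base))
        best = max(best, abs(join_count(R, S + [t]) - base))
    return best


# Toy instance (made-up values, not from the paper).
R = [(1, 10), (2, 10), (3, 20)]
S = [(10, 7), (10, 8), (20, 9)]
domain = sorted({v for t in R + S for v in t})

print(join_count(R, S))
print(local_sensitivity(R, S, domain))
```

Each candidate change re-evaluates the whole join, so the cost grows quickly with database size and number of joins, which is the inefficiency the abstract attributes to naive algorithms.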
Reconsideration of In-Silico siRNA Design Based on Feature Selection: A Cross-Platform Data Integration Perspective
RNA interference via exogenous short interfering RNAs (siRNAs) is increasingly employed as a tool in gene function studies, drug target discovery and disease treatment. There is currently a strong need for rational siRNA design to achieve more reliable and specific gene silencing and to keep up with the growing range of applications. While progress has been made in designing siRNAs with specific targets, rational design of siRNAs with high efficacy is still in its infancy. Among the many obstacles to overcome, two stand out: the lack of a general understanding of which siRNA sequence features affect silencing efficacy, and the lack of the large-scale homogeneous data needed to carry out such association analyses. To address these issues, we investigated feature-selection-based in-silico siRNA design from a novel cross-platform data integration perspective. An integration analysis of 4,482 siRNAs from ten meta-datasets was conducted to rank siRNA features according to their possible importance to silencing efficacy across heterogeneous data sources. Our ranking analysis revealed, for the first time, the most relevant features based on cross-platform experiments, which compares favorably with traditional in-silico siRNA feature screening based on small samples of individual platform data. We believe that our feature ranking analysis can offer more credible suggestions to help improve the design of siRNAs with specific silencing targets. Data and scripts are available at http://csbl.bmb.uga.edu/publications/materials/qiliu/siRNA.html
On the concentrations of magnetically ordered clusters and paramagnetic centers per magnetic cluster in Ag/AgI-As2S3 glasses
In the present work, the magnetic properties of Ag/AgI-As2S3 glasses are studied using magnetic susceptibility (MS) measurements by the Faraday method at temperatures of 293 and 77 K. On the basis of the MS experimental data and a model based on the Langevin function, the concentrations of magnetically ordered clusters and of paramagnetic centers per magnetic cluster in the investigated glasses are evaluated, and the results obtained are discussed.
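For orientation, the Langevin function mentioned in this abstract has the standard textbook form below. The abstract does not state the exact model used, so the symbols here (N for the cluster concentration, \mu for the moment per cluster, H for the applied field, T for the temperature) are assumptions.

```latex
L(x) = \coth x - \frac{1}{x}, \qquad x = \frac{\mu H}{k_B T}, \qquad M = N \mu \, L(x)
% Small-field limit: L(x) \approx x/3, which gives the Curie-type susceptibility
\chi = \frac{M}{H} \approx \frac{N \mu^{2}}{3 k_B T}
```

Under these assumptions, susceptibilities measured at two temperatures, such as the 293 K and 77 K used here, give two equations from which N and \mu could in principle be estimated.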
Discretization Provides a Conceptually Simple Tool to Build Expression Networks
Biomarker identification using network methods depends on finding regular co-expression patterns; the overall connectivity is of greater importance than any single relationship. A second requirement is a simple algorithm for ranking patients by how relevant a gene set is to them. For both requirements, discretized data helps first to identify gene cliques and then to stratify patients.
Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk
Blood pressure is a heritable trait influenced by several biological pathways and responsive to environmental stimuli. Over one billion people worldwide have hypertension (≥140 mm Hg systolic blood pressure or ≥90 mm Hg diastolic blood pressure). Even small increments in blood pressure are associated with an increased risk of cardiovascular events. This genome-wide association study of systolic and diastolic blood pressure, which used a multi-stage design in 200,000 individuals of European descent, identified sixteen novel loci: six of these loci contain genes previously known or suspected to regulate blood pressure (GUCY1A3-GUCY1B3, NPR3-C5orf23, ADM, FURIN-FES, GOSR2, GNAS-EDN3); the other ten provide new clues to blood pressure physiology. A genetic risk score based on 29 genome-wide significant variants was associated with hypertension, left ventricular wall thickness, stroke and coronary artery disease, but not kidney disease or kidney function. We also observed associations with blood pressure in East Asian, South Asian and African ancestry individuals. Our findings provide new insights into the genetics and biology of blood pressure, and suggest potential novel therapeutic pathways for cardiovascular disease prevention. © 2011 Macmillan Publishers Limited. All rights reserved.
Identification of superior reference genes for data normalisation of expression studies via quantitative PCR in hybrid roses (Rosa hybrida)
Background: Gene expression studies are a prerequisite for understanding the biological function of genes. Because of its high sensitivity and ease of use, quantitative PCR (qPCR) has become the gold standard for gene expression quantification. To normalise qPCR measurements between samples, the most prominent technique is the use of stably expressed endogenous control genes, the so-called reference genes. However, recent studies show there is no universal reference gene for all biological questions. Roses are important ornamental plants for which there has been no evaluation of useful reference genes for gene expression studies. Results: We used three different algorithms (BestKeeper, geNorm and NormFinder) to validate the expression stability of nine candidate reference genes in different rose tissues from three different genotypes of Rosa hybrida and in leaves treated with various stress factors. The candidate genes comprised the classical "housekeeping genes" (Actin, EF-1α, GAPDH, Tubulin and Ubiquitin), and genes showing stable expression in studies in Arabidopsis (PP2A, SAND, TIP and UBC). The programs identified no single gene that showed stable expression under all of the conditions tested, and the individual rankings of the genes differed between the algorithms. Nevertheless, the new candidate genes, specifically PP2A and UBC, were ranked higher than the traditional reference genes. In general, Tubulin showed the most variable expression and should be avoided as a reference gene. Conclusions: Reference genes evaluated as suitable in experiments with Arabidopsis thaliana were stably expressed in roses under various experimental conditions. In most cases, these genes outperformed conventional reference genes, such as EF1-α and Tubulin. We identified PP2A, SAND and UBC as suitable reference genes, which in different combinations may be used for normalisation in expression analyses via qPCR for different rose tissues and stress treatments. However, the vast genetic variation found within the genus Rosa, including differences in ploidy levels, might also influence expression stability of reference genes, so that future research should also consider different genotypes and ploidy levels.
MGEx-Udb: A Mammalian Uterus Database for Expression-Based Cataloguing of Genes across Conditions, Including Endometriosis and Cervical Cancer
Gene expression profiling of uterus tissue has been performed in various contexts, but a significant amount of the data remains underutilized as it is not covered by the existing general resources. The MGEx-Udb database can be queried with gene names/IDs, sub-tissue locations, and various conditions such as cervical cancer, endometrial cycles and disorders, and experimental treatments. Accordingly, the output is either a) the transcribed and dormant genes listed for the queried condition/location, or b) the expression profile of the gene of interest in various uterine conditions. The results also include a reliability score for the expression status of each gene. MGEx-Udb also provides information related to Gene Ontology annotations, protein-protein interactions, transcripts, promoters, and expression status by other sequencing techniques, and facilitates various other types of analysis of individual genes or co-expressed gene clusters. In brief, MGEx-Udb enables easy cataloguing of co-expressed genes and also facilitates biomarker discovery for various uterine conditions.
Reference gene validation for quantitative RT-PCR during biotic and abiotic stresses in Vitis vinifera
Grapevine is one of the most cultivated fruit crops worldwide, with Vitis vinifera being the species with the highest
economic importance. Being highly susceptible to fungal pathogens and increasingly affected by environmental factors, it
has become an important agricultural research area, where gene expression analysis plays a fundamental role. Quantitative
reverse transcription polymerase chain reaction (qRT-PCR) is currently amongst the most powerful techniques to perform
gene expression studies. Nevertheless, accurate gene expression quantification strongly relies on appropriate reference
gene selection for sample normalization. Concerning V. vinifera, limited information still exists as to which genes are the
most suitable to be used as reference under particular experimental conditions. In this work, seven candidate genes were
investigated for their stability in grapevine samples referring to four distinct stresses (Erysiphe necator, wounding and UV-C
irradiation in leaves and Phaeomoniella chlamydospora colonization in wood). The expression stability was evaluated using
geNorm, NormFinder and BestKeeper. In all cases, full agreement was not observed for the three methods. To provide
comprehensive rankings integrating the three different programs, for each treatment, a consensus ranking was created
using a non-weighted unsupervised rank aggregation method. According to this consensus ranking, the three most suitable reference
genes to be used in grapevine leaves, regardless of the stress, are UBC, VAG and PEP. For the P. chlamydospora treatment,
EF1, CYP and UBC were the best scoring genes. Knowledge of the most suitable reference genes to be used in grapevine
samples can contribute to accurate gene expression quantification in forthcoming studies.
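The consensus-ranking step described in this abstract can be sketched with a simple unweighted position-sum (Borda-style) aggregation. The abstract does not name the aggregation method the authors used, so this is a swapped-in illustrative technique and may differ from theirs. The function name, the placeholder gene labels and the three input rankings are all hypothetical.

```python
def borda_consensus(rankings):
    """Merge several best-first rankings of the same items into one
    consensus ranking by summing each item's positions (lower is better).

    Every ranking is weighted equally; ties are broken alphabetically.
    """
    items = rankings[0]
    totals = {item: 0 for item in items}
    for ranking in rankings:
        for position, item in enumerate(ranking):
            totals[item] += position
    return sorted(items, key=lambda item: (totals[item], item))


# Hypothetical stability rankings from three programs (placeholder genes,
# not results from the paper).
genorm_rank = ["gene_A", "gene_B", "gene_C", "gene_D"]
normfinder_rank = ["gene_B", "gene_A", "gene_C", "gene_D"]
bestkeeper_rank = ["gene_A", "gene_C", "gene_B", "gene_D"]

consensus = borda_consensus([genorm_rank, normfinder_rank, bestkeeper_rank])
print(consensus)
```

A position sum is a simple way to reconcile programs that disagree, which the abstract reports for geNorm, NormFinder and BestKeeper. More elaborate aggregation schemes may produce different orderings on the same input.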