151 research outputs found

    VerdictDB: Universalizing Approximate Query Processing

    Full text link
    Despite 25 years of research in academia, approximate query processing (AQP) has had little industrial adoption. One of the major causes of this slow adoption is the reluctance of traditional vendors to make radical changes to their legacy codebases, and the preoccupation of newer vendors (e.g., SQL-on-Hadoop products) with implementing standard features. Additionally, the few AQP engines that are available are each tied to a specific platform and require users to completely abandon their existing databases---an unrealistic expectation given the infancy of the AQP technology. Therefore, we argue that a universal solution is needed: a database-agnostic approximation engine that will widen the reach of this emerging technology across various platforms. Our proposal, called VerdictDB, uses a middleware architecture that requires no changes to the backend database, and thus, can work with all off-the-shelf engines. Operating at the driver-level, VerdictDB intercepts analytical queries issued to the database and rewrites them into another query that, if executed by any standard relational engine, will yield sufficient information for computing an approximate answer. VerdictDB uses the returned result set to compute an approximate answer and error estimates, which are then passed on to the user or application. However, lack of access to the query execution layer introduces significant challenges in terms of generality, correctness, and efficiency. This paper shows how VerdictDB overcomes these challenges and delivers up to 171×\times speedup (18.45×\times on average) for a variety of existing engines, such as Impala, Spark SQL, and Amazon Redshift, while incurring less than 2.6% relative error. VerdictDB is open-sourced under Apache License.Comment: Extended technical report of the paper that appeared in Proceedings of the 2018 International Conference on Management of Data, pp. 1461-1476. ACM, 201

    Database Learning: Toward a Database that Becomes Smarter Every Time

    Full text link
    In today's databases, previous query answers rarely benefit answering future queries. For the first time, to the best of our knowledge, we change this paradigm in an approximate query processing (AQP) context. We make the following observation: the answer to each query reveals some degree of knowledge about the answer to another query because their answers stem from the same underlying distribution that has produced the entire dataset. Exploiting and refining this knowledge should allow us to answer queries more analytically, rather than by reading enormous amounts of raw data. Also, processing more queries should continuously enhance our knowledge of the underlying distribution, and hence lead to increasingly faster response times for future queries. We call this novel idea---learning from past query answers---Database Learning. We exploit the principle of maximum entropy to produce answers, which are in expectation guaranteed to be more accurate than existing sample-based approximations. Empowered by this idea, we build a query engine on top of Spark SQL, called Verdict. We conduct extensive experiments on real-world query traces from a large customer of a major database vendor. Our results demonstrate that Verdict supports 73.7% of these queries, speeding them up by up to 23.0x for the same accuracy level compared to existing AQP systems.Comment: This manuscript is an extended report of the work published in ACM SIGMOD conference 201

    Complete removal of pathogenic bacteria from drinking water using nano silver-coated cylindrical polypropylene filters.

    Get PDF
    An attempt was made to investigate the removal of Escherichia coli bacteria from drinking water using nano silver-coated polypropylene water filter. For the production of nano silver filters, a modified Balzers 760 machine equipped with an electron beam gun was used. The nano-silver particles were made by electron beam bombardment of the silver metal, which were subsequently deposited on the polypropylene filters evenly. The thickness of the nano layer coated on the filters was 35.0 nm. The nano silver-coated filters were characterized using scanning electron microscopy, X-ray diffraction, transmission electron microscopy, and atomic force microscopy. The antibacterial efficiency of the filters was evaluated using the membrane filter method. At a flow rate of 3 l/h, the output count of E. coli was zero after 7 h filtration when the input water had a bacterial load of 103 colony-forming units (cfu) per milliliter. The inductively coupled plasma/mass spectrometry (ICP/MS) results showed that the 35 nm layer of the silver nanoparticles were stable on the water filter and were not washed away by water flow even after 72 h

    BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees

    Full text link
    The rising volume of datasets has made training machine learning (ML) models a major computational cost in the enterprise. Given the iterative nature of model and parameter tuning, many analysts use a small sample of their entire data during their initial stage of analysis to make quick decisions (e.g., what features or hyperparameters to use) and use the entire dataset only in later stages (i.e., when they have converged to a specific model). This sampling, however, is performed in an ad-hoc fashion. Most practitioners cannot precisely capture the effect of sampling on the quality of their model, and eventually on their decision-making process during the tuning phase. Moreover, without systematic support for sampling operators, many optimizations and reuse opportunities are lost. In this paper, we introduce BlinkML, a system for fast, quality-guaranteed ML training. BlinkML allows users to make error-computation tradeoffs: instead of training a model on their full data (i.e., full model), BlinkML can quickly train an approximate model with quality guarantees using a sample. The quality guarantees ensure that, with high probability, the approximate model makes the same predictions as the full model. BlinkML currently supports any ML model that relies on maximum likelihood estimation (MLE), which includes Generalized Linear Models (e.g., linear regression, logistic regression, max entropy classifier, Poisson regression) as well as PPCA (Probabilistic Principal Component Analysis). Our experiments show that BlinkML can speed up the training of large-scale ML tasks by 6.26x-629x while guaranteeing the same predictions, with 95% probability, as the full model.Comment: 22 pages, SIGMOD 201

    Activity and Interactions of Liposomal Antibiotics in Presence of Polyanions and Sputum of Patients with Cystic Fibrosis

    Get PDF
    BACKGROUND:To compare the effectiveness of liposomal tobramycin or polymyxin B against Pseudomonas aeruginosa in the Cystic Fibrosis (CF) sputum and its inhibition by common polyanionic components such as DNA, F-actin, lipopolysaccharides (LPS), and lipoteichoic acid (LTA). METHODOLOGY:Liposomal formulations were prepared from a mixture of 1,2-Dimyristoyl-sn-Glycero-3-Phosphocholine (DMPC) or 1,2-Dipalmitoyl-sn-Glycero-3-Phosphocholine (DPPC) and Cholesterol (Chol), respectively. Stability of the formulations in different biological milieus and antibacterial activities compared to conventional forms in the presence of the aforementioned inhibitory factors or CF sputum were evaluated. RESULTS:The formulations were stable in all conditions tested with no significant differences compared to the controls. Inhibition of antibiotic formulations by DNA/F-actin and LPS/LTA was concentration dependent. DNA/F-actin (125 to 1000 mg/L) and LPS/LTA (1 to 1000 mg/L) inhibited conventional tobramycin bioactivity, whereas, liposome-entrapped tobramycin was inhibited at higher concentrations--DNA/F-actin (500 to 1000 mg/L) and LPS/LTA (100 to 1000 mg/L). Neither polymyxin B formulation was inactivated by DNA/F-actin, but LPS/LTA (1 to 1000 mg/L) inhibited the drug in conventional form completely and higher concentrations of the inhibitors (100 to 1000 mg/L) was required to inhibit the liposome-entrapped polymyxin B. Co-incubation with inhibitory factors (1000 mg/L) increased conventional (16-fold) and liposomal (4-fold) tobramycin minimum bactericidal concentrations (MBCs), while both polymyxin B formulations were inhibited 64-fold. CONCLUSIONS:Liposome-entrapment reduced antibiotic inhibition up to 100-fold and the CFU of endogenous P. aeruginosa in sputum by 4-fold compared to the conventional antibiotic, suggesting their potential applications in CF lung infections

    Human Induced Pluripotent Stem Cells Differentiation into Oligodendrocyte Progenitors and Transplantation in a Rat Model of Optic Chiasm Demyelination

    Get PDF
    BACKGROUND: This study aims to differentiate human induced pluripotent stem cells (hiPSCs) into oligodendrocyte precursors and assess their recovery potential in a demyelinated optic chiasm model in rats. METHODOLOGY/PRINCIPAL FINDINGS: We generated a cell population of oligodendrocyte progenitors from hiPSCs by using embryoid body formation in a defined medium supplemented with a combination of factors, positive selection and mechanical enrichment. Real-time polymerase chain reaction and immunofluorescence analyses showed that stage-specific markers, Olig2, Sox10, NG2, PDGFRα, O4, A2B5, GalC, and MBP were expressed following the differentiation procedure, and enrichment of the oligodendrocyte lineage. These results are comparable with the expression of stage-specific markers in human embryonic stem cell-derived oligodendrocyte lineage cells. Transplantation of hiPSC-derived oligodendrocyte progenitors into the lysolecithin-induced demyelinated optic chiasm of the rat model resulted in recovery from symptoms, and integration and differentiation into oligodendrocytes were detected by immunohistofluorescence staining against PLP and MBP, and measurements of the visual evoked potentials. CONCLUSIONS/SIGNIFICANCE: These results showed that oligodendrocyte progenitors generated efficiently from hiPSCs can be used in future biomedical studies once safety issues have been overcome

    Conventional and Dense Gas Techniques for the Production of Liposomes: A Review

    Get PDF
    The aim of this review paper is to compare the potential of various techniques developed for production of homogenous, stable liposomes. Traditional techniques, such as Bangham, detergent depletion, ether/ethanol injection, reverse-phase evaporation and emulsion methods, were compared with the recent advanced techniques developed for liposome formation. The major hurdles for scaling up the traditional methods are the consumption of large quantities of volatile organic solvent, the stability and homogeneity of the liposomal product, as well as the lengthy multiple steps involved. The new methods have been designed to alleviate the current issues for liposome formulation. Dense gas liposome techniques are still in their infancy, however they have remarkable advantages in reducing the use of organic solvents, providing fast, single-stage production and producing stable, uniform liposomes. Techniques such as the membrane contactor and heating methods are also promising as they eliminate the use of organic solvent, however high temperature is still required for processing

    Formation Of Supramolecular Structures By Negatively Charged Liposomes In The Presence Of Nucleic Acids And Divalent Cations

    No full text
    Cationic liposomes are being increasingly studied as delivery vehicles for bioactive agents such as DNA and other polynucleotides, The mechanism of interaction of DNA with liposomes and the organization of these interacting structures during and after the interaction are still poorly understood. Nucleic acids are known to induce aggregation and size enlargement of liposomes, In the case of phosphatidylcholine (PC) vesicles, these processes depend on the presence and concentration of divalent metal cations and the amount of cholesterol in the liposomes. In this study, anionic small unilamellar vesicles (SUV) and multilamellar vesicles (MLV) composed of dicetylphosphate (DCP):PC:cholesterol at 2:7:1 molar ratios were prepared and incubated with the DNA (from wheat) and Ca2+ (50 mM) at 25 degrees C with the aim of transferring the genetic material into the liposomes by inducing fusion of liposome-liposome aggregates created in the presence, and with the help, of DNA, The organization and the nature of the resultant liposome-DNA-Ca2+ complexes were investigated by scanning tunneling microscopy (STM) and fluorescence microscopy, Observations of complexes with similar appearances with both SUV and MLV, as shown by two quite different microscopic approaches, prove that the resultant forms are real and not artifacts of the methodology used. At this stage it is not clear whether the detected complexes represent an intermediate state before fusion of liposomes which will lead to engulfing of the genomic material by the fused liposomes, or the final form. In either case the structures consisting of some adhered or semifused liposomes bearing the nucleic acid seem to be candidates as vehicles for in-vitro and in-vivo transfection.WoSScopu
    corecore