253 research outputs found

    VerdictDB: Universalizing Approximate Query Processing

    Full text link
    Despite 25 years of research in academia, approximate query processing (AQP) has had little industrial adoption. One of the major causes of this slow adoption is the reluctance of traditional vendors to make radical changes to their legacy codebases, and the preoccupation of newer vendors (e.g., SQL-on-Hadoop products) with implementing standard features. Additionally, the few AQP engines that are available are each tied to a specific platform and require users to completely abandon their existing databases---an unrealistic expectation given the infancy of the AQP technology. Therefore, we argue that a universal solution is needed: a database-agnostic approximation engine that will widen the reach of this emerging technology across various platforms. Our proposal, called VerdictDB, uses a middleware architecture that requires no changes to the backend database, and thus, can work with all off-the-shelf engines. Operating at the driver-level, VerdictDB intercepts analytical queries issued to the database and rewrites them into another query that, if executed by any standard relational engine, will yield sufficient information for computing an approximate answer. VerdictDB uses the returned result set to compute an approximate answer and error estimates, which are then passed on to the user or application. However, lack of access to the query execution layer introduces significant challenges in terms of generality, correctness, and efficiency. This paper shows how VerdictDB overcomes these challenges and delivers up to 171×\times speedup (18.45×\times on average) for a variety of existing engines, such as Impala, Spark SQL, and Amazon Redshift, while incurring less than 2.6% relative error. VerdictDB is open-sourced under Apache License.Comment: Extended technical report of the paper that appeared in Proceedings of the 2018 International Conference on Management of Data, pp. 1461-1476. ACM, 201

    Database Learning: Toward a Database that Becomes Smarter Every Time

    Full text link
    In today's databases, previous query answers rarely benefit answering future queries. For the first time, to the best of our knowledge, we change this paradigm in an approximate query processing (AQP) context. We make the following observation: the answer to each query reveals some degree of knowledge about the answer to another query because their answers stem from the same underlying distribution that has produced the entire dataset. Exploiting and refining this knowledge should allow us to answer queries more analytically, rather than by reading enormous amounts of raw data. Also, processing more queries should continuously enhance our knowledge of the underlying distribution, and hence lead to increasingly faster response times for future queries. We call this novel idea---learning from past query answers---Database Learning. We exploit the principle of maximum entropy to produce answers, which are in expectation guaranteed to be more accurate than existing sample-based approximations. Empowered by this idea, we build a query engine on top of Spark SQL, called Verdict. We conduct extensive experiments on real-world query traces from a large customer of a major database vendor. Our results demonstrate that Verdict supports 73.7% of these queries, speeding them up by up to 23.0x for the same accuracy level compared to existing AQP systems.Comment: This manuscript is an extended report of the work published in ACM SIGMOD conference 201

    The analysis of the effect of tax on profitability indices in listed companies of Tehran Stock Exchange

    Get PDF
    Profitability is considered as the most complicated feature for a company to be understood and evaluated. These ratios included in profitability are applied for evaluating business capabilities and making the wages in comparison with all cost during a specific period of time. In a more accurate way, the ratios indicate the profitability of a company, having calculated the total costs and tax on revenue, operational efficiency, company pricing policies, assets profitability and company’s shareholders. The approach applied in this research is descriptive-analytic. Using the data of 28 companies listed in Tehran Stock Exchange from 2004 to 2010 and using panel data approach, the tax effects over the paid profitability indices were studied in this paper. The results achieved from all estimation cases point out a negative significant effects on various profitability indices. It should be mentioned that in order to relate the taxes to the profitability indices, the costs and the debts of a corporation can be referred. Results of the study indicated that the debts ratio to asset and the type of the industry showed a negative effect on profitability and capital ratio to asset and the size of the company indicated positive significant effects on profitability index

    The analysis of the effect of tax on profitability indices in listed companies of Tehran Stock Exchange

    Get PDF
    Profitability is considered as the most complicated feature for a company to be understood and evaluated. These ratios included in profitability are applied for evaluating business capabilities and making the wages in comparison with all cost during a specific period of time. In a more accurate way, the ratios indicate the profitability of a company, having calculated the total costs and tax on revenue, operational efficiency, company pricing policies, assets profitability and company’s shareholders. The approach applied in this research is descriptive-analytic. Using the data of 28 companies listed in Tehran Stock Exchange from 2004 to 2010 and using panel data approach, the tax effects over the paid profitability indices were studied in this paper. The results achieved from all estimation cases point out a negative significant effects on various profitability indices. It should be mentioned that in order to relate the taxes to the profitability indices, the costs and the debts of a corporation can be referred. Results of the study indicated that the debts ratio to asset and the type of the industry showed a negative effect on profitability and capital ratio to asset and the size of the company indicated positive significant effects on profitability index

    REMOVAL OF Cr(VI) FROM SIMULATED ELECTROPLATING WASTEWATER BY MAGNETITE NANOPARTICLES

    Get PDF
    In this study, the efficiency of magnetic nanoparticles for removal of hexavalent chromium from simulated electroplating wastewater was evaluated. The nanoparticles were prepared using the sol-gel method and were characterized by X-ray diffraction (XRD), X-ray fluorescence (XRF), a scanning electron microscopy energy dispersive X-ray analyzer (SEM-Edx), a particle sizer and a vibrating sample magnetometer (VSM). The results showed that synthesized nanoparticles were in the size range of 40-300 rim, had purity of about 90 percent, and had magnetization of 36.5 electromagnetic unit per gram (emu/g). In conditions including pH 2, Cr (VI) concentration of 10 mg/L, nanomagnetite concentration of 1 g/L, a shaking speed of 250 rpm and a 20 minute retention time, 82% of Cr(VI) was removed. Competition from common coexisting ions such as Na(+), Ni(2+), Cu(2+), NO(3)(-), SO(4)(2-), and Cl was negligible. The adsorption data was well fitted by the Freundlich isotherm. It was concluded that magnetite nanoparticles have considerable potential for removal of Cr(VI) from electroplating wastewaters

    Efecto de la aplicación foliar de selenio y zinc para aumentar los rendimientos cuantitativos y cualitativos de colza en diferentes fechas de siembra

    Get PDF
    The sowing date is an important factor for expanding the cultivated area of rapeseed and affects seed yield, oil content, and fatty acid compounds. Micronutrient elements play an important role in improving the vegetative and reproductive growth of the plant, especially under conditions of biological and environmental stresses. A two-year experiment (2014-2016) was performed to study the response of rapeseed genotypes to foliar application of micronutrients on different sowing dates. The treatments were arranged as a factorial-split plot in a randomized complete block design with three replicates. Three sowing dates of 7 (well-timed sowing date), 17, and 27 (delayed sowing dates) October and two levels of foliar application with pure water (control), selenium (1.5%), zinc (1.5%), and selenium+zinc (1.5%) were factorial in the main plots and five genotypes of SW102, Ahmadi, GKH2624, GK-Gabriella, and Okapi were randomized in the subplots (a total of 30 treatments). Seed yield, oil yield and content, oleic acid, and linoleic acid were reduced when rapeseeds were cultivated on 17 and 27 October, while the contents in palmitic, linolenic, and erucic acids, and glucosinolate increased (p < 0.01). a selenium+zinc treatment improved seed yield, oil content and yield (p < 0.01). The oil quality increased due to increased contents of oleic and linoleic acids under the selenium+zinc treatment (p < 0.01). The GK-Gabriella and GKH2624 genotypes are recommended to be sown on well-timed (7 October) and delayed sowing dates (17 and 27 October) and treated with selenium+zinc due to the higher oil yield, linoleic and oleic acids.La fecha de siembra es un factor importante para expandir el área cultivada de colza que afecta el rendimiento de la semilla, el contenido de aceite y la composición en ácidos grasos. Los micronutrientes juegan un papel importante en la mejora del crecimiento vegetativo y reproductivo de la planta, especialmente en condiciones de estrés biológico y ambiental. Se realizó un experimento de dos años (2014-2016) para estudiar la respuesta de los genotipos de colza a la aplicación foliar de micronutrientes en diferentes fechas de siembra. Los tratamientos se organizaron como una parcela dividida factorial en un diseño de bloques completos al azar con tres repeticiones. Tres fechas de siembra del 7 (fecha de siembra en el momento oportuno), 17 y 27 (fechas de siembra retrasadas) de octubre y dos niveles de aplicación foliar con agua pura (control), selenio (1,5%), zinc (1,5%) y selenio + zinc (1.5%) fueron factoriales en las parcelas principales y cinco genotipos de SW102, Ahmadi, GKH2624, GK-Gabriella y Okapi fueron aleatorizados en las subparcelas (un total de 30 tratamientos). El rendimiento de semilla, el contenido y rendimiento de aceite, los ácidos grasos oleico y linoleico se redujeron cuando se cultivaron semillas de colza los días 17 y 27 de octubre, mientras que los contenidos de los ácidos grasos palmítico, linolénico y erúcico y glucosinolato aumentaron (p <0,01). El tratamiento con selenio + zinc mejoró el rendimiento de semillas, el contenido de aceite y el rendimiento (p <0,01). La calidad del aceite aumentó debido al mayor contenido de ácidos oleico y linoleico bajo tratamiento con selenio + zinc (p <0.01). Se recomiendan los genotipos GK-Gabriella y GKH2624 sembrados en fechas oportunas (7 de octubre) y tardía (17 y 27 de octubre) y tratados con selenio + zinc, respectivamente, debido al mayor rendimiento de aceite y contenido de los ácidos linoleico y oleico

    Optimal Operation of Micro-grids Considering the Uncertainties of Demand and Renewable Energy Resources Generation

    Full text link
    Nowadays, due to technical and economic reasons, the distributed generation (DG) units are widely connected to the low and medium voltage network and created a new structure called micro-grid. Renewable energies (especially wind and solar) based DGs are one of the most important generations units among DG units. Because of stochastic behavior of these resources, the optimum and safe management and operation of micro-grids has become one of the research priorities for researchers. So, in this study, the optimal operation of a typical micro-grid is investigated in order to maximize the penetration of renewable energy sources with the lowest operation cost with respect to the limitations for the load supply and the distributed generation resources. The understudy micro-grid consists of diesel generator, battery, wind turbines and photovoltaic panels. The objective function comprises of fuel cost, start-up cost, spinning reserve cost, power purchasing cost from the upstream grid and the sales revenue of the power to the upstream grid. In this paper, the uncertainties of demand, wind speed and solar radiation are considered and the optimization will be made by using the GAMS software and mixed integer planning method (MIP).Article History: Received May 21, 2016; Received in revised form July 11, 2016; Accepted October 15, 2016; Available onlineHow to Cite This Article: Jasemi, M., Adabi, F., Mozafari, B., and Salahi, S. (2016) Optimal Operation of Micro-grids Considering the Uncertainties of Demand and Renewable Energy Resources Generation, Int. Journal of Renewable Energy Development, 5(3),233-248.http://dx.doi.org/10.14710/ijred.5.3.233-24

    Performance prediction for set similarity joins

    Full text link
    corecore