333 research outputs found

    MonetDB/DataCell: Online Analytics in a Streaming Column-Store

    Get PDF
    In DataCell, we design streaming functionalities in a mod- ern relational database kernel which targets big data analyt- ics. This includes exploitation of both its storage/execution engine and its optimizer infrastructure. We investigate the opportunities and challenges that arise with such a direction and we show that it carries significant advantages for mod- ern applications in need for online analytics such as web logs, network monitoring and scientific data management. The major challenge then becomes the efficient support for specialized stream features, e.g., multi-query processing and incremental window-based processing as well as exploiting standard DBMS functionalities in a streaming environment such as indexing. In this demo, we present the DataCell system, an exten- sion of the MonetDB open-source column-store for online an- alytics. The demo gives the user the opportunity to experi- ence the features of DataCell such as processing both stream and persistent data and performing window based process- ing. The demo provides a visual interface to monitor the critical system components, e.g., how query plans transform from typical DBMS query plans to online query plans, how data flows through the query plans as the streams evolve, how DataCell maintains intermediate results in columnar form to avoid repeated evaluation of the same stream por- tions, etc. The demo also provides the ability to interac- tively set the test scenarios regarding input data and various DataCell knobs

    Enhanced Stream Processing in a DBMS Kernel

    Get PDF
    Continuous query processing has emerged as a promising query processing paradigm with numerous applications. A recent development is the need to handle both streaming queries and typical one-time queries in the same application. For example, data warehousing can greatly benefit from the integration of stream semantics, i.e., online analysis of incoming data and combination with existing data. This is especially useful to provide low latency in data-intensive analysis in big data warehouses that are augmented with new data on a daily basis. However, state-of-the-art database technology cannot handle streams efficiently due to their "continuous" nature. At the same time, state-of-the-art stream technology is purely focused on stream applications. The research efforts are mostly geared towards the creation of specialized stream management systems built with a different philosophy than a DBMS. The drawback of this approach is the limited opportunities to exploit successful past data processing technology, e.g., query optimization techniques. For this new problem we need to combine the best of both worlds. Here we take a completely different route by designing a stream engine on top of an existing relational database kernel. This includes reuse of both its storage/execution engine and its optimizer infrastructure. The major challenge then becomes the efficient support for specialized stream features. This paper focuses on incremental window-based processing, arguably the most crucial stream-specific requirement. In order to maintain and reuse the generic storage and execution model of the DBMS, we elevate the problem at the query plan level. Proper op

    Column-store support for RDF data management: not all swans are white

    Get PDF
    This paper reports on the results of an independent evaluation of the techniques presented in the VLDB 2007 paper "Scalable Semantic Web Data Management Using Vertical Partitioning", authored by D. Abadi, A. Marcus, S. R. Madden, and K. Hollenbach. We revisit the proposed benchmark and examine both the data and query space coverage. The benchmark is extended to cover a larger portion of the query space in a canonical way. Repeatability of the experiments is assessed using the code base obtained from the authors. Inspired by the proposed vertically-partitioned storage solution for RDF data and the performance figures using a column-store, we conduct a complementary analy- sis of state-of-the-art RDF storage solutions. To this end, we employ MonetDB/SQL, a fully-functional open source column-store, and a well-known --- for its performance --- commercial row-store DBMS.We implement two relational RDF storage solutions – triple-store and vertically-partitioned --- in both systems. This allows us to expand the scope of with the performance characterization along both dimensions --- triple-store vs. vertically-partitioned and row-store vs. column-store --- individually, before analyzing their combined effects. A detailed report of the experimental test-bed, as well as an in-depth analysis of the parameters involved, clarify the scope of the solution originally presented and position the results in a broader context by covering more systems

    Замена электродвигателя ПЭН турбоприводом на Кемеровской ТЭЦ

    Get PDF
    В данной работе рассматривается возможность замены электродвигателя ПЭН турбоприводом на Кемеровской ТЭЦ, с установкой турбопривода на существующий фундамент. Целью работы является оценка возможности увеличения отпуска электроэнергии от станции в результате уменьшения затрат на собственные нужды и повышение маневренности ТЭЦ.In this paper we consider the possibility of replacing the turbine drive motor PEN to Kemerovo CHP , with the installation of turbine drive on the existing foundation. The aim is to assess the possibility of increasing the supply of electric power from the plant by reducing the costs of their own needs and improving maneuverability CHP

    Подбор механического метода борьбы с асфальтово-парафиновыми отложениями

    Get PDF
    В дипломной работе рассмотрены способы образования отложений парафина, механизмы образования АСПО, современные способы борьбы с отложениями, также проведен патентный поиск. Приведенный в дипломной работе расчет показывает, что для данной скважины применение скребков – центраторов целесообразно, условие прочности выполняется. Результатом проведенной дипломной работы является расчет на прочность штанг ШСНУ. В работе рассмотрено, как параметры скважины влияют на выпадение парафина из нефти.In the thesis work the methods of paraffin deposition, mechanisms of AFS, modern ways of dealing with deposits, also held a patent search. Powered by a research paper calculation shows that for this well use scrapers - centralizers appropriate strength condition is satisfied. The result of conducted research paper is to calculate the strength SHSNU rods. The paper considers how well the parameters affect the loss of paraffin oil

    Phase II study of continuous daily sunitinib dosing in patients with previously treated advanced non-small cell lung cancer

    Get PDF
    Background:Sunitinib malate (SUTENT) has promising single-agent activity given on Schedule 4/2 (4 weeks on treatment followed by 2 weeks off treatment) in advanced non-small cell lung cancer (NSCLC).Methods:We examined the activity of sunitinib on a continuous daily dosing (CDD) schedule in an open-label, multicentre phase II study in patients with previously treated, advanced NSCLC. Patients ⩾18 years with stage IIIB/IV NSCLC after failure with platinum-based chemotherapy, received sunitinib 37.5 mg per day. The primary end point was objective response rate (ORR). Secondary end points included progression-free survival (PFS), overall survival (OS), 1-year survival rate, and safety.Results:Of 47 patients receiving sunitinib, one patient achieved a confirmed partial response (ORR 2.1% (95% confidence interval (CI) 0.1, 11.3)) and 11 (23.4%) had stable disease (SD) ⩾8 weeks. Five patients had SD>6 months. Median PFS was 11.9 weeks (95% CI 8.6, 14.1) and median OS was 37.1 weeks (95% CI 31.1, 69.7). The 1-year survival probability was 38.4% (95% CI 24.2, 52.5). Treatment was generally well tolerated.Conclusions:The safety profile and time-to-event analyses, albeit relatively low response rate of 2%, suggest single-agent sunitinib on a CDD schedule may be a potential therapeutic agent for patients with advanced, refractory NSCLC

    A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics (Extended Version)

    Full text link
    There has been significant amount of excitement and recent work on GPU-based database systems. Previous work has claimed that these systems can perform orders of magnitude better than CPU-based database systems on analytical workloads such as those found in decision support and business intelligence applications. A hardware expert would view these claims with suspicion. Given the general notion that database operators are memory-bandwidth bound, one would expect the maximum gain to be roughly equal to the ratio of the memory bandwidth of GPU to that of CPU. In this paper, we adopt a model-based approach to understand when and why the performance gains of running queries on GPUs vs on CPUs vary from the bandwidth ratio (which is roughly 16x on modern hardware). We propose Crystal, a library of parallel routines that can be combined together to run full SQL queries on a GPU with minimal materialization overhead. We implement individual query operators to show that while the speedups for selection, projection, and sorts are near the bandwidth ratio, joins achieve less speedup due to differences in hardware capabilities. Interestingly, we show on a popular analytical workload that full query performance gain from running on GPU exceeds the bandwidth ratio despite individual operators having speedup less than bandwidth ratio, as a result of limitations of vectorizing chained operators on CPUs, resulting in a 25x speedup for GPUs over CPUs on the benchmark

    Phase 1/2 Dose Escalating Study of Twice-Monthly Pemetrexed and Gemcitabine in Patients with Advanced Cancer and Non-small Cell Lung Cancer

    Get PDF
    IntroductionPemetrexed is synergistic with gemcitabine in preclinical models of non-small cell lung cancer (NSCLC). The optimal dose and utility of gemcitabine and pemetrexed was evaluated in a dose-escalating study.MethodsThe phase 1 study included patients with advanced tumors, whereas the phase 2 study included patients with locally advanced or metastatic NSCLC. Gemcitabine was infused over 30 minutes, followed by pemetrexed administered over 10 minutes on day 1 of a 14-day cycle. Treatment continued for 12 cycles or until disease progression. All patients received folic acid, Vitamin B12, and steroid prophylaxis.ResultsMaximum tolerated dose was gemcitabine 1500 mg/m2, followed by pemetrexed 500 mg/m2. Fifty-three patients (29 male, 24 female) were enrolled in the phase 2 study. Response rate was 20.8% (95% CI: 0.108–0.341), and the clinical benefit rate (CR + PR + SD) was 64.2%. Median time to disease progression was 4.6 months (95% CI: 2.79–6.18), median survival was 10.1 month (95% CI: 5.95–14.09, censorship = 20.75%), and 1-year survival was 41.0%. Common grade 3 or 4 adverse events (% of patients) were neutropenia (28.3%), fatigue (22.6%), and febrile neutropenia (9.4%).ConclusionsTwice-monthly gemcitabine and pemetrexed was well tolerated, with overall survival and clinical benefit indicating disease activity in NSCLC patients
    corecore