The Data Lakehouse: Data Warehousing and More
Relational Database Management Systems designed for Online Analytical
Processing (RDBMS-OLAP) have been foundational to democratizing data and
enabling analytical use cases such as business intelligence and reporting for
many years. However, RDBMS-OLAP systems present some well-known challenges.
They are optimized primarily for relational workloads, lead to a
proliferation of data copies that can become unmanageable, and store data
in proprietary formats, which can lead to vendor lock-in, restricting
access to engines, tools, and capabilities beyond what the vendor offers. As
the demand for data-driven decision making surges, the need for a more robust
data architecture to address these challenges becomes ever more critical. Cloud
data lakes have addressed some of the shortcomings of RDBMS-OLAP systems, but
they present their own set of challenges. More recently, organizations have
often followed a two-tier architectural approach that combines cloud data
lakes with RDBMS-OLAP systems to take advantage of both platforms.
However, this approach brings additional challenges, complexities, and
overhead. This paper discusses how a data lakehouse, a new architectural
approach, achieves the same benefits of an RDBMS-OLAP and cloud data lake
combined, while also providing additional advantages. We take today's data
warehousing and break it down into implementation-independent components,
capabilities, and practices. We then take these aspects and show how a
lakehouse architecture satisfies them. Then, we go a step further and discuss
what additional capabilities and benefits a lakehouse architecture provides
over an RDBMS-OLAP.
A systematic approach for the accurate non-invasive estimation of blood glucose utilizing a novel light-tissue interaction adaptive modelling scheme
Diabetes is one of the biggest health challenges of the 21st century. The obesity epidemic, sedentary lifestyles and an ageing population mean prevalence of the condition is currently doubling every generation. Diabetes is associated with serious chronic ill health, disability and premature mortality. Long-term complications, including heart disease, stroke, blindness, kidney disease and amputations, make the greatest contribution to the costs of diabetes care. Many of these long-term effects could be avoided with earlier, more effective monitoring and treatment. Currently, blood glucose can only be monitored through the use of invasive techniques. Despite many attempts, there is to date no widely accepted and readily available non-invasive technique for measuring blood glucose. This paper addresses one of the most difficult non-invasive monitoring problems, that of blood glucose, and proposes a novel approach that will enable the accurate, calibration-free estimation of glucose concentration in blood. This approach is based on spectroscopic techniques and a new adaptive modelling scheme. The theoretical implementation and the effectiveness of the adaptive modelling scheme for this application are described, and a detailed mathematical evaluation is employed to prove that such a scheme is capable of accurately extracting the concentration of glucose from complex biological media.
The trade-off between taxi time and fuel consumption in airport ground movement
Reducing environmental impact is an important agenda item in many sectors nowadays, including the air transportation sector. One area which has remained relatively unexplored in this context is the ground movement problem for aircraft on the airport's surface. Aircraft have to be routed from a gate to a runway and vice versa, and it is still unknown whether fuel burn and environmental impact reductions will best result from purely minimising the taxi times or whether it is also important to avoid multiple acceleration phases. This paper presents a newly developed multi-objective approach for analysing the trade-off between taxi time and fuel consumption during taxiing. The approach consists of a combination of a graph-based routing algorithm and a population adaptive immune algorithm to discover different speed profiles of aircraft. Analysis with data from a European hub airport has highlighted the impressive performance of the new approach. Furthermore, it is shown that the trade-off between taxi time and fuel consumption is very sensitive to the fuel-related objective function which is used.
Strong Ultraviolet Pulse From a Newborn Type Ia Supernova
Type Ia supernovae are destructive explosions of carbon oxygen white dwarfs.
Although they are used empirically to measure cosmological distances, the
nature of their progenitors remains mysterious. One of the leading progenitor
models, called the single degenerate channel, hypothesizes that a white dwarf
accretes matter from a companion star and the resulting increase in its central
pressure and temperature ignites a thermonuclear explosion. Here we report
observations of strong but declining ultraviolet emission from a Type Ia
supernova within four days of its explosion. This emission is consistent with
theoretical expectations of collision between material ejected by the supernova
and a companion star, and therefore provides evidence that some Type Ia
supernovae arise from the single degenerate channel. Comment: Accepted for publication in the 21 May 2015 issue of Nature
Measuring the effectiveness of in-hospital and on-base Prevent Alcohol and Risk-related Trauma in Youth (P.A.R.T.Y.) programs on reducing alcohol related harms in naval trainees: P.A.R.T.Y. Defence study protocol
Background: Reducing alcohol related harms in Australian Defence Force (ADF) trainees has been identified as a priority, but there are few evidence-based prevention programs available for the military setting. The study aims to test whether the P.A.R.T.Y. program, delivered in-hospital or on-base, can reduce harmful alcohol consumption among ADF trainees.
Methods/design: The study is a 3-arm randomized controlled trial, involving 953 Royal Australian Navy trainees from a single base. Trainees, aged 18 to 30 years, will be randomly assigned to the study arms: i. in-hospital P.A.R.T.Y.; ii. on-base P.A.R.T.Y.; and iii. control group. All groups will receive the routine ADF annual alcohol awareness training. The primary outcome is the proportion of participants reporting an Alcohol Use Disorders Identification Test (AUDIT) score of 8 or above at 12 months post-intervention. The secondary outcome is the number of alcohol related incidents reported to the Royal Australian Navy (RAN) in the 12 months post-intervention.
Discussion: This is the first trial of the use of the P.A.R.T.Y. program in the military. If the proposed intervention proves efficacious, it may be a useful program in the early education of RAN trainees.
Trial registration: Australian New Zealand Clinical Trials Registry (ANZCTR): ACTRN12614001332617, date of registration: 18/12/2014, "retrospectively registered".
Identification of potential therapeutic targets in prostate cancer through a cross-species approach.
Genetically engineered mouse models of cancer can be used to filter genome-wide expression datasets generated from human tumours and to identify gene expression alterations that are functionally important to cancer development and progression. In this study, we have generated RNAseq data from tumours arising in two established mouse models of prostate cancer, PB-Cre/PtenloxP/loxP and p53loxP/loxPRbloxP/loxP, and integrated this with published human prostate cancer expression data to pinpoint cancer-associated gene expression changes that are conserved between the two species. To identify potential therapeutic targets, we then filtered this information for genes that are either known or predicted to be druggable. Using this approach, we revealed a functional role for the kinase MELK as a driver and potential therapeutic target in prostate cancer. We found that MELK expression was required for cell survival, affected the expression of genes associated with prostate cancer progression and was associated with biochemical recurrence.
Replication of an empirical approach to delineate the heterogeneity of chronic unexplained fatigue
Background: Chronic fatigue syndrome (CFS) is defined by self-reported symptoms. There are no diagnostic signs or laboratory markers, and the pathophysiology remains inchoate. In part, difficulties identifying and replicating biomarkers and elucidating the pathophysiology reflect the heterogeneous nature of the syndromic illness CFS. We conducted this analysis of people from defined metropolitan, urban, and rural populations to replicate our earlier empirical delineation of medically unexplained chronic fatigue and CFS into discrete endophenotypes. Both the earlier and current analyses utilized quantitative measures of functional impairment and symptoms as well as laboratory data. This study and the earlier one enrolled participants from defined populations and measured the internal milieu, which differentiates them from studies of clinic referrals that examine only clinical phenotypes.
Methods: This analysis evaluated 386 women identified in a population-based survey of chronic fatigue and unwellness in metropolitan, urban, and rural populations of the state of Georgia, USA. We used variables previously demonstrated to effectively delineate endophenotypes in an attempt to replicate identification of these endophenotypes. Latent class analyses were used to derive the classes, and these were compared and contrasted to those described in the previous study based in Wichita, Kansas.
Results: We identified five classes in the best fit analysis. Participants in Class 1 (25%) were polysymptomatic, with sleep problems and depressed mood. Class 2 (24%) was also polysymptomatic, with insomnia and depression, but participants were also obese with associated metabolic strain. Class 3 (20%) had more selective symptoms but was equally obese with metabolic strain. Class 4 (20%) and Class 5 (11%) consisted of nonfatigued, less symptomatic individuals, Class 4 being older and Class 5 younger. The classes were generally validated by independent variables. People with CFS fell equally into Classes 1 and 2. Similarities to the Wichita findings included the same four main defining variables of obesity, sleep problems, depression, and the multiplicity of symptoms. Four out of five classes were similar across both studies.
Conclusion: These data support the hypothesis that chronic medically unexplained fatigue is heterogeneous and can be delineated into discrete endophenotypes that can be replicated. The data do not support the current perception that CFS represents a unique homogeneous disease and suggest broader criteria may be more explanatory. This replication suggests that delineation of endophenotypes of CFS and associated ill health may be necessary in order to better understand etiology and provide more patient-focused treatments.
Evaluation of Treatment-Related Mortality Among Paediatric Cancer Deaths: a population based analysis.
BACKGROUND: Objectives were to describe the proportion of deaths due to treatment-related mortality (TRM) and to identify risk factors and probable causes of TRM among paediatric cancer deaths in a population-based cohort.
METHODS: We included children with cancer ≤18 years diagnosed and treated in Ontario who died between January 2003 and December 2012. Deaths were identified using a provincial registry, the Pediatric Oncology Group of Ontario Networked Information System. Probable causes of TRM were described.
RESULTS: Among the 964 deaths identified, 821 were included. The median age at diagnosis was 6.6 years (range 0-18.8) and 51.8% had at least one relapse. Of the deaths examined, TRM occurred in 217/821 (26.4%) while 604/821 (73.6%) were due to progressive cancer. Deaths from TRM did not change over time. Using multiple regression, younger age, leukaemia diagnosis and absence of relapse were independently positively associated with TRM. The most common probable causes of TRM were respiratory, infection and haemorrhage.
CONCLUSIONS: TRM was responsible for 26.4% of deaths in paediatric cancer. Underlying diagnosis, younger age and absence of relapse were associated with TRM and causes of TRM differed by diagnosis group. Future work should evaluate TRM rate and risk factors among newly diagnosed cancer patients.
Using a New Odour-Baited Device to Explore Options for Luring and Killing Outdoor-Biting Malaria Vectors: A Report on Design and Field Evaluation of the Mosquito Landing Box.
Mosquitoes that bite people outdoors can sustain malaria transmission even where effective indoor interventions such as bednets or indoor residual spraying are already widely used. Outdoor tools may therefore complement current indoor measures and improve control. We developed and evaluated a prototype mosquito control device, the 'Mosquito Landing Box' (MLB), which is baited with human odours and treated with mosquitocidal agents. The findings are used to explore technical options and challenges relevant to luring and killing outdoor-biting malaria vectors in endemic settings. Field experiments were conducted in Tanzania to assess if wild host-seeking mosquitoes 1) visited the MLBs, 2) stayed long or left shortly after arrival at the device, 3) visited the devices at times when humans were also outdoors, and 4) could be killed by contaminants applied on the devices. Odours suctioned from volunteer-occupied tents were also evaluated as a potential low-cost bait, by comparing baited and unbaited MLBs. There were significantly more Anopheles arabiensis, An. funestus, Culex and Mansonia mosquitoes visiting baited MLBs than unbaited controls (P≤0.028). Increasing sampling frequency from every 120 min to 60 and 30 min led to an increase in vector catches of up to 3.6-fold (P≤0.002), indicating that many mosquitoes visited the device but left shortly afterwards. Outdoor host-seeking activity of malaria vectors peaked between 7:30 and 10:30pm, and between 4:30 and 6:00am, matching periods when locals were also outdoors. Maximum mortality of mosquitoes visiting MLBs sprayed or painted with formulations of the candidate mosquitocidal agent (pirimiphos-methyl) was 51%. Odours from volunteer-occupied tents attracted significantly more mosquitoes to MLBs than controls (P<0.001).
While odour-baited devices such as the MLBs clearly have potential against outdoor-biting mosquitoes in communities where LLINs are used, candidate contaminants must be those that are effective at ultra-low doses even after short contact periods, since important vector species such as An. arabiensis make only brief visits to such devices. Natural human odours suctioned from occupied dwellings could constitute affordable sources of attractants to supplement odour baits for the devices. The killing agents used should be environmentally safe, long lasting, and have different modes of action (other than pyrethroids as used on LLINs), to curb the risk of physiological insecticide resistance.
Susceptibility to Vibrio cholerae Infection in a Cohort of Household Contacts of Patients with Cholera in Bangladesh
Vibrio cholerae is the bacterium that causes cholera, a severe form of diarrhea that leads to rapid and potentially fatal dehydration when the infection is not treated promptly. Cholera remains an important cause of diarrhea globally, and V. cholerae continues to cause major epidemics in the most vulnerable populations. Although there have been recent discoveries about how the bacterium adapts to the human intestine and causes diarrhea, there is little understanding of why some people are protected from infection with V. cholerae. This article describes several factors that are associated with the risk of developing V. cholerae infection among people living in the same household with a patient with severe cholera who are at high risk of contracting the infection. One of the findings is that IgA antibodies, a type of antibody associated with immunity at mucosal surfaces such as the intestine, that target several components of the bacterium are associated with immunity to V. cholerae infection. This article also describes genetic and nutritional factors that additionally influence susceptibility to V. cholerae infection.