786 research outputs found
Database Learning: Toward a Database that Becomes Smarter Every Time
In today's databases, previous query answers rarely benefit answering future
queries. For the first time, to the best of our knowledge, we change this
paradigm in an approximate query processing (AQP) context. We make the
following observation: the answer to each query reveals some degree of
knowledge about the answer to another query because their answers stem from the
same underlying distribution that has produced the entire dataset. Exploiting
and refining this knowledge should allow us to answer queries more
analytically, rather than by reading enormous amounts of raw data. Also,
processing more queries should continuously enhance our knowledge of the
underlying distribution, and hence lead to increasingly faster response times
for future queries.
We call this novel idea---learning from past query answers---Database
Learning. We exploit the principle of maximum entropy to produce answers, which
are in expectation guaranteed to be more accurate than existing sample-based
approximations. Empowered by this idea, we build a query engine on top of Spark
SQL, called Verdict. We conduct extensive experiments on real-world query
traces from a large customer of a major database vendor. Our results
demonstrate that Verdict supports 73.7% of these queries, speeding them up by
up to 23.0x for the same accuracy level compared to existing AQP systems.Comment: This manuscript is an extended report of the work published in ACM
SIGMOD conference 201
Unprecedented Alexandrium blooms in a previously low biotoxin risk area of Tasmania, Australia.
During October 2012, a shipment of blue mussels (Mytilus galloprovincialis) from the poorly monitored east coast of Tasmania, Australia, was tested by Japanese import authorities and found to be contaminated with unacceptable levels of Paralytic Shellfish Toxins (PSTs; 10 mg/kg). Subsequently local oysters, scallops, clams, the viscera of abalone and rock lobsters were also found to be contaminated. This led to a global product recall and loss to the local economy of AUD 23M. Following low toxicity during 2013 and 2014 and implementation of minimal shellfish farm closures, a more severe bloom event occurred during July-November 2015 and again June-September 2016 (up to 300,000 Alexandrium cells/L; 24 mg/kg PST in mussels, 6 mg/kg in Crassostrea gigas oysters), also causing 4 human illnesses resulting in hospitalization after consumption of wild shellfish. While Alexandrium tamarense had been detected in low concentrations in southeastern Australia since 1987, all cultured strains belonged to the mostly non-toxic group 5 (now designated A. australiense; detected since 1987) and weakly toxic group 4 (A. pacificum; detected in 1997). In contrast, the 2012 to 2016 outbreaks were dominated by highly toxic group 1 (A. fundyense) never detected previously in the Australian region. Molecular analyses suggest that A. fundyense may have been a cryptic ribotype previously present in Tasmania, but newly stimulated by altered water column stratification conditions driven by changing rainfall and temperature patterns. Increased seafood and plankton monitoring of the area now include the implementation of Alexandrium qPCR, routine Neogen™ immunological and HPLC PST tests, but ultimately may also drive change in harvesting strategies and aquaculture species selection by the local seafood industry
VerdictDB: Universalizing Approximate Query Processing
Despite 25 years of research in academia, approximate query processing (AQP)
has had little industrial adoption. One of the major causes of this slow
adoption is the reluctance of traditional vendors to make radical changes to
their legacy codebases, and the preoccupation of newer vendors (e.g.,
SQL-on-Hadoop products) with implementing standard features. Additionally, the
few AQP engines that are available are each tied to a specific platform and
require users to completely abandon their existing databases---an unrealistic
expectation given the infancy of the AQP technology. Therefore, we argue that a
universal solution is needed: a database-agnostic approximation engine that
will widen the reach of this emerging technology across various platforms.
Our proposal, called VerdictDB, uses a middleware architecture that requires
no changes to the backend database, and thus, can work with all off-the-shelf
engines. Operating at the driver-level, VerdictDB intercepts analytical queries
issued to the database and rewrites them into another query that, if executed
by any standard relational engine, will yield sufficient information for
computing an approximate answer. VerdictDB uses the returned result set to
compute an approximate answer and error estimates, which are then passed on to
the user or application. However, lack of access to the query execution layer
introduces significant challenges in terms of generality, correctness, and
efficiency. This paper shows how VerdictDB overcomes these challenges and
delivers up to 171 speedup (18.45 on average) for a variety of
existing engines, such as Impala, Spark SQL, and Amazon Redshift, while
incurring less than 2.6% relative error. VerdictDB is open-sourced under Apache
License.Comment: Extended technical report of the paper that appeared in Proceedings
of the 2018 International Conference on Management of Data, pp. 1461-1476.
ACM, 201
Developing core sets for persons following amputation based on the International Classification of Functioning, Disability and Health as a way to specify functioning
Amputation is a common late stage sequel of peripheral vascular disease and diabetes or a sequel of accidental trauma, civil unrest and landmines. The functional impairments affect many facets of life including but not limited to: Mobility; activities of daily living; body image and sexuality. Classification, measurement and comparison of the consequences of amputations has been impeded by the limited availability of internationally, multiculturally standardized instruments in the amputee setting. The introduction of the International Classification of Functioning, Disability and Health (ICF) by the World Health Assembly in May 2001 provides a globally accepted framework and classification system to describe, assess and compare function and disability. In order to facilitate the use of the ICF in everyday clinical practice and research, ICF core sets have been developed that focus on specific aspects of function typically associated with a particular disability. The objective of this paper is to outline the development process for the ICF core sets for persons following amputation. The ICF core sets are designed to translate the benefits of the ICF into clinical routine. The ICF core sets will be defined at a Consensus conference which will integrate evidence from preparatory studies, namely: (a) a systematic literature review regarding the outcome measures of clinical trails and observational studies, (b) semi-structured patient interviews, (c) international experts participating in an internet-based survey, and (d) cross-sectional, multi-center studies for clinical applicability. To validate the ICF core sets field-testing will follow. Invitation for participation: The development of ICF Core Sets is an inclusive and open process. Anyone who wishes to actively participate in this process is invited to do so
Homozygosity for a missense mutation in the 67 kDa isoform of glutamate decarboxylase in a family with autosomal recessive spastic cerebral palsy: parallels with Stiff-Person Syndrome and other movement disorders
Background
Cerebral palsy (CP) is an heterogeneous group of neurological disorders of movement and/or posture, with an estimated incidence of 1 in 1000 live births. Non-progressive forms of symmetrical, spastic CP have been identified, which show a Mendelian autosomal recessive pattern of inheritance. We recently described the mapping of a recessive spastic CP locus to a 5 cM chromosomal region located at 2q24-31.1, in rare consanguineous families.
Methods
Here we present data that refine this locus to a 0.5 cM region, flanked by the microsatellite markers D2S2345 and D2S326. The minimal region contains the candidate gene GAD1, which encodes a glutamate decarboxylase isoform (GAD67), involved in conversion of the amino acid and excitatory neurotransmitter glutamate to the inhibitory neurotransmitter γ-aminobutyric acid (GABA).
Results
A novel amino acid mis-sense mutation in GAD67 was detected, which segregated with CP in affected individuals.
Conclusions
This result is interesting because auto-antibodies to GAD67 and the more widely studied GAD65 homologue encoded by the GAD2 gene, are described in patients with Stiff-Person Syndrome (SPS), epilepsy, cerebellar ataxia and Batten disease. Further investigation seems merited of the possibility that variation in the GAD1 sequence, potentially affecting glutamate/GABA ratios, may underlie this form of spastic CP, given the presence of anti-GAD antibodies in SPS and the recognised excitotoxicity of glutamate in various contexts
Recommended from our members
High Temperature Electrolysis for Hydrogen Production from Nuclear Energy – TechnologySummary
The Department of Energy, Office of Nuclear Energy, has requested that a Hydrogen Technology Down-Selection be performed to identify the hydrogen production technology that has the best potential for timely commercial demonstration and for ultimate deployment with the Next Generation Nuclear Plant (NGNP). An Independent Review Team has been assembled to execute the down-selection. This report has been prepared to provide the members of the Independent Review Team with detailed background information on the High Temperature Electrolysis (HTE) process, hardware, and state of the art. The Idaho National Laboratory has been serving as the lead lab for HTE research and development under the Nuclear Hydrogen Initiative. The INL HTE program has included small-scale experiments, detailed computational modeling, system modeling, and technology demonstration. Aspects of all of these activities are included in this report. In terms of technology demonstration, the INL successfully completed a 1000-hour test of the HTE Integrated Laboratory Scale (ILS) technology demonstration experiment during the fall of 2008. The HTE ILS achieved a hydrogen production rate in excess of 5.7 Nm3/hr, with a power consumption of 18 kW. This hydrogen production rate is far larger than has been demonstrated by any of the thermochemical or hybrid processes to date
BCL10 is rarely mutated in human prostate carcinoma, small-cell lung cancer, head and neck tumours, renal carcinoma and sarcomas
We have used single-strand conformation polymorphism (SSCP) analysis to screen for mutations in the BCL 10 gene in 81 primary prostate carcinomas, 20 squamous cell cancers of the head and neck, 15 small-cell lung cancer cell lines, 24 renal carcinoma cell lines and 13 sarcoma cell lines. We failed to find evidence of somatically acquired mutations of the BCL10 gene suggesting that BCL 10 does not play a major role in the development of these malignancies
Robust Revascularization in Models of Limb Ischemia Using a Clinically Translatable Human Stem Cell-Derived Endothelial Cell Product
Pluripotent stem cell-derived differentiated endothelial cells offer high potential in regenerative medicine in the cardiovascular system. With the aim of translating the use of a human stem cell-derived endothelial cell product (hESC-ECP) for treatment of critical limb ischemia (CLI) in man, we report a good manufacturing practice (GMP)-compatible protocol and detailed cell tracking and efficacy data in multiple preclinical models. The clinical-grade cell line RC11 was used to generate hESC-ECP, which was identified as mostly endothelial (60% CD31+/CD144+), with the remainder of the subset expressing various pericyte/mesenchymal stem cell markers. Cell tracking using MRI, PET, and qPCR in a murine model of limb ischemia demonstrated that hESC-ECP was detectable up to day 7 following injection. Efficacy in several murine models of limb ischemia (immunocompromised/immunocompetent mice and mice with either type I/II diabetes mellitus) demonstrated significantly increased blood perfusion and capillary density. Overall, we demonstrate a GMP-compatible hESC-ECP that improved ischemic limb perfusion and increased local angiogenesis without engraftment, paving the way for translation of this therapy
Critical reflections on the benefits of ICT in education
In both schools and homes, information and communication technologies (ICT) are widely seen as enhancing learning, this hope fuelling their rapid diffusion and adoption throughout developed societies. But they are not yet so embedded in the social practices of everyday life as to be taken for granted, with schools proving slower to change their lesson plans than they were to fit computers in the classroom. This article examines two possible explanations - first, that convincing evidence of improved learning outcomes remains surprisingly elusive, and second, the unresolved debate over whether ICT should be conceived of as supporting delivery of a traditional or a radically different vision of pedagogy based on soft skills and new digital literacies. The difficulty in establishing traditional benefits, and the uncertainty over pursuing alternative benefits, raises fundamental questions over whether society really desires a transformed, technologically-mediated relation between teacher and learner
- …