
    Sampling-Based Query Re-Optimization

    Despite decades of work, query optimizers still make mistakes on "difficult" queries because of bad cardinality estimates, often due to the interaction of multiple predicates and correlations in the data. In this paper, we propose a low-cost post-processing step that can take a plan produced by the optimizer, detect when it is likely to have made such a mistake, and take steps to fix it. Specifically, our solution is a sampling-based iterative procedure that requires almost no changes to the original query optimizer or query evaluation mechanism of the system. We show that this indeed imposes low overhead and catches cases where three widely used optimizers (PostgreSQL and two commercial systems) make large errors.
    Comment: This is the extended version of a paper with the same title and authors that appears in the Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2016).
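The iterative procedure the abstract describes can be pictured as a feedback loop: plan, check the optimizer's cardinality estimates against cheap samples, and re-plan with corrected numbers until the estimates look sound. The sketch below is an invented illustration of that loop, not the paper's actual algorithm; `optimize`, `sample_card`, the plan representation (a list of `(subexpression, estimated cardinality)` pairs), and the error threshold are all assumptions for illustration.

```python
def reoptimize(query, optimize, sample_card, threshold=10.0, max_rounds=5):
    """Iteratively re-plan while sampling reveals large estimation errors.

    optimize(query, feedback) -> plan, where feedback maps subexpressions
    to corrected cardinalities; sample_card(subexpr) is a cheap
    sampling-based cardinality check (stand-ins for the host system).
    """
    feedback = {}
    plan = optimize(query, feedback)
    for _ in range(max_rounds):
        corrections = {}
        for subexpr, est in plan:
            actual = sample_card(subexpr)
            # q-error style ratio between estimated and sampled cardinality
            error = max(est, actual) / max(min(est, actual), 1)
            if error > threshold:
                corrections[subexpr] = actual
        if not corrections:          # all estimates within tolerance: done
            return plan
        feedback.update(corrections)
        plan = optimize(query, feedback)  # re-plan with corrected numbers
    return plan
```

Because re-planning only happens when sampling detects a large error, a plan built from accurate estimates passes through with a single cheap validation pass, which is how the approach keeps its overhead low.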

    Measuring co-authorship and networking-adjusted scientific impact

    Appraisal of the scientific impact of researchers, teams and institutions with productivity and citation metrics has major repercussions. Funding and promotion of individuals and survival of teams and institutions depend on publications and citations. In this competitive environment, the number of authors per paper is increasing and apparently some co-authors don't satisfy authorship criteria. Listing of individual contributions is still sporadic and also open to manipulation. Metrics are needed to measure the networking intensity for a single scientist or group of scientists accounting for patterns of co-authorship. Here, I define I1 for a single scientist as the number of authors who appear in at least I1 papers of the specific scientist. For a group of scientists or institution, In is defined as the number of authors who appear in at least In papers that bear the affiliation of the group or institution. I1 depends on the number of papers authored Np. The power exponent R of the relationship between I1 and Np categorizes scientists as solitary (R>2.5), nuclear (R=2.25-2.5), networked (R=2-2.25), extensively networked (R=1.75-2) or collaborators (R<1.75). R may be used to adjust for co-authorship networking the citation impact of a scientist. In similarly provides a simple measure of the effective networking size to adjust the citation impact of groups or institutions. Empirical data are provided for single scientists and institutions for the proposed metrics. Cautious adoption of adjustments for co-authorship and networking in scientific appraisals may offer incentives for more accountable co-authorship behaviour in published articles.
    Comment: 25 pages, 5 figures
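The I1 definition above has the same fixed-point shape as the h-index, but computed over co-author appearance counts rather than citations: the largest n such that at least n distinct co-authors each appear on at least n of the scientist's papers. A minimal sketch of that computation (the function name and the paper-as-author-list representation are assumptions for illustration):

```python
from collections import Counter

def i1_index(papers, scientist):
    """I1: the largest n such that at least n distinct co-authors each
    appear on >= n of the given scientist's papers. Each paper is a
    list of author names."""
    counts = Counter(
        author
        for paper in papers
        for author in paper
        if author != scientist      # count co-authors only
    )
    # h-index-style cutoff over descending appearance counts
    freqs = sorted(counts.values(), reverse=True)
    i1 = 0
    for rank, freq in enumerate(freqs, start=1):
        if freq >= rank:
            i1 = rank
        else:
            break
    return i1
```

For example, a scientist whose three papers include one co-author on all three and another on two of them has I1 = 2: exactly two co-authors appear in at least two papers.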

    International ranking systems for universities and institutions: a critical appraisal

    Background: Ranking of universities and institutions has attracted wide attention recently. Several systems have been proposed that attempt to rank academic institutions worldwide.
    Methods: We review the two most publicly visible ranking systems, the Shanghai Jiao Tong University 'Academic Ranking of World Universities' and the Times Higher Education Supplement 'World University Rankings', and also briefly review other ranking systems that use different criteria. We assess the construct validity for educational and research excellence and the measurement validity of each of the proposed ranking criteria, and try to identify generic challenges in international ranking of universities and institutions.
    Results: None of the reviewed criteria for international ranking seems to have very good construct validity for both educational and research excellence, and most don't have very good construct validity even for just one of these two aspects of excellence. Measurement error for many items is also considerable or is not possible to determine due to lack of publication of the relevant data and methodology details. The concordance between the 2006 rankings by Shanghai and Times is modest at best, with only 133 universities shared in their top 200 lists. The examination of the existing international ranking systems suggests that generic challenges include adjustment for institutional size, definition of institutions, implications of average measurements of excellence versus measurements of extremes, adjustments for scientific field, time frame of measurement and allocation of credit for excellence.
    Conclusion: Naïve lists of international institutional rankings that do not address these fundamental challenges with transparent methods are misleading and should be abandoned. We make some suggestions on how focused and standardized evaluations of excellence could be improved and placed in proper context.

    Ontology Based Data Access in Statoil

    Ontology Based Data Access (OBDA) is a prominent approach to querying databases which uses an ontology to expose data in a conceptually clear manner by abstracting away from the technical schema-level details of the underlying data. The ontology is ‘connected’ to the data via mappings that allow queries posed over the ontology to be automatically translated into data-level queries that can be executed by the underlying database management system. Despite a lot of attention from the research community, there are still few instances of real-world industrial use of OBDA systems. In this work we present data access challenges in the data-intensive petroleum company Statoil and our experience in addressing these challenges with OBDA technology. In particular, we have developed a deployment module to create ontologies and mappings from relational databases in a semi-automatic fashion; a query processing module to perform and optimise the process of translating ontological queries into data queries and their execution over either a single DB or federated DBs; and a query formulation module to support query construction for engineers with a limited IT background. Our modules have been integrated in one OBDA system, deployed at Statoil, integrated with Statoil’s infrastructure, and evaluated with Statoil’s engineers and data.
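The translation step the abstract describes, unfolding an ontology-level query into a data-level query via mappings, can be sketched in miniature. The mapping table, predicate names, and query shape below are invented for illustration only; real OBDA systems such as the one described handle full ontological query languages and far richer mapping languages.

```python
# Hypothetical mapping: each ontology class/property -> a SQL fragment
# producing its instances, with subject bound to column alias "s".
MAPPINGS = {
    "Wellbore":    "SELECT id AS s FROM wellbore",
    "completedIn": "SELECT wb_id AS s, year AS o FROM wellbore_completion",
}

def unfold(atoms):
    """Translate a conjunction of ontology atoms over one shared subject
    into a single SQL query by joining the mapped fragments on "s"."""
    subqueries = [f"({MAPPINGS[a]}) AS t{i}" for i, a in enumerate(atoms)]
    joins = " JOIN ".join(subqueries)
    on = " AND ".join(f"t0.s = t{i}.s" for i in range(1, len(atoms)))
    return f"SELECT t0.s FROM {joins}" + (f" ON {on}" if on else "")
```

For example, `unfold(["Wellbore", "completedIn"])` joins the two mapped fragments on the shared subject, producing plain SQL that the underlying DBMS can optimise and execute; the ontology user never sees the schema-level tables.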

    Reporting of Human Genome Epidemiology (HuGE) association studies: An empirical assessment

    Background: Several thousand human genome epidemiology association studies are published every year investigating the relationship between common genetic variants and diverse phenotypes. Transparent reporting of study methods and results allows readers to better assess the validity of study findings. Here, we document reporting practices of human genome epidemiology studies.
    Methods: Articles were randomly selected from a continuously updated database of human genome epidemiology association studies to be representative of genetic epidemiology literature. The main analysis evaluated 315 articles published in 2001–2003. For a comparative update, we evaluated 28 more recent articles published in 2006, focusing on issues that were poorly reported in 2001–2003.
    Results: During both time periods, most studies comprised relatively small study populations and examined one or more genetic variants within a single gene. Articles were inconsistent in reporting the data needed to assess selection bias and the methods used to minimize misclassification (of the genotype, outcome, and environmental exposure) or to identify population stratification. Statistical power, the use of unrelated study participants, and the use of replicate samples were reported more often in articles published during 2006 when compared with the earlier sample.
    Conclusion: We conclude that many items needed to assess error and bias in human genome epidemiology association studies are not consistently reported. Although some improvements were seen over time, reporting guidelines and online supplemental material may help enhance the transparency of this literature.

    WhoLoDancE: Towards a methodology for selecting Motion Capture Data across different Dance Learning Practice

    In this paper we present the objectives and preliminary work of WhoLoDancE, a Research and Innovation Action funded under the European Union‘s Horizon 2020 programme, which aims at using new technologies for capturing and analyzing dance movement to facilitate whole-body interaction learning experiences for a variety of dance genres. Dance is a diverse and heterogeneous practice, and WhoLoDancE will develop a protocol for the creation and/or selection of dance sequences drawn from different dance styles for different teaching and learning modalities. As dance learning practice lacks standardization beyond dance genres and specific schools and techniques, one of the first project challenges is to bring together a variety of dance genres and teaching practices and work towards a methodology for selecting the appropriate shots for motion capture, in order to acquire kinetic material that will provide a satisfying proof of concept for learning scenarios of particular genres. The four use cases we are investigating are 1) classical ballet, 2) contemporary dance, 3) flamenco and 4) Greek folk dance.