Search CORE

10,550 research outputs found

How do top- and bottom-performing companies differ in using business analytics?

Author: Cao Guangming
Duan Yanqing
Publication venue: 'Emerald'
Publication date: 21/09/2017
Field of study

Purpose Business analytics (BA) has attracted growing attention mainly due to the phenomena of big data. While studies suggest that BA positively affects organizational performance, there is a lack of academic research. The purpose of this paper, therefore, is to examine the extent to which top- and bottom-performing companies differ regarding their use and organizational facilitation of BA. Design/methodology/approach Hypotheses are developed drawing on the information processing view and contingency theory, and tested using multivariate analysis of variance to analyze data collected from 117 UK manufacture companies. Findings Top- and bottom-performing companies differ significantly in their use of BA, data-driven environment, and level of fit between BA and data-drain environment. Practical implications Extensive use of BA and data-driven decisions will lead to superior firm performance. Companies wishing to use BA to improve decision making and performance need to develop relevant analytical strategy to guide BA activities and design its structure and business processes to embed BA activities. Originality/value This study provides useful management insights into the effective use of BA for improving organizational performance

Crossref

University of Bedfordshire Repository

explorase: Multivariate Exploratory Analysis and Visualization for Systems Biology

Author: Dianne Cook
Eun-Kyung Lee
Eve Syrkin Wurtele
Heather Babka
Michael Lawrence
Publication venue
Publication date
Field of study

The datasets being produced by high-throughput biological experiments, such as microarrays, have forced biologists to turn to sophisticated statistical analysis and visualization tools in order to understand their data. We address the particular need for an open-source exploratory data analysis tool that applies numerical methods in coordination with interactive graphics to the analysis of experimental data. The software package, known as explorase, provides a graphical user interface (GUI) on top of the R platform for statistical computing and the GGobi software for multivariate interactive graphics. The GUI is designed for use by biologists, many of whom are unfamiliar with the R language. It displays metadata about experimental design and biological entities in tables that are sortable and filterable. There are menu shortcuts to the analysis methods implemented in R, including graphical interfaces to linear modeling tools. The GUI is linked to data plots in GGobi through a brush tool that simultaneously colors rows in the entity information table and points in the GGobi plots.

Research Papers in Economics

Multivariate Pointwise Information-Driven Data Sampling and Visualization

Author: Ahrens James
Biswas Ayan
Dutta Soumya
Publication venue: 'MDPI AG'
Publication date: 16/07/2019
Field of study

With increasing computing capabilities of modern supercomputers, the size of the data generated from the scientific simulations is growing rapidly. As a result, application scientists need effective data summarization techniques that can reduce large-scale multivariate spatiotemporal data sets while preserving the important data properties so that the reduced data can answer domain-specific queries involving multiple variables with sufficient accuracy. While analyzing complex scientific events, domain experts often analyze and visualize two or more variables together to obtain a better understanding of the characteristics of the data features. Therefore, data summarization techniques are required to analyze multi-variable relationships in detail and then perform data reduction such that the important features involving multiple variables are preserved in the reduced data. To achieve this, in this work, we propose a data sub-sampling algorithm for performing statistical data summarization that leverages pointwise information theoretic measures to quantify the statistical association of data points considering multiple variables and generates a sub-sampled data that preserves the statistical association among multi-variables. Using such reduced sampled data, we show that multivariate feature query and analysis can be done effectively. The efficacy of the proposed multivariate association driven sampling algorithm is presented by applying it on several scientific data sets.Comment: 25 page

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

Portinari: A Data Exploration Tool to Personalize Cervical Cancer Screening

Author: Meira Jr. Wagner
Minardi Raquel C. de Melo
Nigard Mari
Ribeiro Manoel Horta
Sen Sagar
Publication venue
Publication date: 01/04/2017
Field of study

Socio-technical systems play an important role in public health screening programs to prevent cancer. Cervical cancer incidence has significantly decreased in countries that developed systems for organized screening engaging medical practitioners, laboratories and patients. The system automatically identifies individuals at risk of developing the disease and invites them for a screening exam or a follow-up exam conducted by medical professionals. A triage algorithm in the system aims to reduce unnecessary screening exams for individuals at low-risk while detecting and treating individuals at high-risk. Despite the general success of screening, the triage algorithm is a one-size-fits all approach that is not personalized to a patient. This can easily be observed in historical data from screening exams. Often patients rely on personal factors to determine that they are either at high risk or not at risk at all and take action at their own discretion. Can exploring patient trajectories help hypothesize personal factors leading to their decisions? We present Portinari, a data exploration tool to query and visualize future trajectories of patients who have undergone a specific sequence of screening exams. The web-based tool contains (a) a visual query interface (b) a backend graph database of events in patients' lives (c) trajectory visualization using sankey diagrams. We use Portinari to explore diverse trajectories of patients following the Norwegian triage algorithm. The trajectories demonstrated variable degrees of adherence to the triage algorithm and allowed epidemiologists to hypothesize about the possible causes.Comment: Conference paper published at ICSE 2017 Buenos Aires, at the Software Engineering in Society Track. 10 pages, 5 figure

arXiv.org e-Print Archive

Crossref

Curriculum Guidelines for Undergraduate Programs in Data Science

Author: Agarwal Mahesh
Averett Maia
Baumer Benjamin
Bray Andrew
Bressoud Thomas
Bryant Lance
Cheng Lei
De Veaux Richard
Francis Amanda
Gould Robert
Kim Albert Y.
Kretchmar Matt
Lu Qin
Moskol Ann
Nolan Deborah
Pelayo Roberto
Raleigh Sean
Sethi Ricky J.
Sondjaja Mutiara
Tiruviluamala Neelesh
Uhlig Paul
Washington Talitha
Wesley Curtis
White David
Ye Ping
Publication venue: 'Annual Reviews'
Publication date: 01/01/2017
Field of study

The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for institutions planning for or revising a major in Data Science

arXiv.org e-Print Archive

Smith College: Smith ScholarWorks

Viewpoints: A high-performance high-dimensional exploratory data analysis tool

Author: C. Levit
Høg E.
M. J. Way
P. R. Gazis
Publication venue: 'University of Chicago Press'
Publication date: 08/11/2010
Field of study

Scientific data sets continue to increase in both size and complexity. In the past, dedicated graphics systems at supercomputing centers were required to visualize large data sets, but as the price of commodity graphics hardware has dropped and its capability has increased, it is now possible, in principle, to view large complex data sets on a single workstation. To do this in practice, an investigator will need software that is written to take advantage of the relevant graphics hardware. The Viewpoints visualization package described herein is an example of such software. Viewpoints is an interactive tool for exploratory visual analysis of large, high-dimensional (multivariate) data. It leverages the capabilities of modern graphics boards (GPUs) to run on a single workstation or laptop. Viewpoints is minimalist: it attempts to do a small set of useful things very well (or at least very quickly) in comparison with similar packages today. Its basic feature set includes linked scatter plots with brushing, dynamic histograms, normalization and outlier detection/removal. Viewpoints was originally designed for astrophysicists, but it has since been used in a variety of fields that range from astronomy, quantum chemistry, fluid dynamics, machine learning, bioinformatics, and finance to information technology server log mining. In this article, we describe the Viewpoints package and show examples of its usage.Comment: 18 pages, 3 figures, PASP in press, this version corresponds more closely to that to be publishe

arXiv.org e-Print Archive

Crossref