Analysis & design of data farming algorithm for cardiac patient data
Data farming is a process of growing data by applying statistical, predictive, machine-learning, and data-mining approaches to the available data. Because data collection is costly, data mining projects often reuse existing data collected for other purposes, such as data gathered daily for processing or for monitoring and control. Sometimes the available dataset is large, or wide, and sufficient for knowledge extraction; at other times it is narrow and insufficient for extracting meaningful knowledge, or the data may not exist at all. Mining from wide datasets has received broad attention in the literature, and many models and algorithms for data reduction and feature selection have been developed for them. Extracting knowledge from a narrow dataset (partial availability of data), or in the absence of an existing dataset, has not been sufficiently addressed. In this paper we propose a data farming algorithm that grows sufficient data from a small available seed dataset. The J48 classification accuracy achieved on the farmed data is better than that achieved on the seed data, indicating that the proposed data farming algorithm is effective.
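As an illustration of the idea, the sketch below grows a toy seed dataset and compares decision-tree accuracy before and after farming. The per-class Gaussian resampling is an assumption for illustration only, not the paper's algorithm, and scikit-learn's DecisionTreeClassifier stands in for WEKA's J48 (C4.5).

```python
# Illustrative sketch of "farming" data from a small seed set.
# Assumptions (not from the paper): features are numeric, and new rows are
# drawn from per-class Gaussian fits of the seed features.
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import cross_val_score

def farm_data(X_seed, y_seed, n_new_per_class=100, rng=None):
    """Grow synthetic rows by sampling per-class feature distributions."""
    rng = rng or np.random.default_rng(0)
    X_new, y_new = [], []
    for label in np.unique(y_seed):
        Xc = X_seed[y_seed == label]
        mu, sigma = Xc.mean(axis=0), Xc.std(axis=0) + 1e-9
        X_new.append(rng.normal(mu, sigma, size=(n_new_per_class, Xc.shape[1])))
        y_new.append(np.full(n_new_per_class, label))
    return np.vstack([X_seed] + X_new), np.concatenate([y_seed] + y_new)

# Compare classifier accuracy on seed vs. farmed data, mirroring the paper's
# evaluation but with a stand-in classifier and toy seed data.
rng = np.random.default_rng(0)
X_seed = rng.normal(size=(30, 5))                      # small "seed" dataset
y_seed = (X_seed[:, 0] + X_seed[:, 1] > 0).astype(int)
X_farmed, y_farmed = farm_data(X_seed, y_seed)

clf = DecisionTreeClassifier(random_state=0)
print("seed accuracy:  ", cross_val_score(clf, X_seed, y_seed, cv=3).mean())
print("farmed accuracy:", cross_val_score(clf, X_farmed, y_farmed, cv=3).mean())
```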
Data-Driven Understanding of Smart Service Systems Through Text Mining
Smart service systems are everywhere, in homes and in the transportation, energy, and healthcare sectors. However, such systems have yet to be fully understood in the literature. Given the widespread applications of and research on smart service systems, we used text mining to develop a unified understanding of such systems in a data-driven way. Specifically, we used a combination of metrics and machine learning algorithms to preprocess and analyze text data related to smart service systems, including text from the scientific literature and news articles. By analyzing 5,378 scientific articles and 1,234 news articles, we identify important keywords, 16 research topics, 4 technology factors, and 13 application areas. We define "smart service system" based on the analytics results. Furthermore, we discuss the theoretical and methodological implications of our work, such as the 5Cs (connection, collection, computation, and communications for co-creation) of smart service systems and the text mining approach to understand service research topics. We believe this work, which aims to establish common ground for understanding these systems across multiple disciplinary perspectives, will encourage further research and development of modern service systems.
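A minimal sketch of one way such a text-mining pipeline can be set up, assuming a TF-IDF representation and NMF topic factorisation; the study's actual metrics, algorithms, and corpus are not specified here, and the three documents below are stand-ins.

```python
# Hedged sketch: extract keywords and latent topics from a corpus of
# abstracts. The study's exact pipeline may differ.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import NMF

docs = [
    "smart homes use IoT sensors and cloud analytics for energy services",
    "healthcare service systems connect patients with real-time monitoring",
    "transportation systems co-create value through connected vehicle data",
]

# Preprocess: lowercase, drop English stop words, weight terms by TF-IDF.
vectorizer = TfidfVectorizer(stop_words="english")
X = vectorizer.fit_transform(docs)

# Factor the term-document matrix into k latent topics (the paper reports
# 16 topics; k=2 here because the toy corpus is tiny).
nmf = NMF(n_components=2, random_state=0)
doc_topics = nmf.fit_transform(X)

# Report the top keywords per topic.
terms = vectorizer.get_feature_names_out()
for t, weights in enumerate(nmf.components_):
    top = [terms[i] for i in weights.argsort()[::-1][:5]]
    print(f"topic {t}: {top}")
```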
ASCR/HEP Exascale Requirements Review Report
This draft report summarizes and details the findings, results, and recommendations derived from the ASCR/HEP Exascale Requirements Review meeting held in June 2015. The main conclusions are as follows. 1) Larger, more capable computing and data facilities are needed to support HEP science goals in all three frontiers: Energy, Intensity, and Cosmic. The demand expected on the 2025 timescale is at least two orders of magnitude greater than what is currently available, and in some cases more. 2) The growth rate of data produced by simulations is overwhelming the ability of both facilities and researchers to store and analyze it. Additional resources and new techniques for data analysis are urgently needed. 3) Data rates and volumes from HEP experimental facilities are likewise straining the ability to store and analyze large and complex datasets. Appropriately configured leadership-class facilities can play a transformational role in enabling scientific discovery from these datasets. 4) Close integration of HPC simulation and data analysis will aid greatly in interpreting results from HEP experiments. Such integration will minimize data movement and facilitate interdependent workflows. 5) Long-range planning between HEP and ASCR will be required to meet HEP's research needs. To make the best use of ASCR HPC resources, the experimental HEP program needs a) an established long-term plan for access to ASCR computational and data resources, b) an ability to map workflows onto HPC resources, c) the ability for ASCR facilities to accommodate workflows run by collaborations that can have thousands of individual members, d) to transition codes to the next-generation HPC platforms that will be available at ASCR facilities, and e) to build up and train a workforce capable of developing and using simulations and analysis to support HEP scientific research on next-generation systems.
Comment: 77 pages, 13 figures; draft report, subject to further revision
Models of everywhere revisited: a technological perspective
The concept of ‘models of everywhere’ was first introduced in the mid-2000s as a means of reasoning about the environmental science of a place. It changes the nature of the underlying modelling process from one in which general model structures are used to one in which modelling becomes a learning process about specific places, in particular capturing the idiosyncrasies of each place. At one level this is a straightforward concept, but at another it is a rich, multi-dimensional conceptual framework involving the following key dimensions: models of everywhere, models of everything, and models at all times, constantly re-evaluated against the most current evidence. This is a compelling approach with the potential to deal with epistemic uncertainties and nonlinearities. However, the approach has not yet been fully utilised or explored. This paper examines the concept of models of everywhere in the light of recent advances in technology. It argues that, when the concept was first proposed, technology was a limiting factor, but that advances in areas such as the Internet of Things, cloud computing and data analytics have since alleviated many of the barriers. Consequently, it is timely to look again at the concept of models of everywhere under practical conditions, as part of a trans-disciplinary effort to tackle the remaining research questions. The paper concludes by identifying the key elements of a research agenda that should underpin such experimentation and deployment.
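One way to make the "constantly re-evaluated against the most current evidence" idea concrete is a simple recursive re-weighting of candidate place-specific models as observations stream in. The sketch below is illustrative only and assumes a Gaussian error model; it is not taken from the paper.

```python
# Illustrative sketch (not from the paper): several candidate model
# structures for one place are re-weighted as each new observation
# arrives, so the locally best model emerges over time.
import math

def reweight(models, weights, x, y_obs, sigma=1.0):
    """One Bayesian-style update: score each model's prediction against
    the newest observation and renormalise the weights."""
    likes = [math.exp(-((m(x) - y_obs) ** 2) / (2 * sigma**2)) for m in models]
    weights = [w * l for w, l in zip(weights, likes)]
    total = sum(weights)
    return [w / total for w in weights]

# Two hypothetical rainfall-runoff relations for a specific catchment.
models = [lambda x: 0.5 * x, lambda x: 0.3 * x + 2.0]
weights = [0.5, 0.5]

for x, y in [(10, 5.2), (20, 9.8), (30, 15.5)]:   # streaming observations
    weights = reweight(models, weights, x, y)
print("posterior model weights:", weights)
```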
Intelligent simulation of coastal ecosystems
Doctoral thesis. Informatics Engineering. Faculdade de Engenharia, Universidade do Porto; Faculdade de Ciência e Tecnologia, Universidade Fernando Pessoa. 201
Support for flexible and transparent distributed computing
Modern distributed computing developed from the traditional supercomputing community rooted firmly
in the culture of batch management. Therefore, the field has been dominated by queuing-based resource
managers and workflow-based job submission environments, where static resource demands needed to be determined and reserved prior to launching executions. This has made it difficult to support resource
environments (e.g. Grid, Cloud) where the available resources as well as the resource requirements
of applications may be both dynamic and unpredictable. This thesis introduces a flexible execution
model where the compute capacity can be adapted to fit the needs of applications as they change during
execution. Resource provision in this model is based on a fine-grained, self-service approach instead
of the traditional one-time, system-level model. The thesis introduces a middleware-based Application Agent (AA) that provides a platform for applications to dynamically interact and negotiate resources
with the underlying resource infrastructure.
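A hypothetical sketch of what such an agent's negotiation step might look like; the class and method names below are illustrative assumptions, not the thesis's actual API.

```python
# Hypothetical sketch: an Application Agent lets a running application
# renegotiate resources instead of reserving a static allocation up front.
from dataclasses import dataclass

@dataclass
class Allocation:
    hosts: list          # hosts currently granted to the application
    cpu_share: float     # share of demand met per granted host

class ApplicationAgent:
    def __init__(self, infrastructure):
        self.infra = infrastructure            # underlying resource manager
        self.allocation = Allocation([], 0.0)

    def negotiate(self, demand_cpus):
        """Grow or shrink the allocation to fit the application's current
        demand via fine-grained, self-service requests."""
        granted = self.infra.request(demand_cpus)
        self.allocation = Allocation(granted, demand_cpus / max(len(granted), 1))
        return self.allocation

class FakeInfrastructure:
    """Stand-in resource manager that always grants the request."""
    def request(self, n):
        return [f"host{i}" for i in range(int(n))]

agent = ApplicationAgent(FakeInfrastructure())
print(agent.negotiate(4))   # application raises its demand mid-execution
```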
We also consider the issue of transparency, i.e., hiding the provision and management of the distributed environment. This is key to attracting the public to use the technology. The AA not only replaces the user-controlled process of preparing and executing an application with a transparent, software-controlled process, it also hides the complexity of selecting the right resources to ensure execution QoS. This service
is provided by an On-line Feedback-based Automatic Resource Configuration (OAC) mechanism cooperating
with the flexible execution model. The AA constantly monitors utility-based feedback from the
application during execution and thus is able to learn its behaviour and resource characteristics. This
allows it to automatically compose the most efficient execution environment on the fly and satisfy any
execution requirements defined by users. Two policies are introduced to supervise the information learning
and resource tuning in the OAC. The Utility Classification policy classifies hosts according to their
historical performance contributions to the application. According to this classification, the AA chooses high-utility hosts and withdraws low-utility hosts to configure an optimum environment. The Desired
Processing Power Estimation (DPPE) policy dynamically configures the execution environment according
to the estimated desired total processing power needed to satisfy users’ execution requirements.
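A minimal sketch of the Utility Classification idea as described above: rank hosts by average historical utility contribution and retain the top fraction. The utility measure and the keep threshold are assumptions, since the abstract does not give the exact definitions.

```python
# Hedged sketch of utility-based host classification. The 50% keep
# fraction is an illustrative assumption, not the thesis's policy.
def classify_hosts(history, keep_fraction=0.5):
    """history: {host: [per-interval utility contributions]}.
    Returns (high_utility_hosts, low_utility_hosts)."""
    avg = {h: sum(u) / len(u) for h, u in history.items()}
    ranked = sorted(avg, key=avg.get, reverse=True)
    cut = max(1, int(len(ranked) * keep_fraction))
    return ranked[:cut], ranked[cut:]

# Toy utility history: hostB has contributed little and is withdrawn.
history = {"hostA": [0.9, 0.8], "hostB": [0.2, 0.3], "hostC": [0.6, 0.7]}
keep, withdraw = classify_hosts(history)
print("keep:", keep, "| withdraw:", withdraw)
```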
Through the introduction of flexibility and transparency, a user is able to run a dynamic or conventional distributed application anywhere with optimised execution performance, without managing distributed resources. Building on this standalone model, the thesis further introduces a federated resource negotiation framework as a step towards an autonomous, multi-user distributed computing world.