Search CORE

5,004 research outputs found

Mining Knowledge in Astrophysical Massive Data Sets

Author: Brescia
D’Abrusco
D’Abrusco
Fabio Pasian
Giuseppe Longo
Massimo Brescia
Smareglia
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

Modern scientific data mainly consist of huge datasets gathered by a very large number of techniques and stored in very diversified and often incompatible data repositories. More in general, in the e-science environment, it is considered as a critical and urgent requirement to integrate services across distributed, heterogeneous, dynamic "virtual organizations" formed by different resources within a single enterprise. In the last decade, Astronomy has become an immensely data rich field due to the evolution of detectors (plates to digital to mosaics), telescopes and space instruments. The Virtual Observatory approach consists into the federation under common standards of all astronomical archives available worldwide, as well as data analysis, data mining and data exploration applications. The main drive behind such effort being that once the infrastructure will be completed, it will allow a new type of multi-wavelength, multi-epoch science which can only be barely imagined. Data Mining, or Knowledge Discovery in Databases, while being the main methodology to extract the scientific information contained in such MDS (Massive Data Sets), poses crucial problems since it has to orchestrate complex problems posed by transparent access to different computing environments, scalability of algorithms, reusability of resources, etc. In the present paper we summarize the present status of the MDS in the Virtual Observatory and what is currently done and planned to bring advanced Data Mining methodologies in the case of the DAME (DAta Mining & Exploration) project.Comment: Pages 845-849 1rs International Conference on Frontiers in Diagnostics Technologie

arXiv.org e-Print Archive

Crossref

Archivio della ricerca - Università degli studi di Napoli Federico II

Hybrid ant colony system and genetic algorithm approach for scheduling of jobs in computational grid

Author: Alobaedy Mustafa Muwafak
Ku-Mahamud Ku Ruhana
Publication venue: 'Maxwell Scientific Publication Corp.'
Publication date: 01/01/2015
Field of study

Metaheuristic algorithms have been used to solve scheduling problems in grid computing.However, stand-alone metaheuristic algorithms do not always show good performance in every problem instance. This study proposes a high level hybrid approach between ant colony system and genetic algorithm for job scheduling in grid computing.The proposed approach is based on a high level hybridization.The proposed hybrid approach is evaluated using the static benchmark problems known as ETC matrix.Experimental results show that the proposed hybridization between the two algorithms outperforms the stand-alone algorithms in terms of best and average makespan values

UUM Repository

Discrete and fuzzy dynamical genetic programming in the XCSF learning classifier system

Author: B Mesot
C Van den Broeck
CA Reiter
E Di Paulo
HP Schwefel
J Di
JE Moody
JL Elman
L Bull
L Glass
Larry Bull
M Sipper
MC Su
N Lemke
PL Lanzi
PL Lanzi
Richard J. Preen
SW Wilson
T Werner
TE Ingerson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

A number of representation schemes have been presented for use within learning classifier systems, ranging from binary encodings to neural networks. This paper presents results from an investigation into using discrete and fuzzy dynamical system representations within the XCSF learning classifier system. In particular, asynchronous random Boolean networks are used to represent the traditional condition-action production system rules in the discrete case and asynchronous fuzzy logic networks in the continuous-valued case. It is shown possible to use self-adaptive, open-ended evolution to design an ensemble of such dynamical systems within XCSF to solve a number of well-known test problems

arXiv.org e-Print Archive

Crossref

UWE Bristol Research Repository

MPICH-G2: A Grid-Enabled Implementation of the Message Passing Interface

Author: Foster I.
Karonis N. T.
Toonen B.
Publication venue
Publication date: 01/01/2002
Field of study

Application development for distributed computing "Grids" can benefit from tools that variously hide or enable application-level management of critical aspects of the heterogeneous environment. As part of an investigation of these issues, we have developed MPICH-G2, a Grid-enabled implementation of the Message Passing Interface (MPI) that allows a user to run MPI programs across multiple computers, at the same or different sites, using the same commands that would be used on a parallel computer. This library extends the Argonne MPICH implementation of MPI to use services provided by the Globus Toolkit for authentication, authorization, resource allocation, executable staging, and I/O, as well as for process creation, monitoring, and control. Various performance-critical operations, including startup and collective operations, are configured to exploit network topology information. The library also exploits MPI constructs for performance management; for example, the MPI communicator construct is used for application-level discovery of, and adaptation to, both network topology and network quality-of-service mechanisms. We describe the MPICH-G2 design and implementation, present performance results, and review application experiences, including record-setting distributed simulations.Comment: 20 pages, 8 figure

arXiv.org e-Print Archive

CiteSeerX

21st Century Simulation: Exploiting High Performance Computing and Data Analysis

Author: Baer Garth D.
Davis Dan M.
Gottschalk Thomas D.
Publication venue
Publication date: 01/12/2004
Field of study

This paper identifies, defines, and analyzes the limitations imposed on Modeling and Simulation by outmoded paradigms in computer utilization and data analysis. The authors then discuss two emerging capabilities to overcome these limitations: High Performance Parallel Computing and Advanced Data Analysis. First, parallel computing, in supercomputers and Linux clusters, has proven effective by providing users an advantage in computing power. This has been characterized as a ten-year lead over the use of single-processor computers. Second, advanced data analysis techniques are both necessitated and enabled by this leap in computing power. JFCOM's JESPP project is one of the few simulation initiatives to effectively embrace these concepts. The challenges facing the defense analyst today have grown to include the need to consider operations among non-combatant populations, to focus on impacts to civilian infrastructure, to differentiate combatants from non-combatants, and to understand non-linear, asymmetric warfare. These requirements stretch both current computational techniques and data analysis methodologies. In this paper, documented examples and potential solutions will be advanced. The authors discuss the paths to successful implementation based on their experience. Reviewed technologies include parallel computing, cluster computing, grid computing, data logging, OpsResearch, database advances, data mining, evolutionary computing, genetic algorithms, and Monte Carlo sensitivity analyses. The modeling and simulation community has significant potential to provide more opportunities for training and analysis. Simulations must include increasingly sophisticated environments, better emulations of foes, and more realistic civilian populations. Overcoming the implementation challenges will produce dramatically better insights, for trainees and analysts. High Performance Parallel Computing and Advanced Data Analysis promise increased understanding of future vulnerabilities to help avoid unneeded mission failures and unacceptable personnel losses. The authors set forth road maps for rapid prototyping and adoption of advanced capabilities. They discuss the beneficial impact of embracing these technologies, as well as risk mitigation required to ensure success

Caltech Authors

A Taxonomy of Data Grids for Distributed Data Sharing, Management and Processing

Author: Buyya Rajkumar
Ramamohanarao Kotagiri
Venugopal Srikumar
Publication venue
Publication date: 10/06/2005
Field of study

Data Grids have been adopted as the platform for scientific communities that need to share, access, transport, process and manage large data collections distributed worldwide. They combine high-end computing technologies with high-performance networking and wide-area storage management techniques. In this paper, we discuss the key concepts behind Data Grids and compare them with other data sharing and distribution paradigms such as content delivery networks, peer-to-peer networks and distributed databases. We then provide comprehensive taxonomies that cover various aspects of architecture, data transportation, data replication and resource allocation and scheduling. Finally, we map the proposed taxonomy to various Data Grid systems not only to validate the taxonomy but also to identify areas for future exploration. Through this taxonomy, we aim to categorise existing systems to better understand their goals and their methodology. This would help evaluate their applicability for solving similar problems. This taxonomy also provides a "gap analysis" of this area through which researchers can potentially identify new issues for investigation. Finally, we hope that the proposed taxonomy and mapping also helps to provide an easy way for new practitioners to understand this complex area of research.Comment: 46 pages, 16 figures, Technical Repor

arXiv.org e-Print Archive

CiteSeerX

University of Melbourne Institutional Repository

Simulation of networks of spiking neurons: A review of tools and strategies

Author: Beeman D.
Boustani S. El
Bower J. M.
Brette R.
Carnevale T.
Davison A. P.
Destexhe A.
Diesmann M.
Djurfeldt M.
Ermentrout B.
Goodman P. H.
Harris Jr. F. C.
Hines M.
Lansner A.
Morrison A.
Muller E.
Natschlager T.
Pecevski D.
Rochel O.
Rudolph M.
Vieville T.
Zirpe M.
Publication venue
Publication date: 01/01/2007
Field of study

We review different aspects of the simulation of spiking neural networks. We start by reviewing the different types of simulation strategies and algorithms that are currently implemented. We next review the precision of those simulation strategies, in particular in cases where plasticity depends on the exact timing of the spikes. We overview different simulators and simulation environments presently available (restricted to those freely available, open source and documented). For each simulation tool, its advantages and pitfalls are reviewed, with an aim to allow the reader to identify which simulator is appropriate for a given task. Finally, we provide a series of benchmark simulations of different types of networks of spiking neurons, including Hodgkin-Huxley type, integrate-and-fire models, interacting with current-based or conductance-based synapses, using clock-driven or event-driven integration strategies. The same set of models are implemented on the different simulators, and the codes are made available. The ultimate goal of this review is to provide a resource to facilitate identifying the appropriate integration strategy and simulation tool to use for a given modeling problem related to spiking neural networks.Comment: 49 pages, 24 figures, 1 table; review article, Journal of Computational Neuroscience, in press (2007

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PubMed Central

Juelich Shared Electronic Resources

HAL-Ecole des Ponts ParisTech

Proceedings of Abstracts Engineering and Computer Science Research Conference 2019

Author: Adams Roderick
Amafabia Daerefa-a
Barker Trevor
Beka Nathan
Bhavsar Ronakben
Bonivart Agnes
Canoville Paul
Cañamero Lola
CHEN Yong Kang
CHEN Yong Kang
Chrysanthou Andreas
Counsell Nathan
Crook Brian
Davey Neil
David-West Opukuro
Denai Mouloud
Dhakal Hom
Drix Damien
Goncharenko Julia
Grasso Marzio
Hafner Verena Vanessa
Hall Samantha
Haritos George
Hassan Eheda
Helian Na
Herfatmanesh Mohammad Reza
Ismail Sikiru O.
Johnston Ian
Johnston Ian
Kadir Shabnam
Kaye Richard
Khan Imran
Kirner Raimund
Kirner Raimund
Klaholz Ingo
Klusak Jan
Kourtessis Pandelis
Lane Peter
Lekkala Himayasri Rao
Lilley Mariana
Mayor David
McCluskey Daniel
McCluskey Daniel
Menon Catherine
Metzner Christoph
Miko Rebecca
Montalvão Diogo
Mporas Iosif
Munro Ian
Nehaniv Chrystopher
Newman James
Nwawe Richard
Panday Deepak
Partou Helen
Pissanidis Georgios
Polani Daniel
Ren Guogang
Robinson Matthew
Rosiello Vincenzo
Sayers Paul
Schilstra Maria
Schirmer Pascal
Schmuker Michael
Siadati Rana
Sinha Ankur
Skaltsas Grigorios
Steffert Tony
Steuber Volker
Steuber Volker
Suckow Bjorn
Sun Yi
Sun Yichuang
Sunmola Funlade
Sutton Samuel
te Boekhorst Rene
Toffe Gilles
Tracey Mark
Tveretina Olga
Veneziano Vito
Verma Alok
Wang Yuan
Wernick Paul
Publication venue: University of Hertfordshire
Publication date: 01/09/2019
Field of study

© 2019 The Author(s). This is an open-access work distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. For further details please see https://creativecommons.org/licenses/by/4.0/. Note: Keynote: Fluorescence visualisation to evaluate effectiveness of personal protective equipment for infection control is © 2019 Crown copyright and so is licensed under the Open Government Licence v3.0. Under this licence users are permitted to copy, publish, distribute and transmit the Information; adapt the Information; exploit the Information commercially and non-commercially for example, by combining it with other Information, or by including it in your own product or application. Where you do any of the above you must acknowledge the source of the Information in your product or application by including or linking to any attribution statement specified by the Information Provider(s) and, where possible, provide a link to this licence: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/This book is the record of abstracts submitted and accepted for presentation at the Inaugural Engineering and Computer Science Research Conference held 17th April 2019 at the University of Hertfordshire, Hatfield, UK. This conference is a local event aiming at bringing together the research students, staff and eminent external guests to celebrate Engineering and Computer Science Research at the University of Hertfordshire. The ECS Research Conference aims to showcase the broad landscape of research taking place in the School of Engineering and Computer Science. The 2019 conference was articulated around three topical cross-disciplinary themes: Make and Preserve the Future; Connect the People and Cities; and Protect and Care

University of Hertfordshire Research Archive

Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis

Author: Ben-Nun Tal
Hoefler Torsten
Publication venue
Publication date: 15/09/2018
Field of study

Deep Neural Networks (DNNs) are becoming an important tool in modern computing applications. Accelerating their training is a major challenge and techniques range from distributed algorithms to low-level circuit design. In this survey, we describe the problem from a theoretical perspective, followed by approaches for its parallelization. We present trends in DNN architectures and the resulting implications on parallelization strategies. We then review and model the different types of concurrency in DNNs: from the single operator, through parallelism in network inference and training, to distributed deep learning. We discuss asynchronous stochastic optimization, distributed system architectures, communication schemes, and neural architecture search. Based on those approaches, we extrapolate potential directions for parallelism in deep learning

arXiv.org e-Print Archive

Repository for Publications and Research Data