13,507 research outputs found

Data as a Service (DaaS) for sharing and processing of large data collections in the cloud

Author: Bucci Enrico
Ruiu Pietro
Terzo Olivier
Xhafa Fatos
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013

Data as a Service (DaaS) is among the latest kinds of services being investigated in the Cloud computing community. The main aim of DaaS is to overcome limitations of state-of-the-art approaches in data technologies, according to which data is stored and accessed from repositories whose location is known and is relevant for sharing and processing. Besides limiting data sharing, current approaches also fail to fully separate/decouple software services from data and thus limit interoperability. In this paper we propose a DaaS approach for intelligent sharing and processing of large data collections, with the aim of abstracting the data location (by making it relevant to the needs of sharing and accessing) and fully decoupling the data from its processing. The aim of our approach is to build a Cloud computing platform offering DaaS to support large communities of users that need to share, access, and process data in order to collectively build knowledge from it. We exemplify the approach with large data collections from the health and biology domains. Peer reviewed. Postprint (author's final draft).

UPCommons. Portal del coneixement obert de la UPC

BioCloud Search EnGene: Surfing Biological Data on the Cloud

Author: DESSI NICOLETTA
MILIA GABRIELE
Pascariello E
PES BARBARA
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

The massive production and spread of biomedical data around the web introduces new challenges in identifying computational approaches that provide quality search and browsing of web resources. This paper presents BioCloud Search EnGene (BSE), a cloud application that facilitates searching and integration of the many layers of biological information offered by large-scale public genomic repositories. Grounded in the concept of dataspace, BSE is built on top of a cloud platform that severely curtails issues associated with scalability and performance. Like popular online gene portals, BSE adopts a gene-centric approach: researchers can find the information they need by means of a simple “Google-like” query interface that accepts standard gene identifiers as keywords. We present the BSE architecture and functionality and discuss how our strategies contribute to successfully tackling big data problems in querying gene-based web resources. BSE is publicly available at: http://biocloud-unica.appspot.com/

Archivio istituzionale della ricerca - Università di Cagliari

From access and integration to mining of secure genomic data sets across the grid

Author: Sinnott R.O.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2007

The BRIDGES project (Biomedical Research Informatics Delivered by Grid Enabled Services), funded by the UK Department of Trade and Industry (DTI), has developed a Grid infrastructure to support cardiovascular research. This includes the provision of a compute Grid and a data Grid infrastructure with security at its heart. In this paper we focus on the BRIDGES data Grid. A primary aim of the BRIDGES data Grid is to help control the complexity of accessing and integrating a myriad of genomic data sets through simple Grid-based tools. We outline these tools and how they are delivered to end-user scientists. We also describe how these tools are to be extended in the BBSRC-funded Grid Enabled Microarray Expression Profile Search (GEMEPS) project to provide a richer vocabulary of search capabilities for mining microarray data sets. As with BRIDGES, fine-grained Grid security underpins GEMEPS.

Enlighten

University of Melbourne Institutional Repository

Challenges of ultra large scale integration of biomedical computing systems

Author: Begent R.
Brady J.M.
Finkelstein A.
Gavaghan D.
Kerr P.
Parkinson H.
Reddington F.
Wilkinson J.M.
Publication venue: IEEE Computer Society Press
Publication date: 01/01/2005

The NCRI Informatics Initiative is overseeing the implementation of an informatics framework for the UK cancer research community. The framework advocates an integrated multidisciplinary method of working between scientific and medical communities. Key to this process is community adoption of high-quality acquisition, storage, sharing and integration of diverse data elements to improve knowledge of the causes, prevention and treatment of cancer. The integration of the complex data and metadata used by these multiple communities is a significant challenge, and there are technical, resource-based and sociological issues to be addressed. In this paper we review progress aimed at establishing the framework and outline key challenges in ultra large scale integration of biomedical computing systems.

City Research Online

UCL Discovery

Data access and integration in the ISPIDER proteomics grid

Author: C.A. Goble
E.M. Zdobnov
J. Smith
L.M. Haas
M. Antonioletti
M. Maibaum
P. Buneman
P. McBrien
R.G.G. Cattell
S. Bowers
S. Durinck
S.B. Davidson
T.M. Oinn
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006

Grid computing has great potential for supporting the integration of complex, fast-changing biological data repositories to enable distributed data analysis. One scenario where Grid computing has such potential is provided by proteomics resources, which are rapidly being developed with the emergence of affordable, reliable methods to study the proteome. The protein identifications arising from these methods derive from multiple repositories, which need to be integrated to enable uniform access to them. A number of technologies exist that enable these resources to be accessed in a Grid environment, but the independent development of these resources means that significant data integration challenges, such as heterogeneity and schema evolution, have to be met. This paper presents an architecture that combines Grid data access (OGSA-DAI), Grid distributed querying (OGSA-DQP) and data integration (AutoMed) software tools to support distributed data analysis. We discuss the application of this architecture to the integration of several autonomous proteomics data resources.

CiteSeerX

Crossref

Birkbeck Institutional Research Online

The University of Manchester - Institutional Repository

FABRIC: A National-Scale Programmable Experimental Network Infrastructure

Author: Baldin I
Deelman E
Griffioen J
Lehman T
Monga IIS
Nikolich A
Ruth P
Wang KC
Publication venue: eScholarship, University of California
Publication date: 01/11/2019

FABRIC is a unique national research infrastructure to enable cutting-edge and exploratory research at scale in networking, cybersecurity, distributed computing and storage systems, machine learning, and science applications. It is an everywhere-programmable nationwide instrument composed of novel extensible network elements equipped with large amounts of compute and storage, interconnected by high-speed, dedicated optical links. It will connect a number of specialized testbeds for cloud research (NSF Cloud testbeds CloudLab and Chameleon) and for research beyond 5G technologies (Platforms for Advanced Wireless Research, or PAWR), as well as production high-performance computing facilities and science instruments, to create a rich fabric for a wide variety of experimental activities.

eScholarship - University of California