Search CORE

3,031 research outputs found

High-performance integrated virtual environment (HIVE) tools and applications for big data analysis

Author: Mazumder Raja
Simonyan Vahan
Publication venue: Health Sciences Research Commons
Publication date: 01/01/2014
Field of study

The High-performance Integrated Virtual Environment (HIVE) is a high-throughput cloud-based infrastructure developed for the storage and analysis of genomic and associated biological data. HIVE consists of a web-accessible interface for authorized users to deposit, retrieve, share, annotate, compute and visualize Next-generation Sequencing (NGS) data in a scalable and highly efficient fashion. The platform contains a distributed storage library and a distributed computational powerhouse linked seamlessly. Resources available through the interface include algorithms, tools and applications developed exclusively for the HIVE platform, as well as commonly used external tools adapted to operate within the parallel architecture of the system. HIVE is composed of a flexible infrastructure, which allows for simple implementation of new algorithms and tools. Currently, available HIVE tools include sequence alignment and nucleotide variation profiling tools, metagenomic analyzers, phylogenetic tree-building tools using NGS data, clone discovery algorithms, and recombination analysis algorithms. In addition to tools, HIVE also provides knowledgebases that can be used in conjunction with the tools for NGS sequence and metadata analysis

Directory of Open Access Journals

PubMed Central

George Washington University: Health Sciences Research Commons (HSRC)

DNA sequencing as a tool to monitor marine ecological status

Author: Andrew R. Thompson
Bernardo Duarte
Isabel Caçador
João C. Marques
Kelly D. Goodwin
Luke R. Thompson
Luke R. Thompson
Tim Kahlke
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

© 2017 Goodwin, Thompson, Duarte, Kahlke, Thompson, Marques and Caçador. Many ocean policies mandate integrated, ecosystem-based approaches to marine monitoring, driving a global need for efficient, low-cost bioindicators of marine ecological quality. Most traditional methods to assess biological quality rely on specialized expertise to provide visual identification of a limited set of specific taxonomic groups, a time-consuming process that can provide a narrow view of ecological status. In addition, microbial assemblages drive food webs but are not amenable to visual inspection and thus are largely excluded from detailed inventory. Molecular-based assessments of biodiversity and ecosystem function offer advantages over traditional methods and are increasingly being generated for a suite of taxa using a "microbes to mammals" or "barcodes to biomes" approach. Progress in these efforts coupled with continued improvements in high-throughput sequencing and bioinformatics pave the way for sequence data to be employed in formal integrated ecosystem evaluation, including food web assessments, as called for in the European Union Marine Strategy Framework Directive. DNA sequencing of bioindicators, both traditional (e.g., benthic macroinvertebrates, ichthyoplankton) and emerging (e.g., microbial assemblages, fish via eDNA), promises to improve assessment of marine biological quality by increasing the breadth, depth, and throughput of information and by reducing costs and reliance on specialized taxonomic expertise

Aquila Digital Community

OPUS - University of Technology Sydney

Directory of Open Access Journals

Frontiers - Publisher Connector

Estudo Geral

Universidade de Lisboa: Repositório.UL

Food Microbiol

Author
Publication venue
Publication date
Field of study

Next Generation Sequencing (NGS) combined with powerful bioinformatic approaches are revolutionising food microbiology. Whole genome sequencing (WGS) of single isolates allows the most detailed comparison possible hitherto of individual strains. The two principle approaches for strain discrimination, single nucleotide polymorphism (SNP) analysis and genomic multi-locus sequence typing (MLST) are showing concordant results for phylogenetic clustering and are complementary to each other. Metabarcoding and metagenomics, applied to total DNA isolated from either food materials or the production environment, allows the identification of complete microbial populations. Metagenomics identifies the entire gene content and when coupled to transcriptomics or proteomics, allows the identification of functional capacity and biochemical activity of microbial populations. The focus of this review is on the recent use and future potential of NGS in food microbiology and on current challenges. Guidance is provided for new users, such as public health departments and the food industry, on the implementation of NGS and how to critically interpret results and place them in a broader context. The review aims to promote the broader application of NGS technologies within the food industry as well as highlight knowledge gaps and novel applications of NGS with the aim of driving future research and increasing food safety outputs from its wider use.CC999999/Intramural CDC HHS/United States2019-06-01T00:00:00Z30621881PMC64922637184vault:3458

CDC Stacks

Systems Biology Knowledgebase for a New Era in Biology A Genomics:GTL Report from the May 2008 Workshop

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref

On-premise containerized, light-weight software solutions for Biomedicine

Author: Le Duc Huy
Publication venue
Publication date: 01/01/2023
Field of study

Bioinformatics software systems are critical tools for analysing large-scale biological data, but their design and implementation can be challenging due to the need for reliability, scalability, and performance. This thesis investigates the impact of several software approaches on the design and implementation of bioinformatics software systems. These approaches include software patterns, microservices, distributed computing, containerisation and container orchestration. The research focuses on understanding how these techniques affect bioinformatics software systems’ reliability, scalability, performance, and efficiency. Furthermore, this research highlights the challenges and considerations involved in their implementation. This study also examines potential solutions for implementing container orchestration in bioinformatics research teams with limited resources and the challenges of using container orchestration. Additionally, the thesis considers microservices and distributed computing and how these can be optimised in the design and implementation process to enhance the productivity and performance of bioinformatics software systems. The research was conducted using a combination of software development, experimentation, and evaluation. The results show that implementing software patterns can significantly improve the code accessibility and structure of bioinformatics software systems. Specifically, microservices and containerisation also enhanced system reliability, scalability, and performance. Additionally, the study indicates that adopting advanced software engineering practices, such as model-driven design and container orchestration, can facilitate efficient and productive deployment and management of bioinformatics software systems, even for researchers with limited resources. Overall, we develop a software system integrating all our findings. Our proposed system demonstrated the ability to address challenges in bioinformatics. The thesis makes several key contributions in addressing the research questions surrounding the design, implementation, and optimisation of bioinformatics software systems using software patterns, microservices, containerisation, and advanced software engineering principles and practices. Our findings suggest that incorporating these technologies can significantly improve bioinformatics software systems’ reliability, scalability, performance, efficiency, and productivity.Bioinformatische Software-Systeme stellen bedeutende Werkzeuge für die Analyse umfangreicher biologischer Daten dar. Ihre Entwicklung und Implementierung kann jedoch aufgrund der erforderlichen Zuverlässigkeit, Skalierbarkeit und Leistungsfähigkeit eine Herausforderung darstellen. Das Ziel dieser Arbeit ist es, die Auswirkungen von Software-Mustern, Microservices, verteilten Systemen, Containerisierung und Container-Orchestrierung auf die Architektur und Implementierung von bioinformatischen Software-Systemen zu untersuchen. Die Forschung konzentriert sich darauf, zu verstehen, wie sich diese Techniken auf die Zuverlässigkeit, Skalierbarkeit, Leistungsfähigkeit und Effizienz von bioinformatischen Software-Systemen auswirken und welche Herausforderungen mit ihrer Konzeptualisierungen und Implementierung verbunden sind. Diese Arbeit untersucht auch potenzielle Lösungen zur Implementierung von Container-Orchestrierung in bioinformatischen Forschungsteams mit begrenzten Ressourcen und die Einschränkungen bei deren Verwendung in diesem Kontext. Des Weiteren werden die Schlüsselfaktoren, die den Erfolg von bioinformatischen Software-Systemen mit Containerisierung, Microservices und verteiltem Computing beeinflussen, untersucht und wie diese im Design- und Implementierungsprozess optimiert werden können, um die Produktivität und Leistung bioinformatischer Software-Systeme zu steigern. Die vorliegende Arbeit wurde mittels einer Kombination aus Software-Entwicklung, Experimenten und Evaluation durchgeführt. Die erzielten Ergebnisse zeigen, dass die Implementierung von Software-Mustern, die Zuverlässigkeit und Skalierbarkeit von bioinformatischen Software-Systemen erheblich verbessern kann. Der Einsatz von Microservices und Containerisierung trug ebenfalls zur Steigerung der Zuverlässigkeit, Skalierbarkeit und Leistungsfähigkeit des Systems bei. Darüber hinaus legt die Arbeit dar, dass die Anwendung von SoftwareEngineering-Praktiken, wie modellgesteuertem Design und Container-Orchestrierung, die effiziente und produktive Bereitstellung und Verwaltung von bioinformatischen Software-Systemen erleichtern kann. Zudem löst die Implementierung dieses SoftwareSystems, Herausforderungen für Forschungsgruppen mit begrenzten Ressourcen. Insgesamt hat das System gezeigt, dass es in der Lage ist, Herausforderungen im Bereich der Bioinformatik zu bewältigen und stellt somit ein wertvolles Werkzeug für Forscher in diesem Bereich dar. Die vorliegende Arbeit leistet mehrere wichtige Beiträge zur Beantwortung von Forschungsfragen im Zusammenhang mit dem Entwurf, der Implementierung und der Optimierung von Software-Systemen für die Bioinformatik unter Verwendung von Prinzipien und Praktiken der Softwaretechnik. Unsere Ergebnisse deuten darauf hin, dass die Einbindung dieser Technologien die Zuverlässigkeit, Skalierbarkeit, Leistungsfähigkeit, Effizienz und Produktivität bioinformatischer Software-Systeme erheblich verbessern kann

Institutional Repository of the Freie Universität Berlin

Taking hospital pathogen surveillance to the next level

Author: Couto Natacha
Feil Edward J.
Friedrich Alexander W.
Hegstad Kristin
Howden Benjamin P.
Novais Angela
Reuter Sandra
Werner Guido
Publication venue: Microbiology Society
Publication date: 26/04/2023
Field of study

High-throughput bacterial genomic sequencing and subsequent analyses can produce large volumes of high-quality data rapidly. Advances in sequencing technology, with commensurate developments in bioinformatics, have increased the speed and efficiency with which it is possible to apply genomics to outbreak analysis and broader public health surveillance. This approach has been focused on targeted pathogenic taxa, such as Mycobacteria, and diseases corresponding to different modes of transmission, including food-and-water-borne diseases (FWDs) and sexually transmitted infections (STIs). In addition, major healthcare-associated pathogens such as methicillin-resistant Staphylococcus aureus, vancomycin-resistant enterococci and carbapenemase-producing Klebsiella pneumoniae are the focus of research projects and initiatives to understand transmission dynamics and temporal trends on both local and global scales. Here, we discuss current and future public health priorities relating to genome-based surveillance of major healthcare-associated pathogens. We highlight the specific challenges for the surveillance of healthcare-associated infections (HAIs), and how recent technical advances might be deployed most effectively to mitigate the increasing public health burden they cause

Munin - Open Research Archive

Nanoinformatics 2010 Program

Author: Baker Nathan A
Chaka Anne
Cohen Yoram
Colvin Vicki
Fritts Martin
Geraci Charles L.
Hoover Mark D
Ku Sharon
Kulinowski Kristen M
Lippell Phil
Luo James
McLennan Michael
Morse Jeffrey
Ostraat Michele L
Rajan Krishna
Reznik-Zellen Rebecca
Schad Peter
Tuominen Mark T.
Publication venue
Publication date: 01/11/2010
Field of study

InterNano Nanomanufacturing Repository

Recommended from our members

New Frontiers for Organismal Biology

Author: Albertson Craig
Carey Hannah V.
Clayton David F.
Cummings Molly E.
Dewar Ken
Edwards Scott V.
Gross Louis J.
Hofmann Hans A.
Kingsolver Joel G.
Kültz Dietmar
Meaney Michael J.
Robinson Gene E.
Schlinger Barney A.
Shingleton Alexander W.
Sokolowski Marla B.
Somero George N.
Stanzione Daniel C.
Todgham Anne E.
Publication venue: 'University of California Press'
Publication date: 17/06/2014
Field of study

Understanding how complex organisms function as integrated units that constantly interact with their environment is a long-standing challenge in biology. To address this challenge, organismal biology reveals general organizing principles of physiological systems and behavior—in particular, in complex multicellular animals. Organismal biology also focuses on the role of individual variability in the evolutionary maintenance of diversity. To broadly advance these frontiers, cross-compatibility of experimental designs, methodological approaches, and data interpretation pipelines represents a key prerequisite. It is now possible to rapidly and systematically analyze complete genomes to elucidate genetic variation associated with traits and conditions that define individuals, populations, and species. However, genetic variation alone does not explain the varied individual physiology and behavior of complex organisms. We propose that such emergent properties of complex organisms can best be explained through a renewed emphasis on the context and life-history dependence of individual phenotypes to complement genetic data.Organismic and Evolutionary Biolog

Harvard University - DASH

Integrating Open Data on Cancer in Support to Tumor Growth Analysis

Author: Cemernek David
Holzinger Andreas
Jean-Quartier Claire
Jeanquartier Fleur
Schreck Tobias
Publication venue
Publication date: 01/01/2016
Field of study

TUGraz OPEN Library