
AIR Università degli Studi di Milano

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Bari

Archivio Istituzionale della Ricerca - Università del Piemonte Orientale

University of Queensland eSpace

Community-driven development for computational biology at Sprints, Hackathons and Codefests

Author: Afgan Enis
Banck Michael
Bonnal Raoul JP
Booth Timothy
Chapman Brad A
Chilton John
Cock Peter JA
Guimera Roman Valls
Gumbel Markus
Harris Nomi
Holland Richard
Kaján László
Kalaš Matúš
Katayama Toshiaki
Kibukawa Eri
Möller Steffen
Powell David R
Prins Pjotr
Quinn Jacqueline
Sallou Olivier
Seemann Torsten
Sloggett Clare
Soiland-Reyes Stian
Spooner William
Steinbiss Sascha
Strozzi Francesco
Tille Andreas
Travis Anthony J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Background: Computational biology comprises a wide range of technologies and approaches. Multiple technologies can be combined to create more powerful workflows if the individuals contributing the data or providing tools for its interpretation can find mutual understanding and consensus. Much conversation and joint investigation are required in order to identify and implement the best approaches. Traditionally, scientific conferences feature talks presenting novel technologies or insights, followed up by informal discussions during coffee breaks. In multi-institution collaborations, in order to reach agreement on implementation details or to transfer deeper insights into a technology and practical skills, a representative of one group typically visits the other. However, this does not scale well when the number of technologies or research groups is large. Conferences have responded to this issue by introducing Birds-of-a-Feather (BoF) sessions, which offer an opportunity for individuals with common interests to intensify their interaction. However, parallel BoF sessions often make it hard for participants to join multiple BoFs and find common ground between the different technologies, and BoFs are generally too short to allow time for participants to program together.
Results: This report summarises our experience with computational biology Codefests, Hackathons and Sprints, which are interactive developer meetings. They are structured to reduce the limitations of traditional scientific meetings described above by strengthening the interaction among peers and letting the participants determine the schedule and topics. These meetings are commonly run as loosely scheduled "unconferences" (self-organized identification of participants and topics for meetings) over at least two days, with early introductory talks to welcome and organize contributors, followed by intensive collaborative coding sessions. We summarise some prominent achievements of those meetings and describe differences in how these are organised, how their audience is addressed, and their outreach to their respective communities.
Conclusions: Hackathons, Codefests and Sprints share a stimulating atmosphere that encourages participants to jointly brainstorm and tackle problems of shared interest in a self-driven proactive environment, as well as providing an opportunity for new participants to get involved in collaborative projects.

Aberdeen University Research

University of Bergen

Harvard University - DASH

The University of Manchester - Institutional Repository

NORA - Norwegian Open Research Archives

University of Melbourne Institutional Repository

NERC Open Research Archive

Characterization of Nucleotide Misincorporation Patterns in the Iceman's Mitochondrial DNA

Author: A Cooper
A Helgason
AJ Hansen
AW Briggs
Cristina Olivieri
E Willerslev
EC Friedberg
Ermanno Rizzi
Franco Rollo
G Di Bernardo
G Eglinton
Gianluca De Bellis
Giorgio Corti
HN Poinar
I Marota
Isolina Marota
J Binladen
JE Frey
JP Noonan
K Spindler
KA Eckert
L Ermini
Luca Ermini
M Banerjeea
M Hofreiter
M Höss
M Margulies
M Stiller
M Ubaldi
M. Thomas P. Gilbert
ML Sampietro
MT Gilbert
MTP Gilbert
O Handt
P Brotherton
R Lamers
Raoul Bonnal
RM Andrews
S Pääbo
Stefania Luciani
T Lindahl
TA Brown
TA Hall
Publication venue: Public Library of Science
Publication date: 01/01/2010

BACKGROUND: The degradation of DNA represents one of the main issues in the genetic analysis of archeological specimens. In recent years, a particular kind of post-mortem DNA modification giving rise to nucleotide misincorporation ("miscoding lesions") has been the object of extensive investigations.
METHODOLOGY/PRINCIPAL FINDINGS: To improve our knowledge regarding the nature and incidence of ancient DNA nucleotide misincorporations, we have utilized 6,859 (629,975 bp) mitochondrial (mt) DNA sequences obtained from the 5,350-5,100-year-old, freeze-desiccated human mummy popularly known as the Tyrolean Iceman or Ötzi. To generate the sequences, we have applied a mixed PCR/pyrosequencing procedure that yields particularly high sequence coverage. As a control, we have produced a further 8,982 (805,155 bp) mtDNA sequences from a contemporary specimen using the same system and starting from the same template copy number as the ancient sample. From the analysis of the nucleotide misincorporation rate in ancient, modern, and putative contaminant sequences, we observed that the rate of misincorporation is significantly lower in modern and putative contaminant sequence datasets than in ancient sequences. In contrast, type 2 transitions represent the vast majority (85%) of the observed nucleotide misincorporations in ancient sequences.
CONCLUSIONS/SIGNIFICANCE: This study provides a further contribution to the knowledge of nucleotide misincorporation patterns in DNA sequences obtained from freeze-preserved archeological specimens. In the Iceman system, ancient sequences can be clearly distinguished from contaminants on the basis of nucleotide misincorporation rates. This observation confirms a previous identification of the ancient mummy sequences made on a purely phylogenetic basis. The present investigation provides further indication that the majority of ancient DNA damage is reflected by type 2 (cytosine→thymine/guanine→adenine) transitions and that type 1 transitions are essentially PCR artifacts.
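
The type 1 / type 2 distinction in the abstract above can be illustrated with a small tally over an aligned reference/read pair. This is only a minimal sketch, not the authors' pipeline: the function names are hypothetical, the input is assumed to be two ungapped, equal-length strings, and the type 1 pairs (adenine→guanine/thymine→cytosine) follow the usual convention in the ancient-DNA literature rather than anything stated in the abstract.

```python
from collections import Counter

# Type 2 pairs come from the abstract (C->T / G->A).
# Type 1 pairs (A->G / T->C) are an assumption based on common usage.
TYPE1 = {("A", "G"), ("T", "C")}
TYPE2 = {("C", "T"), ("G", "A")}
BASES = set("ACGT")


def classify_substitution(ref_base, read_base):
    """Label one aligned position (hypothetical helper)."""
    pair = (ref_base.upper(), read_base.upper())
    if pair[0] not in BASES or pair[1] not in BASES:
        return "other"          # gaps, N, ambiguity codes
    if pair[0] == pair[1]:
        return "match"
    if pair in TYPE1:
        return "type1"
    if pair in TYPE2:
        return "type2"
    return "transversion"


def misincorporation_profile(ref, read):
    """Count labels over two ungapped strings of equal length."""
    if len(ref) != len(read):
        raise ValueError("ref and read must have the same length")
    return Counter(classify_substitution(r, o) for r, o in zip(ref, read))


def type2_fraction(counts):
    """Share of type 2 transitions among all observed substitutions."""
    substitutions = counts["type1"] + counts["type2"] + counts["transversion"]
    return counts["type2"] / substitutions if substitutions else 0.0


profile = misincorporation_profile("ACGTCC", "ATGTTA")
print(dict(profile), type2_fraction(profile))
```

On real data the same tally would be run per read against the consensus, and the resulting rates compared between the ancient, modern and putative contaminant datasets.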

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Camerino

Computational pan-genomics: Status, promises and challenges

Author: Abeel T. (Thomas)
Alkan C. (Can)
Baaijens J.A. (Jasmijn)
Bakker P.I.W. (Paul) de
Boeva V. (Valentina)
Bonnal R.J.P. (Raoul)
Chiaromonte F. (Francesca)
Chikhi R. (Rayan)
Ciccarelli F.D. (Francesca)
Cijvat C.P. (Robin)
Datema E. (Erwin)
Dijkstra L.J. (Louis)
Duijn C.M. (Cornelia) van
Dutilh B.E. (Bas)
Eichler E.E. (Evan)
El-Kebir M. (Mohammed)
Ernst C. (Corinna)
Eskin E. (Eleazar)
Garrison E. (Erik)
Ghaffaari A. (Ali)
Guryev V. (Victor)
Kersey P. (Paul)
Klau G.W. (Gunnar)
Kloosterman W.P. (Wigard)
Korbel J.O. (Jan)
Lameijer E.-W. (Eric-Wubbo)
Langmead B. (Benjamin)
Marschall T. (Tobias)
Martin M. (Marcel)
Marz M. (Manja)
Medvedev P. (Paul)
Mu J.C. (John)
Mäkinen V. (Veli)
Neerincx P.B.T. (Pieter)
Novak A.M. (Adam)
Ouwens K. (Klaasjan)
Paten B. (Benedict)
Peterlongo P. (Pierre)
Pisanti N. (Nadia)
Porubsky D. (David)
Rahmann S. (Sven)
Raphael B.J. (Benjamin)
Reinert K. (Knut)
Ridder D. (Dick) de
Ridder J. (Jeroen) de
Rivals E. (Eric)
Sanders A.D. (Ashley)
Schlesner M. (Matthias)
Schulz-Trieglaff O. (Ole)
Schönhuth A. (Alexander)
Sheikhizadeh S. (Siavash)
Shneider C. (Carl)
Smit S. (Sandra)
The Computational Pan-Genomics Consortium
Valenzuela D. (Daniel)
Vandin F. (Fabio)
Wang J. (Jiayin)
Wessels L.F.A. (Lodewyk)
Ye K. (Kai)
Zhang Y. (Ying)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In the case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example of a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations

CWI's Institutional Repository

Erasmus University Digital Repository

miRiadne: a web tool for consistent integration of miRNA nomenclature

Author: Brown
De Candia
Donatella Carpi
Kozomara
Massimiliano Pagani
Raoul J. P. Bonnal
Riccardo L. Rossi
Sergio Abrignani
Valeria Ranzani
Van Peer
Publication venue: 'Oxford University Press (OUP)'

Maastricht University Research Portal

FALDO: a semantic standard for describing the location of nucleotide and protein feature annotation

Author: Baran Joachim
Bolleman Jerven T.
Bonnal Raoul J. P.
Buels Robert
Cock Peter J. A.
Dumontier Michel
Fujisawa Takatomo
Hoehndorf Robert
Katayama Toshiaki
Mungall Christopher J.
Strozzi Francesco
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

BACKGROUND: Nucleotide and protein sequence feature annotations are essential to understand biology on the genomic, transcriptomic, and proteomic level. For querying biological annotations with Semantic Web technologies, there was no standard that described this potentially complex location information as subject-predicate-object triples.
DESCRIPTION: We have developed an ontology, the Feature Annotation Location Description Ontology (FALDO), to describe the positions of annotated features on linear and circular sequences. FALDO can be used to describe nucleotide features in sequence records, protein annotations, and glycan binding sites, among other features in coordinate systems of the aforementioned "omics" areas. Using the same data format to represent sequence positions that are independent of file formats allows us to integrate sequence data from multiple sources and data types. The genome browser JBrowse is used to demonstrate accessing multiple SPARQL endpoints to display genomic feature annotations, as well as protein annotations from UniProt mapped to genomic locations.
CONCLUSIONS: Our ontology allows users to uniformly describe, and potentially merge, sequence annotations from multiple sources. Data sources using FALDO can prospectively be retrieved using federated SPARQL queries against public SPARQL endpoints and/or local private triple stores.
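
To make the "subject-predicate-object triples" in the abstract above concrete, the following sketch expands one feature location into FALDO-style triples held as plain Python tuples. It is an illustration under assumptions, not code from the paper: the `region_triples` helper and the `example.org` URIs are hypothetical, coordinates are kept as Python integers rather than typed RDF literals, and only a handful of FALDO terms (`location`, `Region`, `begin`, `end`, `ExactPosition`, `position`, `reference`, and the strand classes) are used.

```python
FALDO = "http://biohackathon.org/resource/faldo#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def region_triples(feature, reference, begin, end, forward=True):
    """Return (subject, predicate, object) tuples for one feature location.

    feature and reference are URIs chosen by the caller (hypothetical here).
    """
    region = feature + "#region"
    strand = FALDO + ("ForwardStrandPosition" if forward
                      else "ReverseStrandPosition")
    triples = [
        (feature, FALDO + "location", region),
        (region, RDF_TYPE, FALDO + "Region"),
    ]
    # Each end of the region is its own position node, so that begin and
    # end can carry strand and reference-sequence information separately.
    for role, coordinate in (("begin", begin), ("end", end)):
        position = f"{region}-{role}"
        triples += [
            (region, FALDO + role, position),
            (position, RDF_TYPE, FALDO + "ExactPosition"),
            (position, RDF_TYPE, strand),
            (position, FALDO + "position", coordinate),
            (position, FALDO + "reference", reference),
        ]
    return triples


# Hypothetical feature on a hypothetical reference sequence.
for triple in region_triples("http://example.org/gene1",
                             "http://example.org/chr1", 100, 250):
    print(triple)
```

Because every position node names its own reference sequence, locations from different sources can be loaded into one store and queried together without any shared file format.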

eScholarship - University of California

The Ruby UCSC API: accessing the UCSC genome database using Ruby

Author: F Strozzi
H Li
H Mishima
Hiroyuki Mishima
J Aerts
Jan Aerts
Koh-ichiro Yoshiura
N Goto
P Schattner
PA Fujita
R Dowell
Raoul J P Bonnal
RH Ramirez-Gonzalez
RJP Bonnal
The ENCODE Project Consortium
Toshiaki Katayama
WJ Kent
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

Background: The University of California, Santa Cruz (UCSC) genome database is among the most used sources of genomic annotation in human and other organisms. The database offers an excellent web-based graphical user interface (the UCSC genome browser) and several means for programmatic queries. However, a simple application programming interface (API) in a scripting language aimed at the biologist was not yet available. Here, we present the Ruby UCSC API, a library to access the UCSC genome database using Ruby.
Results: The API is designed as a BioRuby plug-in and built on the ActiveRecord 3 framework for the object-relational mapping, making writing SQL statements unnecessary. The current version of the API supports databases of all organisms in the UCSC genome database including human, mammals, vertebrates, deuterostomes, insects, nematodes, and yeast. The API uses the bin index (if available) when querying for genomic intervals. The API also supports genomic sequence queries using locally downloaded *.2bit files that are not stored in the official MySQL database. The API is implemented in pure Ruby and is therefore available in different environments and with different Ruby interpreters (including JRuby).
Conclusions: Assisted by the straightforward object-oriented design of Ruby and ActiveRecord, the Ruby UCSC API will help biologists to query the UCSC genome database programmatically. The API is available through the RubyGem system. Source code and documentation are available at https://github.com/misshie/bioruby-ucsc-api/ under the Ruby license. Feedback and help are provided via the website at http://rubyucscapi.userecho.com/
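
The "bin index" mentioned in the abstract above refers to the hierarchical binning scheme the UCSC genome database uses to speed up interval queries. The sketch below shows the standard five-level scheme in Python; it is not taken from the Ruby UCSC API itself, the function names are hypothetical, and coordinates are assumed to be zero-based, half-open and within the 512 Mb range the standard scheme covers.

```python
# Standard UCSC binning: the finest bins span 128 kb (2**17 bases),
# and each coarser level spans 8 times (2**3) as much.
BIN_OFFSETS = [512 + 64 + 8 + 1, 64 + 8 + 1, 8 + 1, 1, 0]
BIN_FIRST_SHIFT = 17
BIN_NEXT_SHIFT = 3


def bin_from_range(start, end):
    """Smallest bin that fully contains [start, end)."""
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        if start_bin == end_bin:
            return offset + start_bin
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    raise ValueError("range is outside the standard 512 Mb binning scheme")


def overlapping_bins(start, end):
    """All bins, at every level, that may hold features overlapping [start, end)."""
    bins = []
    start_bin = start >> BIN_FIRST_SHIFT
    end_bin = (end - 1) >> BIN_FIRST_SHIFT
    for offset in BIN_OFFSETS:
        bins.extend(range(offset + start_bin, offset + end_bin + 1))
        start_bin >>= BIN_NEXT_SHIFT
        end_bin >>= BIN_NEXT_SHIFT
    return bins


print(bin_from_range(0, 1000))
print(overlapping_bins(0, 1000))
```

An interval query can then restrict the scan to rows whose bin value is in `overlapping_bins(start, end)` before applying the exact start/end comparison, which is why the index is only usable on tables that carry a bin column.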

Nagasaki University's Academic Output SITE: NAOSITE

Directory of Open Access Journals

Institutional Repositories DataBase (IRDB)

Nagasaki university's Academic Output SITE

Whole-Genome Pyrosequencing of an Epidemic Multidrug-Resistant Acinetobacter baumannii Strain Belonging to the European Clone II Group

Author: Bonnal Raoul J. P.
Bordoni Roberta
Carattoli Alessandra
Cassone Antonio
De Bellis Gianluca
Fortini Daniela
Iacono Michele
Imperi Francesco
Sicheritz-Ponten Thomas
Villa Laura
Visca Paolo
Publication venue: American Society for Microbiology (ASM)
Publication date: 01/01/2008

The whole-genome sequence of an epidemic, multidrug-resistant Acinetobacter baumannii strain (strain ACICU) belonging to the European clone II group and carrying the plasmid-mediated blaOXA-58 carbapenem resistance gene was determined. The A. baumannii ACICU genome was compared with the genomes of A. baumannii ATCC 17978 and Acinetobacter baylyi ADP1, with the aim of identifying novel genes related to virulence and drug resistance. A. baumannii ACICU has a single chromosome of 3,904,116 bp (which is predicted to contain 3,758 genes) and two plasmids, pACICU1 and pACICU2, of 28,279 and 64,366 bp, respectively. Genome comparison showed 86.4% synteny with A. baumannii ATCC 17978 and 14.8% synteny with A. baylyi ADP1. A conspicuous number of transporters belonging to different superfamilies was predicted for A. baumannii ACICU. The relative number of transporters was much higher in ACICU than in ATCC 17978 and ADP1 (76.2, 57.2, and 62.5 transporters per Mb of genome, respectively). An antibiotic resistance island, AbaR2, was identified in ACICU and had plausibly evolved by reductive evolution from the AbaR1 island previously described in multiresistant strain A. baumannii AYE. Moreover, 36 putative alien islands (pAs) were detected in the ACICU genome; 24 of these had previously been described in the ATCC 17978 genome, 4 are proposed here for the first time and are present in both ATCC 17978 and ACICU, and 8 are unique to the ACICU genome. Fifteen of the pAs in the ACICU genome encode genes related to drug resistance, including membrane transporters and ex novo acquired resistance genes. These findings provide novel insight into the genetic basis of A. baumannii resistance.