Search CORE

66 research outputs found

DOME: recommendations for supervised machine learning validation in biology

Author: ELIXIR Machine Learning Focus Group
Fishman Dmytro
Garcia Gasulla Dario
Harrow Jennifer
Pollastri Gianluca
Psomopoulos Fotis E.
Titma Tiina
Tosatto Silvio C. E.
Walsh Ian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Supervised machine learning is widely used in biology and deserves more scrutiny. We present a set of community-wide recommendations (DOME) aiming to help establish standards of supervised machine learning validation in biology. Formulated as questions, the DOME recommendations improve the assessment and reproducibility of papers when included as supplementary material.The work of the Machine Learning Focus Group was funded by ELIXIR, the research infrastructure for life-science data. IW was funded by the A*STAR Career Development Award (project no. C210112057) from the Agency for Science, Technology and Research (A*STAR), Singapore. D.F. was supported by Estonian Research Council grants (PRG1095, PSG59 and ERA-NET TRANSCAN-2 (BioEndoCar)); Project No 2014-2020.4.01.16-0271, ELIXIR and the European Regional Development Fund through EXCITE Center of Excellence. S.C.E.T. has received funding from the European Union’s Horizon 2020 research and innovation programme under Marie Skłodowska-Curie Grant agreements No. 778247 and No. 823886, and Italian Ministry of University and Research PRIN 2017 grant 2017483NH8.Peer Reviewed"Article signat per 8 autors més 28 autors/es de l' ELIXIR Machine Learning Focus Group: Emidio Capriotti, Rita Casadio, Salvador Capella-Gutierrez, Davide Cirillo, Alessio Del Conte, Alexandros C. Dimopoulos, Victoria Dominguez Del Angel, Joaquin Dopazo, Piero Fariselli, José Maria Fernández, Florian Huber, Anna Kreshuk, Tom Lenaerts, Pier Luigi Martelli, Arcadi Navarro, Pilib Ó Broin, Janet Piñero, Damiano Piovesan, Martin Reczko, Francesco Ronzano, Venkata Satagopam, Castrense Savojardo, Vojtech Spiwok, Marco Antonio Tangaro, Giacomo Tartari, David Salgado, Alfonso Valencia & Federico Zambelli"Postprint (author's final draft

UPCommons. Portal del coneixement obert de la UPC

RNAcentral: A vision for an international database of RNA sequences

Author: Agrawal Shipra
Bateman Alex
Birney Ewan
Bruford Elspeth A
Bujnicki Janusz M
Cochrane Guy
Cole James R
Dinger Marcel E
Enright Anton J
Gardner Paul P
Gautheret Daniel
Griffiths-Jones Sam
Harrow Jen
Herrero Javier
Holmes Ian H
Huang Hsien-Da
Kelly Krystyna A
Kersey Paul
Kozomara Ana
Lowe Todd M
Marz Manja
Moxon Simon
Pruitt Kim D
Samuelsson Tore
Stadler Peter F
Vilella Albert J
Vogel Jan-Hinnerk
Williams Kelly P
Wright Mathew W
Zwieb Christian
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 23/09/2011
Field of study

During the last decade there has been a great increase in the number of noncoding RNA genes identified, including new classes such as microRNAs and piRNAs. There is also a large growth in the amount of experimental characterization of these RNA components. Despite this growth in information, it is still difficult for researchers to access RNA data, because key data resources for noncoding RNAs have not yet been created. The most pressing omission is the lack of a comprehensive RNA sequence database, much like UniProt, which provides a comprehensive set of protein knowledge. In this article we propose the creation of a new open public resource that we term RNAcentral, which will contain a comprehensive collection of RNA sequences and fill an important gap in the provision of biomedical databases. We envision RNA researchers from all over the world joining a federated RNAcentral network, contributing specialized knowledge and databases. RNAcentral would centralize key data that are currently held across a variety of databases, allowing researchers instant access to a single, unified resource. This resource would facilitate the next generation of RNA research and help drive further discoveries, including those that improve food production and human and animal health. We encourage additional RNA database resources and research groups to join this effort. We aim to obtain international network funding to further this endeavor

Crossref

UCL Discovery

PubMed Central

The University of Manchester - Institutional Repository

University of East Anglia digital repository

Normalized long read RNA sequencing in chicken reveals transcriptome complexity similar to human

Author: A Roberts
Alan L. Archibald
ARR Forrest
B Wang
BT Wilhelm
C Camacho
C Trapnell
D Sharon
David W. Burt
E Nagy
Elizabeth Tseng
ET Wang
H Chang
H Kiyosawa
Ian R. Paton
J Harrow
J Wang
J Zhang
JA Martin
L Kong
L Wang
LB Gardner
LE Maquat
Lel Eory
M Burset
MA Faghihi
MJ Chaisson
O Isken
R Thermann
RC Edgar
Richard I. Kuo
RJ Roberts
S Brogna
S Katayama
S Thomas
SE Abdel-Ghany
SP Gordon
T Derrien
T Hubbard
T Steijger
TD Wu
The UniProt Consortium
V Curwen
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2017
Field of study

Background: Despite the significance of chicken as a model organism, our understanding of the chicken transcriptome is limited compared to human. This issue is common to all non-human vertebrate annotations due to the difficulty in transcript identification from short read RNAseq data. While previous studies have used single molecule long read sequencing for transcript discovery, they did not perform RNA normalization and 5'-cap selection which may have resulted in lower transcriptome coverage and truncated transcript sequences. Results: We sequenced normalised chicken brain and embryo RNA libraries with Pacific Bioscience Iso-Seq. 5' cap selection was performed on the embryo library to provide methodological comparison. From these Iso-Seq sequencing projects, we have identified 60 k transcripts and 29 k genes within the chicken transcriptome. Of these, more than 20 k are novel lncRNA transcripts with ~3 k classified as sense exonic overlapping lncRNA, which is a class that is underrepresented in many vertebrate annotations. The relative proportion of alternative transcription events revealed striking similarities between the chicken and human transcriptomes while also providing explanations for previously observed genomic differences. Conclusions: Our results indicate that the chicken transcriptome is similar in complexity compared to human, and provide insights into other vertebrate biology. Our methodology demonstrates the potential of Iso-Seq sequencing to rapidly expand our knowledge of transcriptomics

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

University of Queensland eSpace

Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells

Author: A Dobin
A Pombo
Adam Frankish
AJ Walhout
Alex Dobin
Alexandre Reymond
Alfonso Valencia
Bryan R. Lajoie
CA Maher
Catherine Ucla
Chenwei Lin
Christelle Borel
CJ McManus
Cédric Howald
D Gordon
DA Jackson
David Martin
E Birney
E Gilboa
EL Sonnhammer
Erica Dumais
F Denoeud
F Ozsolak
G Parra
H Kaessmann
H Li
HM Temin
Ian Bell
J Cocquet
J Dostie
J Harrow
J Houseley
Jacqueline Chrast
JE Collins
Jennifer Harrow
JL Thorvaldsen
Job Dekker
John Stamatoyannopoulos
Jonathan M. Mudge
Jorg Drenkow
Josep Lluís Gelpí
Julien Lagarde
K Kannan
K Salehi-Ashtiani
Kourosh Salehi-Ashtiani
LG Wilming
Lila Ghamsari
M Krzywinski
MA Quail
Marc Vidal
MI Krzywinski
Michael L. Tress
MJ Fullwood
Modesto Orozco
Nynke L. van Berkum
P Akiva
P Kapranov
P Unneberg
Paolo Ribeca
Philipp Kapranov
Philippe Batut
R Durbin
R Khanin
Roderic Guigó
RR Bowman
Ryan R. Murray
S Djebali
S Rozen
Sarah Djebali
SF Altschul
SM Searle
Stylianos E. Antonarakis
SW Roy
Sylvain Foissac
Thomas Preiss
Thomas R. Gingeras
Tim Hubbard
TR Gingeras
Vincent Lacroix
WJ Kent
X Li
X Wu
Xinping Yang
Y Qu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5′ and 3′ transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

Serveur académique lausannois

HAL Descartes

eScholarship@UMMS

UPF Digital Repository

ProdInra

Hal-Diderot

FigShare

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

INRIA a CCSD electronic archive server

PubMed Central

King's Research Portal

Diposit Digital de la Universitat de Barcelona

HAL-Rennes 1

A Simple Standard for Sharing Ontological Mappings (SSSOM).

Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes them hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were done makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM) which addresses these problems by: (i) Introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit. (ii) Defining an easy-to-use simple table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles. (iii) Implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices. (iv) Providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec. Database URL: http://w3id.org/sssom/spec

The Jackson Laboratory: The Mouseion at the JAXlibrary

BioCreative III interactive task: an overview

The BioCreative challenge evaluation is a community-wide effort for evaluating text mining and information extraction systems applied to the biological domain. The biocurator community, as an active user of biomedical literature, provides a diverse and engaged end user group for text mining tools. Earlier BioCreative challenges involved many text mining teams in developing basic capabilities relevant to biological curation, but they did not address the issues of system usage, insertion into the workflow and adoption by curators. Thus in BioCreative III (BC-III), the InterActive Task (IAT) was introduced to address the utility and usability of text mining tools for real-life biocuration tasks. To support the aims of the IAT in BC-III, involvement of both developers and end users was solicited, and the development of a user interface to address the tasks interactively was requested

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

Springer

Springer - Publisher Connector

PubMed Central

ZORA

ART

NORA - Norwegian Open Research Archives

Recommended from our members

Results of the ontology alignment evaluation initiative 2020

Author: Algergawy Alsayed
Amini Reihaneh
Faria Daniel
Fundulaki Irini
Harrow Ian
Hertling Sven
Hitzler Pascal
Jiménez-Ruiz Ernesto
Jonquet Clement
Karam Naouel
Khiat Abderrahmane
Laadhar Amir
Laadhar Amir
Lambrix Patrick
Li Huanyu
Li Ying
Paulheim Heiko
Pesquita Catia
Pour Mina Abd Nikooie
Saveta Tzanina
Shvaiko Pavel
Splendiani Andrea
Thiéblin Élodie
Trojahn Cassia
Vataščinová Jana
Yaman Beyza
Zamazal Ondřej
Zhou Lu
Publication venue: CEUR-WS
Publication date: 01/01/2020
Field of study

The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2020 campaign offered 12 tracks with 36 test cases, and was attended by 19 participants. This paper is an overall presentation of that campaign

City Research Online

Scientific Publications of the University of Toulouse II Le Mirail

MAnnheim DOCument Server

Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.

Author: A Diefenbach
A Goios
A Hodgkins
A Kirby
AD Ewing
Adam Frankish
AG Doran
AL Rasmussen
Anne Czechanski
Anne Ferguson-Smith
Anthony G. Doran
B Paten
B Yalcin
B Yalcin
Beiyuan Fu
Benedict Paten
Binnaz Yalcin
C Durrant
Charles Steward
Chris J. Lelliott
Clayton E. Mathews
Cristina Sisu
Darren W. Logan
David J. Adams
David Thybert
Dent Earl
Dirk-Dominik Dolle
DM Church
DR Schrider
Duncan T. Odom
ED Boyden
EM Simpson
ES Lander
F Bauernfeind
Fabio C. P. Navarro
Fengtang Yang
FY Ideraabdullah
GA Churchill
GA Churchill
GA Taylor
Glen Threadgold
GTEx Consortium.
H Mi
H Zhang
I Sastalla
Ian T. Fiddes
J Flint
J Giordano
J Harrow
J Lilue
JA Beck
JA Weiner
James Gilbert
James Torrance
Jane Loveland
JE French
Jennifer Harrow
Jingtao Lilue
JL Americo
JL Levinsohn
JM Mudge
Joanna Collins
Joel Armstrong
Jonathan Flint
Jonathan Wood
JP Hunn
JT Simpson
K Boroviak
Kerstin Howe
KH Braunewell
Kim Wong
KL Svenson
KM Monroe
Lars Romoth
Laura Reinholdt
LD Shultz
Leo Goodstadt
Lesley Shirley
LL Lanier
LL Liebenauer
LR Saraiva
M Boniotto
M Li
M Stanke
M Stremlau
Marcela Sjoberg-Herrera
Mario Stanke
Mark Diekhans
Mark Gerstein
Mark Thomas
Matt Dunn
ME Dickinson
Mike Quail
Mikhail Kolmogorov
MN Loviglio
Monica Abrudan
MT Ferris
Naomi Park
NH Putnam
O Bustos
O Keller
P Broz
Paul Flicek
Paul Muir
PD Dummer
Petr Danecek
Q Liu
R Luo
Richard Durbin
Richard Mott
Ruth Bennett
S König
Sarah Pelan
SNP Kelada
Son K. Pham
SR Patierno
Stefanie Nachtweide
Stephan Collins
T O’Sullivan
TA Bell
Thomas M. Keane
TM Keane
WC Skarnes
William Chow
Ximena Ibarra-Soria
Y Cai
Z Ye
Z Zhang
Publication venue: Nat Genet
Publication date: 01/10/2018
Field of study

We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development

HAL-uB

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

HAL-Inserm

UCL Discovery

Apollo (Cambridge)

Brunel University Research Archive

Results of the Ontology Alignment Evaluation Initiative 2021

Author: Abd Nikooie Pour Mina
Algergawy Alsayed
Amardeilh Florence
Amini Reihaneh
Fallatah Omaima
Faria Daniel
Fundulaki Irini
Harrow Ian
Hertling Sven
Hitzler Pascal
Huschka Martin
Ibanescu Liliana
Jiménez-Ruiz Ernesto
Karam Naouel
Laadhar Amir
Lambrix Patrick
Li Huanyu
Li Ying
Michel Franck
Nasr Engy
Paulheim Heiko
Pesquita Catia
Portisch Jan
Roussey Catherine
Saveta Tzanina
Shvaiko Pavel
Splendiani Andrea
Trojahn Cássia
Vataščinová Jana
Yaman Beyza
Zamazal Ondrej
Zhou Lu
Publication venue: RWTH Aachen
Publication date: 01/01/2021
Field of study

The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2021 campaign offered 13 tracks and was attended by 21 participants. This paper is an overall presentation of that campaig

MAnnheim DOCument Server

A mixed methods pilot study with a cluster randomized control trial to evaluate the impact of a leadership intervention on guideline implementation in home care nursing

Author: A Donner
A Kitson
A McIntosh
AH Van de Ven
AJ Boulton
AJ Boulton
AM Hutchinson
Ann Tourangeau
B Davies
B Davies
B Littenberg
B McCormack
Barbara Davies
BL Davies
C Closs
C Estabrooks
C Thompson
C Vance
Canadian Diabetes Association (CDA)
Canadian Institutes of Health Research Natural Sciences and Engineering Research Council of Canada Social Sciences and Humanities Research Council of Canada
CB Stetler
CB Stetler
D Harrow
D Havens
D Rutledge
D Stacey
DA Davis
DA Waldman
DA Waldman
DJ Margolis
DK Ciliska
DL Hogan
DS Elenkov
E Lapierre
F Damanpour
G Yukl
G Yukl
GA Yukl
GD Valk
GP Browman
GR Baker
HA De Groot
HKS Laschinger
HL Orsted
I Dackert
I Dackert
Ian D Graham
ID Graham
J Angus
J Gleason Scott
J Logan
J Logan
J Manion
J Royle
J Rycroft-Malone
JA Birke
JL Mills
JM Grimshaw
JM Howell
K Graham
K Lorimer
K Parahoo
Kirsten Woodend
KN Kajermo
L Wallin
L Wu
LE Swayne
M Dobbins
M Miles
M Mumford
M West
MA West
MD Cabana
MK Campbell
N Edwards
N Edwards
N Edwards
Nancy Lefebre
P Leatt
R Grol
R Grol
RC Hoffman
Registered Nurses Association of Ontario
Registered Nurses Association of Ontario Nursing Best Practice Guidelines Project
RG Hamlin
RG Hamlin
RG Sibbald
RH Cosby
RM Bryar
S Camiah
S Dopson
S Hatcher
S Killip
S Redfern
SA Udod
SA Udod
SG Funk
SJ Hartman
SL Tsai
SN Weingart
T Greenhalgh
T Nolan
V Iles
V Iles
W Gifford
WA Gifford
Wendy A Gifford
WJ Burpitt
WR Shadish
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Foot ulcers are a significant problem for people with diabetes. Comprehensive assessments of risk factors associated with diabetic foot ulcer are recommended in clinical guidelines to decrease complications such as prolonged healing, gangrene and amputations, and to promote effective management. However, the translation of clinical guidelines into nursing practice remains fragmented and inconsistent, and a recent homecare chart audit showed less than half the recommended risk factors for diabetic foot ulcers were assessed, and peripheral neuropathy (the most significant predictor of complications) was not assessed at all. Strong leadership is consistently described as significant to successfully transfer guidelines into practice. Limited research exists however regarding which leadership behaviours facilitate and support implementation in nursing. The purpose of this pilot study is to evaluate the impact of a leadership intervention in community nursing on implementing recommendations from a clinical guideline on the nursing assessment and management of diabetic foot ulcers. Methods Two phase mixed methods design is proposed (ISRCTN 12345678). Phase I: Descriptive qualitative to understand barriers to implementing the guideline recommendations, and to inform the intervention. Phase II: Matched pair cluster randomized controlled trial (n = 4 centers) will evaluate differences in outcomes between two implementation strategies. Primary outcome: Nursing assessments of client risk factors, a composite score of 8 items based on Diabetes/Foot Ulcer guideline recommendations. Intervention: In addition to the organization's 'usual' implementation strategy, a 12 week leadership strategy will be offered to managerial and clinical leaders consisting of: a) printed materials, b) one day interactive workshop to develop a leadership action plan tailored to barriers to support implementation; c) three post-workshop teleconferences. Discussion This study will provide vital information on which leadership strategies are well received to facilitate and support guideline implementation. The anticipated outcomes will provide information to assist with effective management of foot ulcers for people with diabetes. By tracking clinical outcomes associated with guideline implementation, health care administrators will be better informed to influence organizational and policy decision-making to support evidence-based quality care. Findings will be useful to inform the design of future multi-centered trials on various clinical topics to enhance knowledge translation for positive outcomes. Trial Registration Current Control Trials ISRCTN0691089

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

PubMed Central