105 research outputs found

FlyBase 101 – the basics of navigating FlyBase

Author: Ashburner
Filion
Graveley
J. Thurmond
Kharchenko
Negre
P. McQuilton
S. E. St. Pierre
Stein
Publication venue: Oxford University Press

FlyBase (http://flybase.org) is the leading database and web portal for genetic and genomic information on the fruit fly Drosophila melanogaster and related fly species. Whether you use the fruit fly as an experimental system or want to apply Drosophila biological knowledge to another field of study, FlyBase can help you successfully navigate the wealth of available Drosophila data. Here, we review the FlyBase web site with novice and less-experienced users of FlyBase in mind and point out recent developments stemming from the availability of genome-wide data from the modENCODE project. The first section of this paper explains the organization of the web site and describes the report pages available on FlyBase, focusing on the most popular, the Gene Report. The next section introduces some of the search tools available on FlyBase, in particular, our heavily used and recently redesigned search tool QuickSearch, found on the FlyBase homepage. The final section concerns genomic data, including recent modENCODE (http://www.modencode.org) data, available through our Genome Browser, GBrowse

Crossref

PubMed Central

The Drosophila phenotype ontology

Author: David Osumi-Sutherland
Georgios V Gkoutos
Gillian H Millburn
Kathleen Falls
Laura Ponting
Nicholas H Brown
Osumi-Sutherland David
Peter A McQuilton
Raymund Stefancsik
Steven J Marygold
Publication date: 01/01/2013

BACKGROUND: Phenotype ontologies are queryable classifications of phenotypes. They provide a widely-used means for annotating phenotypes in a form that is human-readable, programmatically accessible and that can be used to group annotations in biologically meaningful ways. Accurate manual annotation requires clear textual definitions for terms. Accurate grouping and fruitful programmatic usage require high-quality formal definitions that can be used to automate classification. The Drosophila phenotype ontology (DPO) has been used to annotate over 159,000 phenotypes in FlyBase to date, but until recently lacked textual or formal definitions. RESULTS: We have composed textual definitions for all DPO terms and formal definitions for 77% of them. Formal definitions reference terms from a range of widely-used ontologies including the Phenotype and Trait Ontology (PATO), the Gene Ontology (GO) and the Cell Ontology (CL). We also describe a generally applicable system, devised for the DPO, for recording and reasoning about the timing of death in populations. As a result of the new formalisations, 85% of classifications in the DPO are now inferred rather than asserted, with much of this classification leveraging the structure of the GO. This work has significantly improved the accuracy and completeness of classification and made further development of the DPO more sustainable. CONCLUSIONS: The DPO provides a set of well-defined terms for annotating Drosophila phenotypes and for grouping and querying the resulting annotation sets in biologically meaningful ways. Such queries have already resulted in successful function predictions from phenotype annotation. Moreover, such formalisations make extended queries possible, including cross-species queries via the external ontologies used in formal definitions. The DPO is openly available under an open source license in both OBO and OWL formats. There is good potential for it to be used more broadly by the Drosophila community, which may ultimately result in its extension to cover a broader range of phenotypes

Crossref

Aberystwyth Research Portal

Springer

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

Directly e-mailing authors of newly published papers encourages community curation

Author: Dowell
Fang
Gary B. Grumbling
Gillian H. Millburn
Helen I. Field
Hunter
Mazumder
McQuilton
Nicholas H. Brown
Stephanie M. Bunt
Steven J. Marygold
Swarbreck
Van Auken
Yook
Publication venue: Oxford University Press

Much of the data within Model Organism Databases (MODs) comes from manual curation of the primary research literature. Given limited funding and an increasing density of published material, a significant challenge facing all MODs is how to efficiently and effectively prioritize the most relevant research papers for detailed curation. Here, we report recent improvements to the triaging process used by FlyBase. We describe an automated method to directly e-mail corresponding authors of new papers, requesting that they list the genes studied and indicate (‘flag’) the types of data described in the paper using an online tool. Based on the author-assigned flags, papers are then prioritized for detailed curation and channelled to appropriate curator teams for full data extraction. The overall response rate has been 44% and the flagging of data types by authors is sufficiently accurate for effective prioritization of papers. In summary, we have established a sustainable community curation program, with the result that FlyBase curators now spend less time triaging and can devote more effort to the specialized task of detailed data extraction

Crossref

PubMed Central

BC4GO: a full-text corpus for the BioCreative IV GO task

Author: Arighi Cecilia N.
Done James
Hayman G. Thomas
Laulederkind Stanley J. F.
Li Donghui
Lu Zhiyong
Mao Yuqing
McQuilton Peter
Müller Hans-Michael
Schaeffer Mary L.
Sternberg Paul W.
Tweedie Susan
Van Auken Kimberly
Wang Shur-Jen
Wei Chih-Hsuan
Publication venue: Oxford University Press (OUP)
Publication date: 28/07/2014

Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ∼10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community

Caltech Authors

Expansion of the Gene Ontology knowledgebase and resources

Author: Antonazzo G
Attrill H
Brown NH
Harris MA
Hayles J
Marygold SJ
McQuilton P
Millburn GH
Oliver SG
Ponting L
Rey AJ
Rutherford K
Stefancsik R
Tweedie S
Wood V
Publication venue: Nucleic Acids Research
Publication date: 04/01/2017

The Gene Ontology (GO) is a comprehensive resource of computable knowledge regarding the functions of genes and gene products. As such, it is extensively used by the biomedical research community for the analysis of -omics and related data. Our continued focus is on improving the quality and utility of the GO resources, and we welcome and encourage input from researchers in all areas of biology. In this update, we summarize the current contents of the GO knowledgebase, and present several new features and improvements that have been made to the ontology, the annotations and the tools. Among the highlights are 1) developments that facilitate access to, and application of, the GO knowledgebase, and 2) extensions to the resource as well as increasing support for descriptions of causal models of biological systems and network biology. To learn more, visit http://geneontology.org/. Funding: National Institutes of Health/National Human Genome Research Institute [HG002273] awarded to the PI group formed by (alphabetically) Judith A. Blake, J. Michael Cherry, Suzanna E. Lewis, Paul W. Sternberg and Paul D. Thomas, as well as additional funding awarded to each participating institution. For more details please visit: http://geneontology.org/page/go-consortium-contributors-list. Funding for open access charge: National Institutes of Health/National Human Genome Research Institute [HG002273]

Apollo (Cambridge)

The Australasian COVID-19 Trial (ASCOT) to assess clinical outcomes in hospitalised patients with SARS-CoV-2 infection (COVID-19) treated with lopinavir/ritonavir and/or hydroxychloroquine compared to standard of care: A structured summary of a study protocol for a randomised controlled trial

Author: Aboltins C.
Anagnostou M.
Anderson S.
Arellano A.
Bhally H.
Bowen A.
Boyd M.
Burke A.
Burston V.J.
Chalmers R.
Chambers J.
Chang C.L-L
Chang J.
Charles P.
Chatterji A.
Chaw K.
Chean R.
Choong K.
Cochrane B.
Coghill S.
Commons R.
da Silva J.
Davis J.
Davis J.
Davis J.
Denholm J.T.
Dotel R.
Dummer J.
Flanagan K.
Foo H.
Gardiner B.
Gedye C.
Gilbey T.
Giola M.
Gray T.
Gray T.
Griffin P.
Grimwade K.
Hammond N.
Hart J.
Heather C.
Henderson A.
Hogg S.
Hudson B.
Hui S.
Jha V.
Knoblauch N.
Kwan B.C.H.
Lam E.
Lim L-L
Lister D.
Littleford R.
Lwin N.
Mahoney A.
Martinello M.
Matthews G.
Maze M.
Maze M.
McMahon J.
McQuilton Z.
Mina M.
Molton J.
Mora J.
Morpeth S.
Morpeth S.
Mostert C.
New D.
O’Brien D.
O’Sullivan M.
Paterson D.
Pillai P.
Post J.
Post J.
Price D.
Raby E.
Rafiei N.
Ratcliff A.
Rees M.
Restropo D.
Ritchie S.
Roberts J.
Robinson O.
Rogers B.
Rowe E.
Sasadeusz J.
Schulz T.
Sehu M.
Senanayake S.
Sheffield D.
Shum O.
Singh K.
Slack A.
Slow S.
Smith S.
Snelling T.
Su Y.
Sud A.
Sullivan R.
Tai A.
Tan S.J.
Tong S.Y.C.
Torresi J.
Trad A.
Trethewy C.
van Haal S.
Venkatesh B.
Verrall A.
Visvanathan K.
Williams J.
Wilson P.
Yong M.
Zentner D.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

Objectives: To determine if lopinavir/ritonavir +/- hydroxychloroquine will reduce the proportion of participants who survive without requiring ventilatory support, 15 days after enrolment, in adult participants with non-critically ill SARS-CoV-2 infection. Trial design: ASCOT is an investigator-initiated, multi-centre, open-label, randomised controlled trial. Participants will have been hospitalised with confirmed COVID-19, and will be randomised 1:1:1:1 to receive lopinavir/ritonavir, hydroxychloroquine, both or neither drug in addition to standard of care management. Participants: Participants will be recruited from >80 hospitals across Australia and New Zealand, representing metropolitan and regional centres in both public and private sectors. Admitted patients will be eligible if aged ≥ 18 years, have confirmed SARS-CoV-2 by nucleic acid testing in the past 12 days and are expected to remain an inpatient for at least 48 hours from the time of randomisation. Potentially eligible participants will be excluded if admitted to intensive care or requiring high level respiratory support, are currently receiving study drugs or their use is contraindicated due to allergy, drug interaction or comorbidities (including baseline QTc prolongation of 470 ms for women or 480 ms for men), or death is anticipated imminently

Research Repository

Genome-wide fine-scale recombination rate variation in Drosophila melanogaster

Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA) and the other from Africa (Gikongoro, Rwanda). It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features—including recombination rates, diversity, divergence, GC content, gene content, and sequence quality—is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between recombination and diversity

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Warwick Research Archives Portal Repository

FigShare

Overview of the interactive task in BioCreative V

Author: Afroza K. Irin
Andrew Chatr-Aryamontri
Arighi
Barbra Ferrell
Cathy H. Wu
Cecilia N. Arighi
Chu-Hsien Su
Comeau
David Campos
David Salgado
Emiliano Pereira
Evangelos Pafilis
Fabio Rinaldi
Gabriela Contreras
Georgios Gkoutos
Hamsa D. Tadepally
Hirschman
Hong-Jie Dai
Hui-Jou Chou
Ingrid Keseler
Jeyakumar Natarajan
Johanna McEntyre
Juliane Fluck
Karen Rothfels
Kimberly Van Auken
Krallinger
Lara Almeida
Lars J. Jensen
Laurel Cooper
Likert
Loukia Tsaprouni
Lucy Chilton
Lynette Hirschman
Marija Milacic
Mary Schaeffer
Matthew Mort
Nancy George
Nicole Vasilevsky
Onkar Singh
Peter McQuilton
Qinghua Wang
Raquel M. Silva
Raul Rodriguez-Esteban
Raymund Stefancsik
Riza Batista-Navarro
Sandra Orchard
Sangya Pundir
Shabbir S. Abdul
Sherri Matis-Mitchell
Shruti Rao
Silvia Jimenez
Socorro Gama-Castro
Sophia Ananiadou
Stanley J. F. Laulederkind
Sumit Madan
Suresh Subramani
Sérgio Matos
Toni R. Jue
Wu
Xiaodong Wang
Yalbi I. Balderas-Martínez
Zhiyong Lu
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2016

Fully automated text mining (TM) systems promote efficient literature searching, retrieval, and review but are not sufficient to produce ready-to-consume curated documents. These systems are not meant to replace biocurators, but instead to assist them in one or more literature curation steps. To do so, the user interface is an important aspect that needs to be considered for tool adoption. The BioCreative Interactive task (IAT) is a track designed for exploring user-system interactions, promoting development of useful TM tools, and providing a communication channel between the biocuration and the TM communities. In BioCreative V, the IAT track followed a format similar to previous interactive tracks, where the utility and usability of TM tools, as well as the generation of use cases, have been the focal points. The proposed curation tasks are user-centric and formally evaluated by biocurators. In BioCreative V IAT, seven TM systems and 43 biocurators participated. Two levels of user participation were offered to broaden curator involvement and obtain more feedback on usability aspects. The full level participation involved training on the system, curation of a set of documents with and without TM assistance, tracking of time-on-task, and completion of a user survey. The partial level participation was designed to focus on usability aspects of the interface and not the performance per se. In this case, biocurators navigated the system by performing pre-designed tasks and then were asked whether they were able to achieve the task and the level of difficulty in completing the task. In this manuscript, we describe the development of the interactive task, from planning to execution and discuss major findings for the systems tested

University of Birmingham Research Portal

HAL AMU

The University of Manchester - Institutional Repository

MPG.PuRe

Hal-Diderot

University of Bedfordshire Repository

Crossref

Online Research @ Cardiff

HAL-Inserm

Copenhagen University Research Information System

PubMed Central

Oxford University Research Archive

Representation of anatomy in online atlases and databases: a survey and collection of patterns for interface design

Author: A Burger
A Visel
AP McMahon
AS Hammonds
B Boehm
B Eames
B Hill
BA Boer de
C Armit
C Armit
C James-Zorn
CC Cubbage
CC Fowlkes
CE Konikoff
CE Slyke Van
CJ Bult
CL Thompson
CM Smith
CM Smith
CM Smith
D Davidson
D Lee
D Osumi-Sutherland
D Salgado
DG Howe
DK Darnell
DP Hill
E Lécuyer
E Segerdell
E Segerdell
E-F Lee
EF Schmidt
ES Lein
G Diez-Roux
G Grumbling
GW Bell
H Shimizu
J Boline
J Sprague
JA Davies
JB Bowes
JH Christiansen
JH Finger
JI Alonso-Barba
JP Junker
K Hotta
K Ito
K Yook
KC Cheng
L Geffers
L Han
L Richardson
L Richardson
L Richardson
L Richardson
M Ashburner
M Belmamoune
M Belmamoune
M Brozovic
M Costa
M Fujita
M Meyer
M Wong
Melissa D. Clarkson
MH Little
MJ Gilchrist
N Buchon
N Crosetto
N Heintz
N Hopwood
N Milyaev
O Tassy
P McQuilton
P Tomancak
P Tomancak
PB Antin
R Bakker
R Baldock
R Hunt-Newbury
RA Baldock
RA Baldock
RJ Bryson-Richardson
RYN Lee
S Isogai
S Isogai
S Kumar
S Leonilli
S Yokoyama
SD Harding
SE St. Pierre
SG The
SM Sunkin
T Henrich
T Henrich
TF Hayamizu
TF Hayamizu
TW Harris
W-P Lee
Y Bradford
ZL Husz
Publication venue: Springer Science and Business Media LLC

Crossref

A Nutrient-Driven tRNA Modification Alters Translational Fidelity and Genome-wide Protein Coding across an Animal Genus

Author: A Heger
B Suter
BN White
C Kimchi-Sarfaty
C Pathak
C Waldron
CA Charneski
Charles F. Aquadro
D Agashe
D. Allan Drummond
DA Drummond
DA Drummond
DB Goodman
DS Lawrie
EB Kramer
EB Kramer
Edward W. J. Wallace
EM Novoa
EP Rocha
EW Wallace
F Meier
G Das
G Kudla
G Kudla
G Sella
GL Igloi
GW Li
H Akashi
H Kasai
Harmit S. Malik
HJ Grosjean
J Robins
J Zaborske
JB Plotkin
JL Parmley
JM Ogle
John M. Zaborske
JR Powell
KA Geiler-Samerotte
KB Jacobson
L Duret
M Bulmer
M dos Reis
N Mantel
N Stoletzki
ND Singh
P McQuilton
P Shah
PF Agris
PG Higgs
PM Sharp
R Hershberg
S Noguchi
S Vicario
S Vicario
S Yokoyama
T Tuller
T Zhou
Tao Pan
TJ Siard
U Gunduz
V Stolc
Vanessa L. Bauer DuMont
W Gu
W Qian
W Ran
Y Chiari
Publication venue: Public Library of Science (PLoS)
Publication date: 01/12/2014

Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient.

Crossref

ZENODO

Directory of Open Access Journals

Dryad Digital Repository (Duke University)

PubMed Central

Edinburgh Research Explorer

Electronic Archiving System

FigShare