Search CORE

317 research outputs found

A graph-search framework for associating gene identifiers with documents

Author: A Yeh
AM Cohen
AM Cohen
AM Cohen
C Zhai
Consortium TGO
D Hanisch
E Hatcher
E Minkov
E Minkov
Einat Minkov
F Sha
J Crim
K Franzén
K Fundel
K Humphreys
L Hirschman
L Hirschman
M Collins
M Craven
R Bunescu
RI Kondor
T Rindflesch
U Leser
William W Cohen
WW Cohen
WW Cohen
WW Cohen
Y Altun
Y Freund
Z Kou
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: One step in the model organism database curation process is to find, for each article, the identifier of every gene discussed in the article. We consider a relaxation of this problem suitable for semi-automated systems, in which each article is associated with a ranked list of possible gene identifiers, and experimentally compare methods for solving this geneId ranking problem. In addition to baseline approaches based on combining named entity recognition (NER) systems with a "soft dictionary" of gene synonyms, we evaluate a graph-based method which combines the outputs of multiple NER systems, as well as other sources of information, and a learning method for reranking the output of the graph-based method. RESULTS: We show that named entity recognition (NER) systems with similar F-measure performance can have significantly different performance when used with a soft dictionary for geneId-ranking. The graph-based approach can outperform any of its component NER systems, even without learning, and learning can further improve the performance of the graph-based ranking approach. CONCLUSION: The utility of a named entity recognition (NER) system for geneId-finding may not be accurately predicted by its entity-level F1 performance, the most common performance measure. GeneId-ranking systems are best implemented by combining several NER systems. With appropriate combination methods, usefully accurate geneId-ranking systems can be constructed based on easily-available resources, without resorting to problem-specific, engineered components

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Correction to: Integrative analysis of loss-of-function variants in clinical and genomic data reveals novel genes associated with cardiovascular traits

Author: Akers Nicholas K.
Amadori Letizia
Ayers Kristin L.
Badgeley Marcus A.
Belbin Gillian M.
Betsholtz Christer
Björkegren Johan L. M.
Chen Rong
Darrow Bruce J.
Dudley Joel T.
Ermel Raili
Franzén Oscar
Giannarelli Chiara
Glicksberg Benjamin S.
Johnson Kipp W.
Kenny Eimear E.
Kovacic Jason C.
Li Li
Li Shuyu D.
Readhead Ben
Ren Hongxia
Ruusalepp Arno
Schadt Eric E.
Shameer Khader
Skogsberg Josefin
Sukhavasi Katyayani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/11/2019
Field of study

Erratum for Integrative analysis of loss-of-function variants in clinical and genomic data reveals novel genes associated with cardiovascular traits. [BMC Med Genomics. 2019

IUPUIScholarWorks

Fifteen new risk loci for coronary artery disease highlight arterial-wall-specific mechanisms

Author: A Schröder
Abdulla al Shafi Majumder
Adam S Butterworth
Alex P Reiner
Anders Malarstig
Andrew D Johnson
Anne Justice
Anne Tybjærg-Hansen
AP Levy
AP Morris
AP Reiner
Arshed A Quyyumi
Asif Rasheed
AV Segrè
Benjamin B Sun
BF Voight
BL Harry
BP Fairfax
Børge G Nordestgaard
C Moore
Cara L Carty
Carl J Pepine
Chao A Hsiung
Charles Kooperberg
CJ Willer
D Qu
Daniel F Freitag
Daniel J Rader
Daniel R Barnes
Danish Saleheen
Devin Absher
Dewan S Alam
Dirk S Paul
DM Greenawalt
E Grundberg
EE Schadt
Elias L Salfati
Emanuele Di Angelantonio
Eric B Fauman
Eric Boerwinkle
F Innocenti
GA Roth
GR Abecasis
H Kirsten
H Lin
H Schunkert
Heribert Schunkert
HJ Westra
Hugh Watkins
I Holme
I-Te Lee
J Chen
J Dennis
J Erdmann
J Ernst
J Ernst
J Yang
Jeanette Erdmann
Jemma B Wilk
Jerome I Rotter
Joanna M M Howson
John A Spertus
John D Eicher
John Danesh
JR Privratsky
JR Staley
Julie A Johnson
Jyh-Ming J Juang
Kari E North
Katrine L Rasmussen
Kent D Taylor
Kristin Young
LD Ward
Lindsay L Waite
LM Boettger
Lucia A Hindorff
M Arnold
M Narahara
M Uhlen
M Uhlén
Mariaelisa Graff
N Franceschini
Nilesh J Samani
NJ Samani
NL Smith
Nora Franceschini
O Franzén
P Surendran
P Zanoni
Panos Deloukas
Philippe Frossard
Pia R Kamstrup
Praveen Surendran
R Goel
Rajiv Chowdhury
Ren-Hua Chung
Robin Young
Ron Do
S Purcell
Sekar Kathiresan
Stanley L Hazen
Steven Buyske
Sune F Nielsen
T Lappalainen
T Zeller
Themistocles L Assimes
Thomas Quertermous
TL Assimes
TM Teslovich
Tzung-Dau Wang
Ulrike Peters
V Nanda
W Tang
Wayne H H Sheu
Weang-Kee Ho
Wei Zhao
Wei-Yu Lin
Wen-Jane Lee
WJ Astle
X Zhang
X Zhou
Xiuqing Guo
Yi-Jen Hung
Yii-Der Ida Chen
Ying-Hsiang Chen
Z Wang
Å Johansson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Although 58 genomic regions have been associated with CAD thus far, most of the heritability is unexplained, indicating that additional susceptibility loci await identification. An efficient discovery strategy may be larger-scale evaluation of promising associations suggested by genome-wide association studies (GWAS). Hence, we genotyped 56,309 participants using a targeted gene array derived from earlier GWAS results and performed meta-analysis of results with 194,427 participants previously genotyped, totaling 88,192 CAD cases and 162,544 controls. We identified 25 new SNP-CAD associations (P < 5 × 10(-8), in fixed-effects meta-analysis) from 15 genomic regions, including SNPs in or near genes involved in cellular adhesion, leukocyte migration and atherosclerosis (PECAM1, rs1867624), coagulation and inflammation (PROCR, rs867186 (p.Ser219Gly)) and vascular smooth muscle cell differentiation (LMOD1, rs2820315). Correlation of these regions with cell-type-specific gene expression and plasma protein levels sheds light on potential disease mechanisms

Crossref

Copenhagen University Research Information System

Carolina Digital Repository

eScholarship - University of California

Enlighten

Sperm design and variation in the New World blackbirds (Icteridae)

Author: AF Malo
AF Malo
AF Malo
BGM Jamieson
CK Cornwallis
CW LaMunyon
CW LaMunyon
D Urbach
DF Katz
DP Froman
DR Levitan
E García-Berthou
EH Morrow
EH Morrow
EJ Pellatt
EP Martins
G Arnqvist
G Burness
GA Parker
GA Parker
GA Parker
GA Parker
George M. Linz
GT Miller
HDM Moore
J Cohen
J Felsenstein
J Sivinski
JA Stoltz
JH Samour
JJL Higdon
JL Fitzpatrick
JL Fitzpatrick
JV Briskie
JV Briskie
L Locatello
LK Dybas
LZ Garamszegi
M Gomendio
M Pagel
MD Pagel
MJ Anderson
MJG Gage
MJG Gage
MJG Gage
MJG Gage
N Minoretti
O Kleven
PG Byrne
PH Harvey
PH Harvey
PI Ward
R Thornhill
RA Beatty
RA Beatty
RA Cardullo
RP Freckleton
RR Snook
S Balshine
S Calhim
S Calhim
S Humphries
S Immler
S Immler
S Immler
S Immler
S Lüpold
S Lüpold
S Pitnick
Stefan Lüpold
T Garland
T Pizzari
TE Pitcher
Tim R. Birkhead
TR Birkhead
TR Birkhead
WE Harris
WG Eberhard
WG Eberhard
WH Burrows
WV Holt
Å Franzén
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Post-copulatory sexual selection (PCSS) is thought to be one of the evolutionary forces responsible for the rapid and divergent evolution of sperm design. However, whereas in some taxa particular sperm traits are positively associated with PCSS, in other taxa, these relationships are negative, and the causes of these different patterns across taxa are poorly understood. In a comparative study using New World blackbirds (Icteridae), we tested whether sperm design was influenced by the level of PCSS and found significant positive associations with the level of PCSS for all sperm components but head length. Additionally, whereas the absolute length of sperm components increased, their variation declined with the intensity of PCSS, indicating stabilizing selection around an optimal sperm design. Given the diversity of, and strong selection on, sperm design, it seems likely that sperm phenotype may influence sperm velocity within species. However, in contrast to other recent studies of passerine birds, but consistent with several other studies, we found no significant link between sperm design and velocity, using four different species that vary both in sperm design and PCSS. Potential reasons for this discrepancy between studies are discussed

Crossref

DigitalCommons@University of Nebraska

ZORA

White Rose Research Online

BioInfer: a corpus for information extraction in the biomedical domain

Author: A Yakushiji
CF Baker
D Lin
DD Sleator
E Alphonse
E Tsivtsivadze
E Tsivtsivadze
F Ginter
Filip Ginter
G Hripcsak
H Shatkay
J Cohen
J Ding
J Kim
Jari Björne
JM Temkin
Jorma Boberg
Jouni Järvinen
Juho Heimonen
K Franzén
K Kipper
KB Cohen
KB Cohen
L Hirschman
L Salwinski
M Ashburner
N Daraselia
P Kingsbury
P Kingsbury
P Szolovits
S Aubin
S Pyysalo
S Pyysalo
S Pyysalo
S Siegel
Sampo Pyysalo
T Ohta
T Pahikkala
T Wattarujeekrit
Tapio Salakoski
TH King
Y Tateisi
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Lately, there has been a great interest in the application of information extraction methods to the biomedical domain, in particular, to the extraction of relationships of genes, proteins, and RNA from scientific publications. The development and evaluation of such methods requires annotated domain corpora. RESULTS: We present BioInfer (Bio Information Extraction Resource), a new public resource providing an annotated corpus of biomedical English. We describe an annotation scheme capturing named entities and their relationships along with a dependency analysis of sentence syntax. We further present ontologies defining the types of entities and relationships annotated in the corpus. Currently, the corpus contains 1100 sentences from abstracts of biomedical research articles annotated for relationships, named entities, as well as syntactic dependencies. Supporting software is provided with the corpus. The corpus is unique in the domain in combining these annotation types for a single set of sentences, and in the level of detail of the relationship annotation. CONCLUSION: We introduce a corpus targeted at protein, gene, and RNA relationships which serves as a resource for the development of information extraction systems and their components such as parsers and domain analyzers. The corpus will be maintained and further developed with a current version being available at

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Cardiometabolic risk loci share downstream cis- and trans-gene regulation across tissues and diseases

Author: Akers Nicholas K
Betsholtz Christer
Björkegren Johan L M
Cohain Ariella
Di Narzo Antonio
Ermel Raili
Foroughi-Asl Hassan
Franzén Oscar
Fullard John F
Gan Li-Ming
Giambartolomei Claudia
Giannarelli Chiara
Hao Ke
Kovacic Jason C
Köks Sulev
Losic Bojan
Michoel Tom
Roussos Panos
Ruusalepp Arno
Schadt Eric E
Skogsberg Josefin
Sukhavasi Katyayani
Talukdar Husain A
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 19/08/2016
Field of study

Edinburgh Research Explorer

Species-Area Relationships Are Controlled by Species Traits

Author: A Arrhenius
A Baz
A Gelman
AM Emmet
BA Wilcox
BM Bolker
C Parmesan
E Palm
E Öckinger
E Öckinger
F Altermatt
F Nordström
FW Preston
GD Powney
GM Mace
H Alexandersson
H Kreft
Hans Henrik Bruun
HJ Henriksen
I Steffan-Dewenter
I Svensson
I Svensson
J Beck
J Beck
J Hortal
J Itämies
J Kotiaho
J Krauss
J Nekola
J Pöyry
JA Thomas
JH Brown
JH Brown
JJ Lennon
JW Chapman
K Henle
KA Triantis
KA Triantis
KJ Gaston
KJ Gaston
L Brotons
L Cagnolo
L Huldén
LR Prugh
LR Taylor
M Franzén
M Lindeborg
M Mutanen
M Nieminen
Markus Franzén
ML Rosenzweig
MV Lomolino
MV Lomolino
N Davies
N Hydén
N Loder
O Karsholt
O Karsholt
O Karsholt
O Schweiger
Oliver Schweiger
P Inkinen
P Skou
P Skou
P Stadel Nielsen
P Sólymos
PE Betzholtz
Per-Eric Betzholtz
PJ Darlington
R Bommarco
R Kadmon
RD Holt
RD Holt
RH MacArthur
RLH Dennis
RLH Dennis
S Drakare
S Sekar
SC Stearns
SG Nilsson
SL Pimm
SP Hubbell
TH Sparks
W Ulrich
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The species-area relationship (SAR) is one of the most thoroughly investigated empirical relationships in ecology. Two theories have been proposed to explain SARs: classical island biogeography theory and niche theory. Classical island biogeography theory considers the processes of persistence, extinction, and colonization, whereas niche theory focuses on species requirements, such as habitat and resource use. Recent studies have called for the unification of these two theories to better explain the underlying mechanisms that generates SARs. In this context, species traits that can be related to each theory seem promising. Here we analyzed the SARs of butterfly and moth assemblages on islands differing in size and isolation. We tested whether species traits modify the SAR and the response to isolation. In addition to the expected overall effects on the area, traits related to each of the two theories increased the model fit, from 69% up to 90%. Steeper slopes have been shown to have a particularly higher sensitivity to area, which was indicated by species with restricted range (slope = 0.82), narrow dietary niche (slope = 0.59), low abundance (slope = 0.52), and low reproductive potential (slope = 0.51). We concluded that considering species traits by analyzing SARs yields considerable potential for unifying island biogeography theory and niche theory, and that the systematic and predictable effects observed when considering traits can help to guide conservation and management actions

Lund University Publications

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Investigation of three new mouse mammary tumor cell lines as models for transforming growth factor (TGF)-β and Neu pathway signaling studies: identification of a novel model for TGF-β-induced epithelial-to-mesenchymal transition

Author: A Philip
A Vincent-Salomon
AE Al-Moustafa
AE Gorska
AEG Lenferink
E Janda
EP Bottinger
G Pauletti
G Portella
GC Blobe
H Gobbi
H Gobbi
IL Andrulis
JA Barnard
JB Kim
JS Ross
L Yu
LM Wakefield
M Oft
MC Pepin
MJ Goumans
MT Nieman
N Dumont
P Franzén
PJ Miettinen
PM Siegel
R Derynck
R Derynck
R Markwald
RB Hazan
RR Fearon
RR Reddel
RS Muraoka
S Grunert
S Hirohashi
SD Markowitz
SS Hobbs
W Birchmeyer
WA Border
Y Shi
Y Yarden
YA Yang
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

INTRODUCTION: This report describes the isolation and characterization of three new murine mammary epithelial cell lines derived from mammary tumors from MMTV (mouse mammary tumor virus)/activated Neu + TβRII-AS (transforming growth factor [TGF]-β type II receptor antisense RNA) bigenic mice (BRI-JM01 and BRI-JM05 cell lines) and MMTV/activated Neu transgenic mice (BRI-JM04 cell line). METHODS: The BRI-JM01, BRI-JM04, and BRI-JM05 cell lines were analyzed for transgene expression, their general growth characteristics, and their sensitivities to several growth factors from the epidermal growth factor (EGF) and TGF-β families (recombinant human EGF, heregulin-β(1 )and TGF-β(1)). The BRI-JM01 cells were observed to undergo a striking morphologic change in response to TGF-β(1), and they were therefore further investigated for their ability to undergo a TGF-β-induced epithelial-to-mesenchymal transition (EMT) using motility assays and immunofluorescence microscopy. RESULTS: We found that two of the three cell lines (BRI-JM04 and BRI-JM05) express the Neu transgene, whereas, unexpectedly, both of the cell lines that were established from MMTV/activated Neu + TβRII-AS bigenic tumors (BRI-JM01 and BRI-JM05) do not express the TβRII-AS transgene. The cuboidal BRI-JM01 cells exhibit a short doubling time and are able to form confluent monolayers. The BRI-JM04 and BRI-JM05 cell lines are morphologically much less uniform, grow at a much slower rate, and do not form confluent monolayers. Only the BRI-JM05 cells can form colonies in soft agar. In contrast, all three cell lines form colonies in Matrigel, although the BRI-JM04 and BRI-JM05 cell lines do so more efficiently than the BRI-JM01 cell line. All three cell lines express the cell surface marker E-cadherin, confirming their epithelial character. Proliferation assays showed that the three cell lines respond differently to recombinant human EGF and heregulin-β(1), and that all are growth inhibited by TGF-β(1), but that only the BRI-JM01 cell line undergoes an EMT and exhibits increased motility upon TGF-β(1 )treatment. CONCLUSION: We suggest that the BRI-JM04 and BRI-JM05 cell lines can be used to investigate Neu oncogene driven mammary tumorigenesis, whereas the BRI-JM01 cell line will be useful for studying TGF-β(1)-induced EMT

NRC Publications Archive

Crossref

Springer - Publisher Connector

PubMed Central

Systematic evaluation of pleiotropy identifies 6 further loci associated with coronary artery disease

Author: Alver Maris
Asselta Rosanna
Auer Paul L.
Betsholtz Christer
Björkegren Johan L.
Braund Peter S.
Denny Josh C.
Do Ron
Doney Alex S.
Donnelly Louise A.
Dube Marie-Pierre
Duga Stefano
Eicher John D.
El-Mokhtari Nour Eddine
Erdmann Jeanette
Escher Stefan A.
Esko Tõnu
Farrall Martin
Ferrario Paola G.
Franke Andre
Franzén Oscar
Goel Anuj
Hamby Stephen E.
Heilmann Stefanie
Hengstenberg Christian
Hoffmann Per
Holmen Oddgeir L.
Hveem Kristian
Jansen Henning
Jansson Jan-Håkan
Johnson Andrew D.
Jöckel Karl-Heinz
Kanoni Stavroula
Kessler Thorsten
Kriebel Jennifer
Kruppa Jochen
König Inke R.
Laugwitz Karl L.
Lu Yingchang
Mahajan Anubha
Marouli Eirini
Martinelli Nicola
Marziliano Nicola
Masca Nicholas G.
McCarthy Mark I.
Meisinger Christa
Merlini Pier A.
Mihailov Evelin
Moebus Susanne
Morris Andrew D.
Nelson Christopher P.
Nikpay Majid
Olivieri Oliviero
Peloso Gina M.
Ruusalepp Arno
Schadt Eric E.
Schick Ursula M.
Scott Robert A.
Shaffer Christian
Stirrups Kathleen E.
Stitziel Nathan O.
van Capelleveen Julian C.
van Iperen Erik
Van Zuydam Natalie R.
Virtamo Jarma
Webb Thomas R.
Weeke Peter E.
Willenborg Christina
Won Hong-Hee
Zhang He
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

OPUS Augsburg

Benchmarking natural-language parsers for biological applications using dependency graphs

Author: A Bies
AB Clegg
Adrian J Shepherd
Andrew B Clegg
B Rosario
B Srinivas
C Friedman
C Grover
C Grover
D Blaheta
D Gildea
D Klein
D Klein
D Lin
D Lin
D Sleator
DM Bikel
E Charniak
E Tsivtsivadze
EB Camon
EJ Briscoe
G Sampson
G Schneider
G Schneider
IM Goldin
J Carroll
J Carroll
J Finkel
J Xiao
JM Temkin
K Franzén
K Knight
KB Cohen
L Smith
M Collins
M Lease
MC de Marneffe
MP Marcus
N Domedel-Puig
N Ge
O Sanchez
P Merlo
PG Mutalik
S Abney
S Kübler
S Pyysalo
ST Ahmed
T Briscoe
TC Rindflesch
Y Huang
Z Shi
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Interest is growing in the application of syntactic parsers to natural language processing problems in biology, but assessing their performance is difficult because differences in linguistic convention can falsely appear to be errors. We present a method for evaluating their accuracy using an intermediate representation based on dependency graphs, in which the semantic relationships important in most information extraction tasks are closer to the surface. We also demonstrate how this method can be easily tailored to various application-driven criteria. RESULTS: Using the GENIA corpus as a gold standard, we tested four open-source parsers which have been used in bioinformatics projects. We first present overall performance measures, and test the two leading tools, the Charniak-Lease and Bikel parsers, on subtasks tailored to reflect the requirements of a system for extracting gene expression relationships. These two tools clearly outperform the other parsers in the evaluation, and achieve accuracy levels comparable to or exceeding native dependency parsers on similar tasks in previous biological evaluations. CONCLUSION: Evaluating using dependency graphs allows parsers to be tested easily on criteria chosen according to the semantics of particular biological applications, drawing attention to important mistakes and soaking up many insignificant differences that would otherwise be reported as errors. Generating high-accuracy dependency graphs from the output of phrase-structure parsers also provides access to the more detailed syntax trees that are used in several natural-language processing techniques

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central