Search CORE

9 research outputs found

The COMBREX Project: Design, Methodology, and Initial Results

Author: Allen Benjamin
Anton Brian P.
Bateman Alex
Bhagwat Ashok S.
Blumenthal Robert M.
Bollinger J. Martin
Brenner Steven E.
Brown Peter J.
Chang Woo-Suk
Choi Han-Pil
Columbus Linda
Crécy-Lagard Valerié de
DeLisi Charles
Faller Lina L.
Ferguson Donald
Ferrer Manuel
Fomenkov Alexey
Friedberg Iddo
Gadda Giovanni
Galperin Michael Y.
Gobeill Julien
Greiner Russell
Guleria Jyotsna
Haft Daniel
Horn David
Housman Genevieve
Hu Jie
Hu Zhenjun
Hunt John
Karp Peter
Kasif Simon
Klimke William
Klitgord Niels
Krebs Carsten
Letovsky Stanley
Levy-Moonshine Ami
Macelis Dana
Madupu Ramana
Maksad Almaz
Mark McGettrick
Martín María J.
Mazumdar Varun
Miller Jeffrey H.
Monahan Caitlin
Morgan Richard D.
Osmani Lais
Osterman Andrei L.
O’Donovan Claire
Palsson Bernhard
Plata Germán
Pokrzywa Revonda
Rachlin John
Roberts Richard J.
Rochussen Krista
Rodionov Dmitry A.
Rodionova Irina A.
Ruch Patrick
Rudd Kenneth E.
Salzberg Steven L.
Segre Daniel
Setterdahl Aaron
Sjölander Kimmen
Spain James
Steffen Martin
Sutton Granger
Swaminathan Rajeswari
Söll Dieter
Tao Kevin
Tate John
Tchigvintsev Dmitri
Vitkup Dennis
Xu Shuang-yong
Yakunin Alexander F.
Yi-Chien Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 05/06/2019
Field of study

© 2013 Brian P. et al.Prior to the “genomic era,” when the acquisition of DNA sequence involved significant labor and expense, the sequencing of genes was strongly linked to the experimental characterization of their products. Sequencing at that time directly resulted from the need to understand an experimentally determined phenotype or biochemical activity. Now that DNA sequencing has become orders of magnitude faster and less expensive, focus has shifted to sequencing entire genomes. Since biochemistry and genetics have not, by and large, enjoyed the same improvement of scale, public sequence repositories now predominantly contain putative protein sequences for which there is no direct experimental evidence of function. Computational approaches attempt to leverage evidence associated with the ever-smaller fraction of experimentally analyzed proteins to predict function for these putative proteins. Maximizing our understanding of function over the universe of proteins in toto requires not only robust computational methods of inference but also a judicious allocation of experimental resources, focusing on proteins whose experimental characterization will maximize the number and accuracy of follow-on predictions.COMBREX is funded by a GO grant from the National Institute of General Medical Sciences (NIGMS) (1RC2GM092602-01).Peer Reviewe

Digital.CSIC

Biomarkers in the accurate subclassification of non-small-cell lung carcinoma for targeted therapy: issues and prospects

Author: Feller-Kopman D
Krencz I
Lais Osmani
Qing Kay Li
Travis WD
Publication venue: 'Future Medicine Ltd'
Publication date
Field of study

Crossref

Thousands of missed genes found in bacterial genomes and their analysis with COMBREX

Author: Ami Levy-Moonshine
Brian P Anton
Derrick E Wood
Henry Lin
Lais Osmani
Martin Steffen
Rajiswari Swaminathan
Simon Kasif
Steven L Salzberg
Wood Derrick E
Yi-Chien Chang
Publication venue: National Center for Biotechnology Information
Publication date: 01/01/2012
Field of study

The dramatic reduction in the cost of sequencing has allowed many researchers to join in the effort of sequencing and annotating prokaryotic genomes. Annotation methods vary considerably and may fail to identify some genes. Here we draw attention to a large number of likely genes missing from annotations using common tools such as Glimmer and BLAST. By analyzing 1,474 prokaryotic genome annotations in GenBank, we identify 13,602 likely missed genes that are homologs to non-hypothetical proteins, and 11,792 likely missed genes that are homologs only to hypothetical proteins, yet have supporting evidence of their protein-coding nature from COMBREX, a newly created gene function database. We also estimate the likelihood that each potential missing gene found is a genuine protein-coding gene using COMBREX. Our analysis of the causes of missed genes suggests that larger annotation centers tend to produce annotations with fewer missed genes than smaller centers, and many of the missed genes are short genes <300 bp. Over 1,000 of the likely missed genes could be associated with phenotype information available in COMBREX. 359 of these genes, found in pathogenic organisms, may be potential targets for pharmaceutical research. The newly identified genes are available on COMBREX’s website.https://doi.org/10.1186/1745-6150-7-3

Crossref

Springer

PubMed Central

Digital Repository at the University of Maryland

Aging gene signature of memory CD8+ T cells is associated with neurocognitive functioning in Alzheimer’s disease

Author: Adam P. Mecca
Christopher H. van Dyck
Heather Allore
Hong-Jai Park
Hugh Bartlett
Hye Sun Kim
Insoo Kang
Jennefer Par-Young
Juan Joseph Young
Lais Osmani
Min Sun Shin
Minhyung Kim
Richard Bucala
Serhan Unlu
Sungyong You
Publication venue: BMC
Publication date: 01/12/2023
Field of study

Abstract Background Memory CD8+ T cells expand with age. We previously demonstrated an age-associated expansion of effector memory (EM) CD8+ T cells expressing low levels of IL-7 receptor alpha (IL-7Rαlow) and the presence of its gene signature (i.e., IL-7Rαlow aging genes) in peripheral blood of older adults without Alzheimer’s disease (AD). Considering age as the strongest risk factor for AD and the recent finding of EM CD8+ T cell expansion, mostly IL-7Rαlow cells, in AD, we investigated whether subjects with AD have alterations in IL-7Rαlow aging gene signature, especially in relation to genes possibly associated with AD and disease severity. Results We identified a set of 29 candidate genes (i.e., putative AD genes) which could be differentially expressed in peripheral blood of patients with AD through the systematic search of publicly available datasets. Of the 29 putative AD genes, 9 genes (31%) were IL-7Rαlow aging genes (P < 0.001), suggesting the possible implication of IL-7Rαlow aging genes in AD. These findings were validated by RT-qPCR analysis of 40 genes, including 29 putative AD genes, additional 9 top IL-7R⍺low aging but not the putative AD genes, and 2 inflammatory control genes in peripheral blood of cognitively normal persons (CN, 38 subjects) and patients with AD (40 mild cognitive impairment and 43 dementia subjects). The RT-qPCR results showed 8 differentially expressed genes between AD and CN groups; five (62.5%) of which were top IL-7Rαlow aging genes (FGFBP2, GZMH, NUAK1, PRSS23, TGFBR3) not previously reported to be altered in AD. Unbiased clustering analysis revealed 3 clusters of dementia patients with distinct expression levels of the 40 analyzed genes, including IL-7Rαlow aging genes, which were associated with neurocognitive function as determined by MoCA, CDRsob and neuropsychological testing. Conclusions We report differential expression of “normal” aging genes associated with IL‐7Rαlow EM CD8+ T cells in peripheral blood of patients with AD, and the significance of such gene expression in clustering subjects with dementia due to AD into groups with different levels of cognitive functioning. These results provide a platform for studies investigating the possible implications of age-related immune changes, including those associated with CD8+ T cells, in AD

Directory of Open Access Journals

The COMBREX project: design, methodology, and initial results.

Author: Aaron Setterdahl
Alex Bateman
Alexander Yakunin
Alexey Fomenkov
Almaz Maksad
Ami Levy-Moonshine
Andrei L Osterman
Ashok S Bhagwat
Benjamin Allen
Bernhard Palsson
Brian P Anton
Caitlin Monahan
Carsten Krebs
Charles DeLisi
Claire O'Donovan
Dana Macelis
Daniel Haft
Daniel Segrè
David Horn
Dennis Vitkup
Dieter Söll
Dmitri Tchigvintsev
Dmitry A Rodionov
Donald Ferguson
Genevieve Housman
Germán Plata
Giovanni Gadda
Granger Sutton
Han-Pil Choi
Iddo Friedberg
Irina A Rodionova
J Martin Bollinger
James Spain
Jeffrey H Miller
Jie Hu
John Hunt
John Rachlin
John Tate
Julien Gobeill
Jyotsna Guleria
Kenneth E Rudd
Kevin Tao
Kimmen Sjölander
Krista Rochussen
Lais Osmani
Lina L Faller
Linda Columbus
Manuel Ferrer
Maria J Martin
Mark McGettrick
Martin Steffen
Michael Y Galperin
Niels Klitgord
Patrick Ruch
Peter Brown
Peter Karp
Rajeswari Swaminathan
Ramana Madupu
Revonda Pokrzywa
Richard D Morgan
Richard J Roberts
Robert M Blumenthal
Russell Greiner
Shuang-Yong Xu
Simon Kasif
Stanley Letovsky
Steven E Brenner
Steven L Salzberg
Valérie de Crécy-Lagard
Varun Mazumdar
William Klimke
Woo-Suk Chang
Yi-Chien Chang
Zhenjun Hu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Experimental data exists for only a vanishingly small fraction of sequenced microbial genes. This community page discusses the progress made by the COMBREX project to address this important issue using both computational and experimental resources

Crossref

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

University of Miami: Scholarship Miami

RERO DOC Digital Library

Definitions of COMBREX functional status symbols and fractions of microbial genes in COMBREX in each status category.

Author: Aaron Setterdahl (450762)
Alex Bateman (1395)
Alexander Yakunin (209006)
Alexey Fomenkov (450750)
Almaz Maksad (450740)
Ami Levy-Moonshine (450739)
Andrei L. Osterman (135474)
Ashok S. Bhagwat (111506)
Benjamin Allen (408337)
Bernhard Palsson (450761)
Brian P. Anton (422971)
Caitlin Monahan (450746)
Carsten Krebs (450759)
Charles DeLisi (10041)
Claire O'Donovan (2975)
Dana Macelis (57502)
Daniel Haft (57609)
Daniel Segrè (115737)
David Horn (24969)
Dennis Vitkup (2575)
Dieter Söll (69515)
Dmitri Tchigvintsev (450764)
Dmitry A. Rodionov (11371)
Donald Ferguson (450749)
Genevieve Housman (450745)
Germán Plata (40981)
Giovanni Gadda (450751)
Granger Sutton (38234)
Han-Pil Choi (422967)
Iddo Friedberg (9867)
Irina A. Rodionova (450752)
J. Martin Bollinger (450755)
James Spain (450753)
Jeffrey H. Miller (450760)
Jie Hu (130301)
John Hunt (316518)
John Rachlin (9898)
John Tate (450763)
Julien Gobeill (91134)
Jyotsna Guleria (450738)
Kenneth E. Rudd (13378)
Kevin Tao (450748)
Kimmen Sjölander (63453)
Krista Rochussen (450747)
Lais Osmani (450742)
Lina L. Faller (160214)
Linda Columbus (411169)
Manuel Ferrer (111944)
Maria J. Martin (14344)
Mark McGettrick (450741)
Martin Steffen (256788)
Michael Y. Galperin (67366)
Niels Klitgord (19888)
Patrick Ruch (29373)
Peter Brown (209993)
Peter Karp (450757)
Rajeswari Swaminathan (450744)
Ramana Madupu (2834)
Revonda Pokrzywa (450743)
Richard D. Morgan (14555)
Richard J. Roberts (8946)
Robert M. Blumenthal (450754)
Russell Greiner (6063)
Shuang-yong Xu (14559)
Simon Kasif (2952)
Stanley Letovsky (450765)
Steven E. Brenner (13943)
Steven L. Salzberg (70641)
Valérie de Crécy-Lagard (11347)
Varun Mazumdar (160221)
William Klimke (450758)
Woo-Suk Chang (450756)
Yi-Chien Chang (160237)
Zhenjun Hu (10036)
Publication venue
Publication date
Field of study

Experimentally characterized proteins are green. (Those in the green set that have been manually curated by the GSDB are also marked with a gold “G.”) Proteins with functional predictions but no experimental evidence are blue. Proteins with no available functional predictions are black.</p

FigShare

Schematic overview of the computational and experimental contributions of COMBREX and its users, and the interrelationships of these contributions.

Author: Aaron Setterdahl (450762)
Alex Bateman (1395)
Alexander Yakunin (209006)
Alexey Fomenkov (450750)
Almaz Maksad (450740)
Ami Levy-Moonshine (450739)
Andrei L. Osterman (135474)
Ashok S. Bhagwat (111506)
Benjamin Allen (408337)
Bernhard Palsson (450761)
Brian P. Anton (422971)
Caitlin Monahan (450746)
Carsten Krebs (450759)
Charles DeLisi (10041)
Claire O'Donovan (2975)
Dana Macelis (57502)
Daniel Haft (57609)
Daniel Segrè (115737)
David Horn (24969)
Dennis Vitkup (2575)
Dieter Söll (69515)
Dmitri Tchigvintsev (450764)
Dmitry A. Rodionov (11371)
Donald Ferguson (450749)
Genevieve Housman (450745)
Germán Plata (40981)
Giovanni Gadda (450751)
Granger Sutton (38234)
Han-Pil Choi (422967)
Iddo Friedberg (9867)
Irina A. Rodionova (450752)
J. Martin Bollinger (450755)
James Spain (450753)
Jeffrey H. Miller (450760)
Jie Hu (130301)
John Hunt (316518)
John Rachlin (9898)
John Tate (450763)
Julien Gobeill (91134)
Jyotsna Guleria (450738)
Kenneth E. Rudd (13378)
Kevin Tao (450748)
Kimmen Sjölander (63453)
Krista Rochussen (450747)
Lais Osmani (450742)
Lina L. Faller (160214)
Linda Columbus (411169)
Manuel Ferrer (111944)
Maria J. Martin (14344)
Mark McGettrick (450741)
Martin Steffen (256788)
Michael Y. Galperin (67366)
Niels Klitgord (19888)
Patrick Ruch (29373)
Peter Brown (209993)
Peter Karp (450757)
Rajeswari Swaminathan (450744)
Ramana Madupu (2834)
Revonda Pokrzywa (450743)
Richard D. Morgan (14555)
Richard J. Roberts (8946)
Robert M. Blumenthal (450754)
Russell Greiner (6063)
Shuang-yong Xu (14559)
Simon Kasif (2952)
Stanley Letovsky (450765)
Steven E. Brenner (13943)
Steven L. Salzberg (70641)
Valérie de Crécy-Lagard (11347)
Varun Mazumdar (160221)
William Klimke (450758)
Woo-Suk Chang (450756)
Yi-Chien Chang (160237)
Zhenjun Hu (10036)
Publication venue
Publication date
Field of study

Data and results specific to COMBREX are shown in boxes. External data imported into COMBREX are also shown, with arrows indicating entry points into the cycle. Methodology employed by COMBREX and its users is shown in blue type, as it is used to generate data. Not shown are two critical contributions to COMBREX: genome and cluster data imported from NCBI RefSeq and ProtClustDB, respectively, and NIH funding, which enables the grants that COMBREX issues to experimental laboratories.</p

FigShare