Search CORE

76 research outputs found

TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes

Author: Davidsen Tanja
Ganapathy Anurhada
Gwinn-Giglio Michelle
Haft Daniel H.
Nelson William C.
Richter Alexander R.
Selengut Jeremy D.
White Owen
Publication venue: Oxford University Press
Publication date: 06/12/2006
Field of study

TIGRFAMs is a collection of protein family definitions built to aid in high-throughput annotation of specific protein functions. Each family is based on a hidden Markov model (HMM), where both cutoff scores and membership in the seed alignment are chosen so that the HMMs can classify numerous proteins according to their specific molecular functions. Most TIGRFAMs models describe ‘equivalog’ families, where both orthology and lateral gene transfer may be part of the evolutionary history, but where a single molecular function has been conserved. The Genome Properties system contains a queriable set of metabolic reconstructions, genome metrics and extractions of information from the scientific literature. Its genome-by-genome assertions of whether or not specific structures, pathways or systems are present provide high-level conceptual descriptions of genomic content. These assertions enable comparative genomics, provide a meaningful biological context to aid in manual annotation, support assignments of Gene Ontology (GO) biological process terms and help validate HMM-based predictions of protein function. The Genome Properties system is particularly useful as a generator of phylogenetic profiles, through which new protein family functions may be discovered. The TIGRFAMs and Genome Properties systems can be accessed at and

CiteSeerX

Crossref

PubMed Central

A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine

Author: Anthony R. Kerlavage
Ishwar Chandramouliswaran
Izumi V. Hinkson
Izumi V. Hinkson
Juli D. Klemm
Tanja M. Davidsen
Warren A. Kibbe
Warren A. Kibbe
Publication venue: 'Frontiers Media SA'
Publication date: 01/09/2017
Field of study

Advancements in next-generation sequencing and other -omics technologies are accelerating the detailed molecular characterization of individual patient tumors, and driving the evolution of precision medicine. Cancer is no longer considered a single disease, but rather, a diverse array of diseases wherein each patient has a unique collection of germline variants and somatic mutations. Molecular profiling of patient-derived samples has led to a data explosion that could help us understand the contributions of environment and germline to risk, therapeutic response, and outcome. To maximize the value of these data, an interdisciplinary approach is paramount. The National Cancer Institute (NCI) has initiated multiple projects to characterize tumor samples using multi-omic approaches. These projects harness the expertise of clinicians, biologists, computer scientists, and software engineers to investigate cancer biology and therapeutic response in multidisciplinary teams. Petabytes of cancer genomic, transcriptomic, epigenomic, proteomic, and imaging data have been generated by these projects. To address the data analysis challenges associated with these large datasets, the NCI has sponsored the development of the Genomic Data Commons (GDC) and three Cloud Resources. The GDC ensures data and metadata quality, ingests and harmonizes genomic data, and securely redistributes the data. During its pilot phase, the Cloud Resources tested multiple cloud-based approaches for enhancing data access, collaboration, computational scalability, resource democratization, and reproducibility. These NCI-led efforts are continuously being refined to better support open data practices and precision oncology, and to serve as building blocks of the NCI Cancer Research Data Commons

Directory of Open Access Journals

A regionally coherent ecological fingerprint of climate change, evidenced from natural history collections

Author: Aagaard Kaare
Bakken Torkild
Davidsen Jan Grimsrud
Dunshea Glenn
Evankow Ann
Finstad Anders Gravbrøt
Hassel Kristian
Husby Magne
Hårsaker Karstein
Koksvik Jan Ivar
Nilsen Nellie Henriette
Petersen Tanja Kofod
Prestø Tommy
Ranke Peter Sjolte
Speed James David Mervyn
Turner Grace Winifred
Vange Vibekke
Publication venue: 'Wiley'
Publication date: 01/01/2022
Field of study

publishedVersio

Brage Nord Open Research Archive

PubMed Central

The comprehensive microbial resource

Author: Alice
Alice
Altschul
Ansong
Anuradha Ganapathy
Ashburner
Bairoch
Barrett
Beiko
Benson
Chandonia
Chiu
Clarke
Delcher
Delcher
Dethlefsen
Ducey
Durot
Erin Beck
Finn
Gibbons
Granger Sutton
Griffiths-Jones
Haft
Haft
Hulo
Humbert
Johnson
Kanehisa
Karp
Kersey
Kevin Galinsky
Klimke
Lone
Lowe
Maltsev
Mamirova
Mandel
Marienhagen
Mulder
Nicely
Nikhat Zafar
Owen White
Parks
Phil Goetz
Poole
Qi Yang
Ramana Madupu
Riley
Robert Montgomery
Roca
Rouillard
Schuijffel
Slater
Sonnhammer
Tanja Davidsen
Tatusov
Webb
Xiang
Publication venue: Oxford University Press
Publication date
Field of study

The Comprehensive Microbial Resource or CMR (http://cmr.jcvi.org) provides a web-based central resource for the display, search and analysis of the sequence and annotation for complete and publicly available bacterial and archaeal genomes. In addition to displaying the original annotation from GenBank, the CMR makes available secondary automated structural and functional annotation across all genomes to provide consistent data types necessary for effective mining of genomic data. Precomputed homology searches are stored to allow meaningful genome comparisons. The CMR supplies users with over 50 different tools to utilize the sequence and annotation data across one or more of the 571 currently available genomes. At the gene level users can view the gene annotation and underlying evidence. Genome level information includes whole genome graphical displays, biochemical pathway maps and genome summary data. Comparative tools display analysis between genomes with homology and genome alignment tools, and searches across the accessions, annotation, and evidence assigned to all genes/genomes are available. The data and tools on the CMR aid genomic research and analysis, and the CMR is included in over 200 scientific publications. The code underlying the CMR website and the CMR database are freely available for download with no license restrictions

Crossref

PubMed Central

Correction: Comparative Genomics of Emerging Human Ehrlichiosis Agents

Crossref

Directory of Open Access Journals

PubMed Central

Comparative Genomics of Emerging Human Ehrlichiosis Agents

Anaplasma (formerly Ehrlichia) phagocytophilum, Ehrlichia chaffeensis, and Neorickettsia (formerly Ehrlichia) sennetsu are intracellular vector-borne pathogens that cause human ehrlichiosis, an emerging infectious disease. We present the complete genome sequences of these organisms along with comparisons to other organisms in the Rickettsiales order. Ehrlichia spp. and Anaplasma spp. display a unique large expansion of immunodominant outer membrane proteins facilitating antigenic variation. All Rickettsiales have a diminished ability to synthesize amino acids compared to their closest free-living relatives. Unlike members of the Rickettsiaceae family, these pathogenic Anaplasmataceae are capable of making all major vitamins, cofactors, and nucleotides, which could confer a beneficial role in the invertebrate vector or the vertebrate host. Further analysis identified proteins potentially involved in vacuole confinement of the Anaplasmataceae, a life cycle involving a hematophagous vector, vertebrate pathogenesis, human pathogenesis, and lack of transovarial transmission. These discoveries provide significant insights into the biology of these obligate intracellular pathogens

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Pathema: a clade-specific bioinformatics resource center for pathogen research

Pathema (http://pathema.jcvi.org) is one of the eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infectious Disease (NIAID) designed to serve as a core resource for the bio-defense and infectious disease research community. Pathema strives to support basic research and accelerate scientific progress for understanding, detecting, diagnosing and treating an established set of six target NIAID Category A–C pathogens: Category A priority pathogens; Bacillus anthracis and Clostridium botulinum, and Category B priority pathogens; Burkholderia mallei, Burkholderia pseudomallei, Clostridium perfringens and Entamoeba histolytica. Each target pathogen is represented in one of four distinct clade-specific Pathema web resources and underlying databases developed to target the specific data and analysis needs of each scientific community. All publicly available complete genome projects of phylogenetically related organisms are also represented, providing a comprehensive collection of organisms for comparative analyses. Pathema facilitates the scientific exploration of genomic and related data through its integration with web-based analysis tools, customized to obtain, display, and compute results relevant to ongoing pathogen research. Pathema serves the bio-defense and infectious disease research community by disseminating data resulting from pathogen genome sequencing projects and providing access to the results of inter-genomic comparisons for these organisms

Crossref

PubMed Central

Recommended from our members

Comprehensive molecular characterization of gastric adenocarcinoma

Author: Abdel-Misih Raafat
Ajani Jaffer
Akbani Rehan
Albert Monique
Alexopoulou Iakovina
Ally Adrian
Alonso Shelley
Askoy B. Arman
Ayala Brenda
Balasundaram Miruna
Bartlett John
Bass Adam J.
Baylin Stephen B.
Beer David G.
Belyaev Smitry
Bennett Joseph
Benz Christopher
Bernard Brady
Beroukhim Rameen
Birol Inanc
Black Aaron D.
Bootwalla Moiz S.
Boussioutas Alex
Bowen Jay
Bowlby Reanne
Bristow Christopher A.
Brooks Denise
Brown Jennifer
Brzezinski Jakub
Burton Robert
Butterfield Yaron S. N.
Camargo M. Constanza
Carlsen Rebecca
Carney Julie Ann
Carter Scott L.
Cheong Jae-Ho
Cherniack Andrew
Cherniack Andrew D.
Chin Lynda
Cho Eunjung
Cho Juok
Chu Andy
Chu Justin
Chuah Eric
Chudamani Sudha
Chun Hye-Jung E.
Cibulskis Kristian
Ciriello Giovanni
Clarke Amanda
Crain Daniel
Curely Erin
Curley Erin
Curtis Christina
Davidsen Tanja
Demchok John A.
Dhalla Noreen
Dhir Rajiv
DiCara Daniel
Ding Li
Dolzhansky Oleg
Dresdner Gideon
Eley Greg
Engel Jay
Fedosenko Konstantin
Fisher Sheila
Frazer Scott
Gabriel Stacey B.
Gao Jianjiong
Gardner Johanna
Garman Katherine
Gastier-Foster Julie M.
Gehlenborg Nils
Getz Gad
Gross Benjamin
Guin Ranabir
Gulley Margaret
Hadjipanayis Angela
Haussler David
Heiman David I.
Helsel Carmen
Herman James G.
Hinoue Toshinori
Holt Robert A.
Hutter Carolyn M.
Iacocca Mary
Ibbs Matthew
Iype Lisa
Jacobsen Anders
Janjigian Yelena Y.
Jensen Mark A.
Jones Steven J.M.
Jung Joonil
Kasaian Katayoon
Kelsen David P.
Kemkes Ariane
Kim Hark K.
Kim Jaegil
Kim Jihun
Kim Sang-Bae
Korski Konstanty
Kramer Roger W.
Kreisberg Richard
Kucherlapati Raju
Kwon Sun-Young
Kycler Witold
Ladanyi Marc
Lai Phillip H.
Laird Peter W.
Lander Eric S.
Landreneau Rodney
Lau Kevin
Lawrence Michael S.
Lee Darlene
Lee Jae-Hyuk
Lee Ju-Seog
Lee Semin
Lee William
Leiserson Mark D. M.
Leporowska Ewa
Leraas Kristen M.
Li Haiyan A.
Lichtenberg Tara M.
Lichtenstein Lee
Lim Emilia
Lin Pei
Ling Shiyun
Liu Jia
Liu Wenbin
Liu Yingchun
Lu Yiling
Luketich James
Ma Yussanne
Mackiewicz Andrzej
Mahadeshwar Harshad S.
Mallery David
Manikhas Georgy
Marra Marco A.
Mayo Michael
McAllister Cynthia
McCall Shannon J.
McLellan Michael
Meyerson Matthew
Miller Michael
Mills Shaw Kenna R.
Mills Gordon
Mills Gordon B.
Moore Richard A.
Morris Scott
Mungall Andrew J.
Mungall Karen L.
Murawa Dawid
Murawa Pawel
Murray Bradley A.
Ng Sam
Ng Santa Cruz Sam
Nip Ka Ming
Niu Beifang
Noble Michael S.
Odze Robert
Ojesina Akinyemi I.
Pantazi Angeliki
Parfenov Michael
Park Do-Youn
Park Peter J.
Park Young S.
Paulauskis Joseph
Pedamallu Chandra
Pedamallu Chandra Sekhar
Pennathur Arjun
Penny Robert
Piazuelo M. Blanca
Pihl Todd
Potapova Olga
Protopopov Alexei
Rabeno Brenda
Rabkin Charles S.
Raman Rohini
Ramirez Nilsa C.
Ramirez Ricardo
Rao Arvind
Raphael Benjamin J.
Rathmell W. Kimryn
Ren Xiaojia
Reynolds Sheila M.
Robertson A. Gordon
Rosenberg Mara
Rovira Hector
Sakai Ryo
Saksena Gordon
Sander Chris
Santoso Netty
Schein Jacqueline E.
Schneider Barbara G.
Schultz Nikolaus
Schumacher Steven E.
Seidman Jonathan
Senbabaoglu Yasin
Seth Sahil
Shelton Candace
Shelton Troy
Shen Hui
Shen Ronglai
Sherman Mark
Sheth Margi
Shmulevich Ilya
Sinha Rileen
Sipahimalani Payal
Sofia Heidi J.
Song Xingzhi
Sougnez Carrie
Spychała Arkadiusz
Stojanov Petar
Stuart Josh M.
Suchorska Wiktoria M.
Sumer S. Onur
Sun Yichao
Tabak Barbara
Tabler Teresa R.
Tam Angela
Tang Jiabin
Tang Laura
Tarnuzzer Roy
Tasman Natalie
Tatka Honorata
Taylor Barry S.
Taylor-Weiner Amaro
Teresiak Marek
Thiessen Nina
Thorsson Vesteinn
Thorsson Vésteinn
Triche Timothy
Van Den Berg David J.
Verhaak Roeland G.W.
Voet Doug
Voronina Olga
Walton Jessica
Wan Yunhu
Wang Zhining
Weaver Stephanie
Weinhold Nils
Weinstein John N.
Weisenberger Daniel J.
Willis Joseph E.
Wise Lisa
Wiznerowicz Maciej
Wu Hsin-Ta
Xi Ruibin
Xu Andrew W.
Yang Da
Yang Liming
Yang Lixing
Zack Travis I.
Zenklusen Jean Claude
Zhang Hailei
Zhang Jianhua
Zhang Wei
Zmuda Erik
Zou Lihua
ŁaŸniak Radoslaw
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2014
Field of study

Gastric cancer is a leading cause of cancer deaths, but analysis of its molecular and clinical characteristics has been complicated by histological and aetiological heterogeneity. Here we describe a comprehensive molecular evaluation of 295 primary gastric adenocarcinomas as part of The Cancer Genome Atlas (TCGA) project. We propose a molecular classification dividing gastric cancer into four subtypes: tumours positive for Epstein–Barr virus, which display recurrent PIK3CA mutations, extreme DNA hypermethylation, and amplification of JAK2, CD274 (also known as PD-L1) and PDCD1LG2 (also knownasPD-L2); microsatellite unstable tumours, which show elevated mutation rates, including mutations of genes encoding targetable oncogenic signalling proteins; genomically stable tumours, which are enriched for the diffuse histological variant and mutations of RHOA or fusions involving RHO-family GTPase-activating proteins; and tumours with chromosomal instability, which show marked aneuploidy and focal amplification of receptor tyrosine kinases. Identification of these subtypes provides a roadmap for patient stratification and trials of targeted therapies

Harvard University - DASH