
    Detection of gene annotations and protein-protein interaction associated disorders through transitive relationships between integrated annotations

    Background: Gene function annotations, which are associations between a gene and a term of a controlled vocabulary describing gene functional features, are of paramount importance in modern biology. Datasets of these annotations, such as the ones provided by the Gene Ontology Consortium, are used to design novel biological experiments and interpret their results. Despite their importance, these sources of information have some known issues: they are incomplete, since biological knowledge is far from definitive and evolves rapidly, and they may contain erroneous annotations. Since curating novel annotations is costly in both money and time, computational tools that can reliably predict likely annotations, and thus speed up the discovery of new gene annotations, are very useful.
    Methods: We used a set of computational algorithms and weighting schemes to infer novel gene annotations from a set of known ones. We applied the latent semantic analysis approach, implementing two popular algorithms (Latent Semantic Indexing and Probabilistic Latent Semantic Analysis), and propose a novel method, the Semantic IMproved Latent Semantic Analysis, which adds a clustering step on the set of considered genes. Furthermore, we propose improving these algorithms by weighting the annotations in the input set.
    Results: We tested our methods and their weighted variants on the Gene Ontology annotation sets of the genes of three model organisms (Bos taurus, Danio rerio and Drosophila melanogaster). The methods showed their ability to predict novel gene annotations, and the weighting procedures led to a valuable improvement, although the results vary with the size of the input annotation set and the algorithm considered.
    Conclusions: Of the three methods considered, the Semantic IMproved Latent Semantic Analysis provides the best results. In particular, when coupled with a proper weighting policy, it is able to predict a significant number of novel annotations, demonstrating that it can be a helpful tool for supporting scientists in the curation of gene functional annotations.
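    The Latent Semantic Indexing step described in the abstract can be sketched as a truncated SVD over a gene-by-annotation matrix: high reconstructed scores in cells that were zero suggest candidate novel annotations. The matrix, the IDF-style weighting and the score threshold below are illustrative assumptions, not the paper's actual data or parameters:

```python
import numpy as np

# Hypothetical binary gene-by-annotation matrix: rows = genes,
# columns = GO terms, 1 if the annotation is known, 0 otherwise.
A = np.array([
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 1, 1],
    [1, 1, 1, 0],
], dtype=float)

# Illustrative weighting step (a simple IDF-like scheme): rarer
# terms carry more information than ubiquitous ones.
idf = np.log(A.shape[0] / (1.0 + A.sum(axis=0)))
W = A * idf

# Latent Semantic Indexing: a truncated SVD keeps the k strongest
# latent factors and discards the rest as noise.
U, s, Vt = np.linalg.svd(W, full_matrices=False)
k = 2
A_hat = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]

# Cells with a high reconstructed score but a 0 in the original
# matrix are candidate novel annotations (threshold is arbitrary).
candidates = (A == 0) & (A_hat > 0.1)
```

    The SIM variant described in the abstract would additionally cluster the genes before this step; the weighting line is where the proposed annotation-weighting policies would plug in.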

    Genomic and proteomic data integration for comprehensive biodata search


    Bacterial riboproteogenomics: the era of N-terminal proteoform existence revealed

    With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene annotation has become a necessity. Multiple lines of evidence, however, suggest that current bacterial genome annotations may contain inconsistencies and are incomplete, even for so-called well-annotated genomes. Here we discuss underexplored sources of protein diversity and new methodologies for high-throughput genome re-annotation. The expression of multiple molecular forms of proteins (proteoforms) from a single gene, particularly driven by alternative translation initiation, is gaining interest as a prominent contributor to bacterial protein diversity. Consequently, riboproteogenomic pipelines have been proposed to comprehensively capture proteoform expression in prokaryotes through the complementary use of (positional) proteomics and the direct readout of translated genomic regions using ribosome profiling. To complement these discoveries, tailored strategies are required for the functional characterization of newly discovered bacterial proteoforms.

    Next generation analytic tools for large scale genetic epidemiology studies of complex diseases

    Over the past several years, genome‐wide association studies (GWAS) have succeeded in identifying hundreds of genetic markers associated with common diseases. However, most of these markers confer relatively small increments of risk and explain only a small proportion of familial clustering. To identify obstacles to future progress in genetic epidemiology research and provide recommendations to NIH for overcoming these barriers, the National Cancer Institute sponsored a workshop entitled “Next Generation Analytic Tools for Large‐Scale Genetic Epidemiology Studies of Complex Diseases” on September 15–16, 2010. The goal of the workshop was to facilitate discussions on (1) statistical strategies and methods to efficiently identify genetic and environmental factors contributing to the risk of complex disease; and (2) how to develop, apply, and evaluate these strategies for the design, analysis, and interpretation of large‐scale complex disease association studies in order to guide NIH in setting the future agenda in this area of research. The workshop was organized as a series of short presentations covering scientific (gene‐gene and gene‐environment interaction, complex phenotypes, and rare variants and next generation sequencing) and methodological (simulation modeling and computational resources and data management) topic areas. Specific needs to advance the field were identified during each session and are summarized. Genet. Epidemiol. 36: 22–35, 2012. © 2011 Wiley Periodicals, Inc.

    Bioinformatics for personal genomics: development and application of bioinformatic procedures for the analysis of genomic data

    In the last decade, the huge decrease in sequencing costs brought about by high-throughput technologies has completely changed the way genetic problems are approached. In particular, whole-exome and whole-genome sequencing are contributing to extraordinary progress in the study of human variants, opening up new perspectives in personalized medicine. Being a relatively new and fast-developing field, it requires appropriate tools and specialized knowledge for efficient data production and analysis. In line with the times, in 2014 the University of Padua funded the BioInfoGen Strategic Project with the goal of developing technology and expertise in bioinformatics and molecular biology applied to personal genomics. The aim of my PhD was to contribute to this challenge by implementing a series of innovative tools and by applying them to investigate, and possibly solve, the case studies included in the project. I first developed an automated pipeline for dealing with Illumina data, able to sequentially perform each step needed to pass from raw reads to somatic or germline variant detection. The performance of the system was tested by means of internal controls and by its application to a cohort of patients affected by gastric cancer, obtaining interesting results. Once variants are called, they have to be annotated in order to define properties such as their position at the transcript and protein level, their impact on the protein sequence, their pathogenicity, and more. As most of the publicly available annotators were affected by systematic errors causing low consistency in the final annotation, I implemented VarPred, a new tool for variant annotation, which guarantees the best accuracy (>99%) compared with state-of-the-art programs, while also showing good processing times. To make VarPred easy to use, I equipped it with an intuitive web interface that allows not only graphical evaluation of results but also a simple filtration strategy.
    Furthermore, for valuable user-driven prioritization of human genetic variations, I developed QueryOR, a web platform suitable for searching among known candidate genes as well as for finding novel gene-disease associations. QueryOR combines several innovative features that make it comprehensive, flexible and easy to use. Prioritization is achieved by a global positive-selection process that promotes the emergence of the most reliable variants, rather than filtering out those that do not satisfy the applied criteria. QueryOR has been used to analyze the two case studies framed within the BioInfoGen project. In particular, it allowed the detection of causative variants in patients affected by lysosomal storage diseases, also highlighting the efficacy of the designed sequencing panel. On the other hand, QueryOR simplified the recognition of the LRP2 gene as a possible candidate for explaining subjects with a Dent disease-like phenotype but with no mutation in the previously identified disease-associated genes, CLCN5 and OCRL. As a final corollary, an extensive analysis of recurrent exome variants was performed, showing that their origin can mainly be explained by inaccuracies in the reference genome, including misassembled regions and uncorrected bases, rather than by platform-specific errors.
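    The positive-selection idea described for QueryOR, scoring variants by how many criteria they satisfy and ranking them instead of discarding failures, can be illustrated with a small sketch. All variant records, criteria and thresholds below are hypothetical, not taken from QueryOR itself:

```python
# Hypothetical variant records; field names are made up for illustration.
variants = [
    {"id": "var1", "impact": "missense", "af": 0.001, "in_candidate_gene": True},
    {"id": "var2", "impact": "synonymous", "af": 0.20, "in_candidate_gene": True},
    {"id": "var3", "impact": "frameshift", "af": 0.0005, "in_candidate_gene": False},
]

# Each criterion is a predicate; satisfying it adds one point.
criteria = [
    lambda v: v["impact"] in {"missense", "frameshift", "stop_gained"},
    lambda v: v["af"] < 0.01,          # rare in the population
    lambda v: v["in_candidate_gene"],  # lies in a user-supplied candidate gene
]

def prioritize(variants):
    """Rank variants by number of satisfied criteria (positive selection:
    failing a criterion lowers the rank, it never removes the variant)."""
    scored = [(sum(c(v) for c in criteria), v) for v in variants]
    return sorted(scored, key=lambda t: -t[0])

ranking = prioritize(variants)
# var1 satisfies all three criteria and ranks first; var2 satisfies
# only one, yet it remains in the ranking instead of being filtered out.
```

    The design choice this sketches is the one the abstract highlights: a hard filter chain would silently drop var2, while global scoring keeps every variant visible and merely reorders them.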

    Blood-based biomarkers for Alzheimer disease: mapping the road to the clinic.

    Biomarker discovery and development for clinical research, diagnostics and therapy monitoring in clinical trials have advanced rapidly in key areas of medicine - most notably, oncology and cardiovascular diseases - allowing rapid early detection and supporting the evolution of biomarker-guided, precision-medicine-based targeted therapies. In Alzheimer disease (AD), breakthroughs in biomarker identification and validation include cerebrospinal fluid and PET markers of amyloid-β and tau proteins, which are highly accurate in detecting the presence of AD-associated pathophysiological and neuropathological changes. However, the high cost, insufficient accessibility and/or invasiveness of these assays limit their use as viable first-line tools for detecting patterns of pathophysiology. Therefore, a multistage, tiered approach is needed, prioritizing development of an initial screen to exclude from these tests the high numbers of people with cognitive deficits who do not demonstrate evidence of underlying AD pathophysiology. This Review summarizes the efforts of an international working group that aimed to survey the current landscape of blood-based AD biomarkers and outlines operational steps for an effective academic-industry co-development pathway from identification and assay development to validation for clinical use. I received an honorarium from Roche Diagnostics for my participation in the advisory panel meeting leading to this paper.

    Toward reliable biomarker signatures in the age of liquid biopsies - how to standardize the small RNA-Seq workflow

    Small RNA-Seq has emerged as a powerful tool in transcriptomics, gene expression profiling and biomarker discovery. Sequencing cell-free nucleic acids, particularly microRNA (miRNA), from liquid biopsies additionally provides exciting possibilities for molecular diagnostics, and might help establish disease-specific biomarker signatures. The complexity of the small RNA-Seq workflow, however, bears challenges and biases that researchers need to be aware of in order to generate high-quality data. Rigorous standardization and extensive validation are required to guarantee reliability, reproducibility and comparability of research findings. Hypotheses based on flawed experimental conditions can be inconsistent and even misleading. Comparable to the well-established MIQE guidelines for qPCR experiments, this work aims at establishing guidelines for experimental design and pre-analytical sample processing, standardization of library preparation and sequencing reactions, as well as facilitating data analysis. We highlight bottlenecks in small RNA-Seq experiments, point out the importance of stringent quality control and validation, and provide a primer for differential expression analysis and biomarker discovery. Following our recommendations will encourage better sequencing practice, increase experimental transparency and lead to more reproducible small RNA-Seq results. This will ultimately enhance the validity of biomarker signatures, and allow reliable and robust clinical predictions.
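    As a toy illustration of the differential-expression primer the abstract mentions, the sketch below normalizes a made-up miRNA count matrix to counts per million (a standard first step that removes library-size differences) and computes a log2 fold change between two sample groups. The counts, the pseudocount and the group split are illustrative assumptions, not the paper's data or recommended pipeline:

```python
import numpy as np

# Toy miRNA count matrix (rows = miRNAs, columns = samples);
# names and numbers are made up for illustration.
counts = np.array([
    [100,  120,  400,  380],   # miR-A: higher in the last two samples
    [500,  480,  510,  490],   # miR-B: roughly constant
    [9400, 9400, 9090, 9130],  # stable high-abundance background
], dtype=float)

# Counts-per-million normalization: scale each sample (column) by its
# total library size, then multiply by one million.
cpm = counts / counts.sum(axis=0) * 1e6

# log2 fold change between two groups (samples 0-1 vs samples 2-3),
# with a pseudocount of 1 to avoid taking the log of zero.
group1 = cpm[:, :2].mean(axis=1)
group2 = cpm[:, 2:].mean(axis=1)
log2fc = np.log2((group2 + 1) / (group1 + 1))
```

    A real analysis would follow this with a statistical test across replicates (the kind of step the guidelines' primer covers); the point here is only the order of operations: normalize for library size first, then compare groups.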