Search CORE

43 research outputs found

Genomic and proteomic data integration for comprehensive biodata search

Author: Canakoglu A
Masseroli M
Publication venue
Publication date: 01/01/2012
Field of study

Archivio istituzionale della ricerca - Politecnico di Milano

Detection of gene annotations and protein-protein interaction associated disorders through transitive relationships between integrated annotations

Author: Canakoglu A
Masseroli M
Quigliatti M
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background Gene function annotations, which are associations between a gene and a term of a controlled vocabulary describing gene functional features, are of paramount importance in modern biology. Datasets of these annotations, such as the ones provided by the Gene Ontology Consortium, are used to design novel biological experiments and interpret their results. Despite their importance, these sources of information have some known issues. They are incomplete, since biological knowledge is far from being definitive and it rapidly evolves, and some erroneous annotations may be present. Since the curation process of novel annotations is a costly procedure, both in economical and time terms, computational tools that can reliably predict likely annotations, and thus quicken the discovery of new gene annotations, are very useful. Methods We used a set of computational algorithms and weighting schemes to infer novel gene annotations from a set of known ones. We used the latent semantic analysis approach, implementing two popular algorithms (Latent Semantic Indexing and Probabilistic Latent Semantic Analysis) and propose a novel method, the Semantic IMproved Latent Semantic Analysis, which adds a clustering step on the set of considered genes. Furthermore, we propose the improvement of these algorithms by weighting the annotations in the input set. Results We tested our methods and their weighted variants on the Gene Ontology annotation sets of three model organism genes (Bos taurus, Danio rerio and Drosophila melanogaster ). The methods showed their ability in predicting novel gene annotations and the weighting procedures demonstrated to lead to a valuable improvement, although the obtained results vary according to the dimension of the input annotation set and the considered algorithm. Conclusions Out of the three considered methods, the Semantic IMproved Latent Semantic Analysis is the one that provides better results. In particular, when coupled with a proper weighting policy, it is able to predict a significant number of novel annotations, demonstrating to actually be a helpful tool in supporting scientists in the curation process of gene functional annotations

Archivio istituzionale della ricerca - Politecnico di Milano

Springer - Publisher Connector

PubMed Central

Data-driven genomic computing: making sense of signals from the genome

Author: Canakoglu A
Ceri S
Kaitoua A
Masseroli M
Pinoli P.
Publication venue: CEUR Workshop Proceedings (CEUR-WS.org)
Publication date: 01/01/2017
Field of study

Archivio istituzionale della ricerca - Politecnico di Milano

Explorative search of distributed bio-data to answer complex biomedical questions

Author: A Abid
A Birkland
A Bozzon
A Campi
A Canakoglu
A Canakoglu
A Kasprzyk
A Nekrutenko
B Ludäscher
D Churches
D Hull
D Martinenghi
D Martinenghi
D Smedley
E Deelman
E Deelman
F Lemoine
Giorgio Ghisalberti
H Parkinson
J Bhagat
M Brambilla
M Johnson
M Latendresse
M Masseroli
M Masseroli
M Masseroli
M Masseroli
Marco Masseroli
Matteo Picozzi
P Mork
R Fagin
R Lopez
R Stevens
S Ceri
S Cohen-Boulakia
S Cohen-Boulakia
Stefano Ceri
TA Tatusova
TJ Lee
Y Gil
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Computational algorithms to predict Gene Ontology annotations

Author: A Canakoglu
A Hamosh
A Lazaric
A Nuzzo
AJ Perez
B Done
D Chicco
D Chicco
D Croft
D Korobkin
Davide Chicco
DM Blei
E Lavezzo
F Pessina
G Pandey
G Yu
KG Becker
L Wang
M Ashburner
M Kanehisa
M Masseroli
M Masseroli
M Masseroli
M Zitnik
Marco Masseroli
MM Kordmahalleh
OD King
P Khatri
P Pinoli
P Pinoli
Pietro Pinoli
S Raychaudhuri
S Vembu
ST Dumais
T Fawcett
T. Hofmann
X Robin
Y Tao
Z Barutcuoglu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

Mapping the human genetic architecture of COVID-19

Author: Albertos R
Andrews SJ
Aschard H
Balaconis MK
Bernasconi A
Biesecker L
Birney E
Brent Richards J
Butler-Laporte G
Buxbaum JD
Byun J
Cadilla CL
Canakoglu A
Carnero-Montoro E
Ceri S
Chwialkowska K
Cordioli M
Daly M
Darwish D
Davis L
Deelen P
Dueker N
Dunham I
Dutta AK
Eric Kerchberger V
Faucon A
Fernandez-Cadenas I
Finucane H
Folkersen L
Francescatto M
Ganna A
Garmendia A
Ghoussaini M
Gómez-Cabrero D
Han Y
Harry E
Julienne H
Kanai M
Kanoni S
Karim MA
Karjalainen J
Kaunisto M
Kenneth Baillie J
Kim HN
Koelling N
Kousathanas A
Lee S
Li R
Liao RG
Limou S
Mari F
Marouli E
Martin AR
Marttila M
Mbarek H
Medina-Gómez C
Mehtonen J
Minica C
Moltke I
Moreno-Estrada A
Moya L
Nakanishi T
Nasir J
Neale BM
Nguyen H
Nolan C
Okada Y
Pasaniuc B
Pathak GA
Polimanti R
Priest J
Pérez-Tur J
Rahmouni S
Renieri A
Sankaran VG
Savage J
Schulte EC
Schurmann C
Schwartzentruber J
Sedaghati-Khayat B
Shi H
Sloofman L
Smith GD
Solomonson M
Striano P
Teumer A
Trankiem A
Uddin MJ
Uddin MM
Utrilla A
Vadgama N
van Heel D
Veerapen K
Verdugo RA
Wendt FR
Willer CJ
Wolford B
Yengo L
Zhou W
Zárate R
Publication venue: Springer Nature
Publication date: 08/07/2021
Field of study

Matters Arising to this article was published on 03 August 2022, available online at: https://doi.org/10.1038/s41586-022-04826-7 . A second Matters Arising to this article was published on 06 September 2023, available online at: https://doi.org/10.1038/s41586-023-06355-3 .Data availability: Summary statistics generated by the COVID-19 HGI are available at https://www.covid19hg.org/results/r5/ and are available in the GWAS Catalog (study code GCST011074). The analyses described here include the freeze-5 data. COVID-19 HGI continues to regularly release new data freezes. Summary statistics for non-European ancestry samples are not currently available due to the small individual sample sizes of these groups, but results for lead variants of 13 loci are reported in Supplementary Table 3. Individual level data can be requested directly from contributing studies, listed in Supplementary Table 1. We used publicly available data from GTEx (https://gtexportal.org/home/), the Neale lab (https://www.nealelab.is/uk-biobank/), Finucane lab (https://www.finucanelab.org), the FinnGen Freeze 4 cohort (https://www.finngen.fi/en/access_results) and the eQTL catalogue release 3 (https://www.ebi.ac.uk/eqtl/).Code availability: The code for summary statistics lift-over, the projection PCA pipeline including precomputed loadings and meta-analyses are available on GitHub (https://github.com/covid19-hg/) and the code for the Mendelian randomization and genetic correlation pipeline is available on GitHub at https://github.com/marcoralab/MRcovid.Reporting summary: Further information on research design is available in the Nature Research Reporting Summary linked to this paper online at: https://www.nature.com/articles/s41586-021-03767-x#MOESM2 .Supplementary information is available onlne at: https://www.nature.com/articles/s41586-021-03767-x#Sec24 .Extended data figures and tables are available online at: https://www.nature.com/articles/s41586-021-03767-x#Sec23 .Copyright © The Author(s) 2021. The genetic make-up of an individual contributes to the susceptibility and response to viral infection. Although environmental, clinical and social factors have a role in the chance of exposure to SARS-CoV-2 and the severity of COVID-191,2, host genetics may also be important. Identifying host-specific genetic factors may reveal biological mechanisms of therapeutic relevance and clarify causal relationships of modifiable environmental risk factors for SARS-CoV-2 infection and outcomes. We formed a global network of researchers to investigate the role of human genetics in SARS-CoV-2 infection and COVID-19 severity. Here we describe the results of three genome-wide association meta-analyses that consist of up to 49,562 patients with COVID-19 from 46 studies across 19 countries. We report 13 genome-wide significant loci that are associated with SARS-CoV-2 infection or severe manifestations of COVID-19. Several of these loci correspond to previously documented associations to lung or autoimmune and inflammatory diseases3–7. They also represent potentially actionable mechanisms in response to infection. Mendelian randomization analyses support a causal role for smoking and body-mass index for severe COVID-19 although not for type II diabetes. The identification of novel host genetic factors associated with COVID-19 was made possible by the community of human genetics researchers coming together to prioritize the sharing of data, results, resources and analytical frameworks. This working model of international collaboration underscores what is possible for future genetic discoveries in emerging pandemics, or indeed for any complex human disease

Brunel University Research Archive

Mapping the human genetic architecture of COVID-19

Author: Ahmad H.
Alarcon C.
Albertos R.
Algera A.
Andrews S.
Aschard H.
Aslibekyan S.
Atanasovska B.
Auton A.
Azuure C.
Baillie J.
Baillie S.
Balaconis M.
Ball C.
Banagan J.
Barbour A.
Bax D.
Bernasconi A.
Beudel M.
Biesecker L.
Birney E.
Boer C.
Bomers M.
Bonta P.
Bos L.
Botta M.
Boua P.
Brouwer M.
Bugiani M.
Bulle E.
Butler-Laporte G.
Buxbaum J.
Byun J.
Cadilla C.
Canakoglu A.
Carnero-Montoro E.
Chouchane O.
Chwialkowska K.
Cloherty A.
Coignet M.
Coker D.
Colombo F.
Cordioli M.
Daly M.
Darwish D.
Davis L.
de Brabander J.
de Bree G.
de Bruin S.
de Jong D.
de Vries H.
Deelen P.
Dongelmans D.
Dueker N.
Dunham I.
Dutta A.
Elbers P.
Esmaeeli S.
Esparza-Gordillo J.
Faucon A.
Favé M.
Fernandez-Cadenas I.
Ferwerda B.
Filshtein-Sonmez T.
Finucane H.
Fleuren L.
Folkersen L.
Francescatto M.
Francioli L.
Franke L.
Friedman P.
Ganna A.
Garmendia A.
Geerlings S.
Geerts B.
Geijtenbeek T.
Ghoussaini M.
Girbes A.
Goorhuis B.
Grobusch M.
Gómez-Cabrero D.
Hafkamp F.
Hagens L.
Hamann J.
Hamidi Z.
Han C.
Han Y.
Harris V.
Harry E.
Hemke R.
Hermans S.
Herrmann S.
Heunks L.
Hollmann M.
Horn J.
Hovius J.
Im H.
Im S.
Jan Bogaard H.
Jansen P.
Julienne H.
Kaja E.
Kanai M.
Kanoni S.
Karim M.
Karjalainen J.
Kaunisto M.
Kennis-Szilagyi I.
Kerchberger V.
Kim H.
Kim S.
Knight S.
Koelling N.
Koning R.
Kornilov S.
Kousathanas A.
Kromhout A.
Lee S.
Lee Y.
Lemaçon A.
Lenz T.
Li R.
Liao R.
Lim J.
Limou S.
Lio P.
Mari F.
Marouli E.
Martin A.
Marttila M.
Mazurek S.
Mbarek H.
McCurdy S.
Medina-Gómez C.
Mehtonen J.
Meltzer D.
Migeotte I.
Minica C.
Minnaar R.
Moltke I.
Moreno D.
Moreno-Estrada A.
Moya L.
Nakanishi T.
Nasir J.
Neale B.
Nellen J.
Nguyen H.
Niemi M.
Nossent E.
Nutescu E.
Okada Y.
O’Brien T.
O’Connell J.
O’Donnell P.
O’Leary K.
Park D.
Partha R.
Pasaniuc B.
Pasko D.
Patel S.
Pathak G.
Paulus F.
Pearson N.
Perera M.
Perumal S.
Peters E.
Polimanti R.
Posthuma D.
Preckel B.
Priest J.
Prijatelj V.
Prins J.
Prokić I.
Pérez-Tur J.
Raasveld J.
Raffat N.
Rahmouni S.
Reijnders T.
Renieri A.
Rhead B.
Richards J.
Roberts G.
Sankaran V.
Savage J.
Schinkel M.
Schulte E.
Schultz M.
Schurmann C.
Schuurman A.
Schwartzentruber J.
Sedaghati-Khayat B.
Shastri A.
Shelton J.
Shi H.
Sigaloff K.
Sipeky C.
Sivanadhan I.
Sloofman L.
Smit M.
Smith G.
Solomonson M.
Song H.
Stijnis C.
Stilma W.
Striano P.
Symons A.
Szentpeteri J.
Tanigawa Y.
Teles A.
Teumer A.
Teunissen C.
Thoral P.
Tissink E.
Trankiem A.
Tsonas A.
Tuck M.
Uddin M.
Uddin M.
Uffelmann E.
Utrilla A.
Vadgama N.
Vallerga C.
van Agtmael M.
van Baarle F.
van de Beek D.
van der Poll T.
van der Valk M.
van Heel D.
van Mourik N.
van Uffelen J.
van Vugt M.
Varnai R.
Veelo D.
Veerapen K.
Verdugo R.
Vlaar A.
von Hohenstaufen K.
Wang Q.
Weldon C.
Wendt F.
Wiersinga W.
Willer C.
Wolford B.
Wolterman R.
Wouters D.
Yang G.
Ye C.
Yengo L.
Zhou W.
Zwinderman K.
Zárate R.
Özer O.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

The genetic make-up of an individual contributes to the susceptibility and response to viral infection. Although environmental, clinical and social factors have a role in the chance of exposure to SARS-CoV-2 and the severity of COVID-19(1,2), host genetics may also be important. Identifying host-specific genetic factors may reveal biological mechanisms of therapeutic relevance and clarify causal relationships of modifiable environmental risk factors for SARS-CoV-2 infection and outcomes. We formed a global network of researchers to investigate the role of human genetics in SARS-CoV-2 infection and COVID-19 severity. Here we describe the results of three genome-wide association meta-analyses that consist of up to 49,562 patients with COVID-19 from 46 studies across19 countries. We report 13 genome-wide significant loci that are associated with SARS-CoV-2 infection or severe manifestations of COVID-19. Several of these loci correspond to previously documented associations to lung or autoimmune and inflammatory diseases(3-7). They also represent potentially actionable mechanisms in response to infection. Mendelian randomization analyses support a causal role for smoking and body-mass index for severe COVID-19 although not for type II diabetes. The identification of novel host genetic factors associated with COVID-19 was made possible by the community of human genetics researchers coming together to prioritize the sharing of data, results, resources and analytical frameworks. This working model of international collaboration underscores what is possible for future genetic discoveries in emerging pandemics, or indeed for any complex human disease.Radiolog

Leiden University Scholary Publications

MPG.PuRe

Integration of available multiple annotation data and detection of new annotations

Author: Canakoglu A
Masseroli M
Publication venue: BITS; 2014
Publication date: 01/01/2014
Field of study

Archivio istituzionale della ricerca - Politecnico di Milano

Protein-protein interaction associated disorders revealed via data integration

Author: Canakoglu A
Masseroli M
Publication venue: SIB Swiss Institute of Bioinformatics
Publication date: 01/01/2012
Field of study

Numerous protein-protein interaction (PPI) data are provided by using new high-throughput experimental and computational techniques; they are being collected in different databases. The data generally do not contain phenotypic or even functional or structural information about the interactors, which in many cases are available in other databases. Thus, to have widespread coverage, it is necessary to combine the data from different databases. For this purpose, we are developing a framework to create and maintain a data warehouse on the basis of a conceptual data model. Then, we applied an automatic association inference method, based on the transitive closure concept. In particular, by leveraging IntAct and Mint PPI data, Entrez protein encoding gene data and OMIM genetic disorder data, we inferred associations between proteins and genetic disorders and their phenotypes. In our data warehouse, 46,154 human PPIs regarding 12,178 distinct human proteins were integrated. These human proteins are encoded by 11,232 different human genes. By applying transitive closure concept, we identified 1,130 gene networks and found 1,136 human PPIs associated with 628 genetic disorders. The interactions between the proteins, that are associated to the specific disease with transitive closure method, will help researchers to focus on protein interactions of the disease. This will helps to reveal the disease because of malfunctioning protein interactions. Then possibly the disease treatment strategy such as synthetic protein engineering could be applied. This hypothesis shows the importance of the integration of the PPI data with the genetic disorder data

Archivio istituzionale della ricerca - Politecnico di Milano

ViruClust: Direct comparison of SARS-CoV-2 genomes and genetic variants in space and time

Author: Bernasconi A.
Canakoglu A.
Ceri S.
Chiara M.
Cilibrasi L.
Pinoli P.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2022
Field of study

Motivation: The ongoing evolution of SARS-CoV-2 and the rapid emergence of variants of concern at distinct geographic locations have relevant implications for the implementation of strategies for controlling the COVID-19 pandemic. Combining the growing body of data and the evidence on potential functional implications of SARS-CoV-2 mutations can suggest highly effective methods for the prioritization of novel variants of potential concern, e.g. increasing in frequency locally and/or globally. However, these analyses may be complex, requiring the integration of different data and resources. We claim the need for a streamlined access to up-To-date and high-quality genome sequencing data from different geographic regions/countries, and the current lack of a robust and consistent framework for the evaluation/comparison of the results. Results: To overcome these limitations, we developed ViruClust, a novel tool for the comparison of SARS-CoV-2 genomic sequences and lineages in space and time. ViruClust is made available through a powerful and intuitive web-based user interface. Sophisticated large-scale analyses can be executed with a few clicks, even by users without any computational background. To demonstrate potential applications of our method, we applied ViruClust to conduct a thorough study of the evolution of the most prevalent lineage of the Delta SARS-CoV-2 variant, and derived relevant observations. By allowing the seamless integration of different types of functional annotations and the direct comparison of viral genomes and genetic variants in space and time, ViruClust represents a highly valuable resource for monitoring the evolution of SARS-CoV-2, facilitating the identification of variants and/or mutations of potential concern

Archivio istituzionale della ricerca - Politecnico di Milano