1,053 research outputs found

    A unifying concept for the dependence of whole-crop N:P ratio on biomass : theory and experiment

    Get PDF
    Background and Aims: Numerous estimates have been made of the concentrations of N and P required for good growth of crop species but they have not been defined by any unifying model. The aim of the present study was to develop such a model for the dependence of the N : P ratio on crop mass, to test its validity and to use it to identify elements of similarity between different crop species and wild plants. Methods: A model was derived between plant N : P ratio (Rw) and its dry biomass per unit area (W) during growth with near optimum nutrition by considering that plants consist of growth-related tissue and storage-related tissue with N : P ratios Rg and Rs, respectively. Testing and calibration against experimental data on different crop species led to a simple equation between Rw and W which was tested against independent experimental data. Key Results: The validity of the model and equation was supported by 365 measurements of Rw in 38 field experiments on crops. Rg and Rs remained approximately constant throughout growth, with average values of 11Ā·8 and 5Ā·8 by mass. The model also approximately predicted the relationships between leaf N and P concentrations in 124 advisory estimates on immature tissues and in 385 wild species from published global surveys. Conclusions: The N : P ratio of the biomass of very different crops, during growth with near optimum levels of nutrients, is defined entirely in terms of crop biomass, an average N : P ratio of the storage/structure-related tissue of the crop and an average N : P ratio of the growth-related tissue. The latter is similar to that found in leaves of many wild plant species, and even micro-organisms and terrestrial and freshwater autotrophs

    Multi-view learning and data integration for omics data

    Get PDF
    2015 - 2016In recent years, the advancement of high-throughput technologies, combined with the constant decrease of the data-storage costs, has led to the production of large amounts of data from diļ¬€erent experiments that characterise the same entities of interest. This information may relate to speciļ¬c aspects of a phenotypic entity (e.g. Gene expression), or can include the comprehensive and parallel measurement of multiple molecular events (e.g., DNA modiļ¬cations, RNA transcription and protein translation) in the same samples. Exploiting such complex and rich data is needed in the frame of systems biology for building global models able to explain complex phenotypes. For example, theuseofgenome-widedataincancerresearch, fortheidentiļ¬cationof groups of patients with similar molecular characteristics, has become a standard approach for applications in therapy-response, prognosis-prediction, and drugdevelopment.ƂăMoreover, the integration of gene expression data regarding cell treatment by drugs, and information regarding chemical structure of the drugs allowed scientist to perform more accurate drug repositioning tasks. Unfortunately, there is a big gap between the amount of information and the knowledge in which it is translated. Moreover, there is a huge need of computational methods able to integrate and analyse data to ļ¬ll this gap. Current researches in this area are following two diļ¬€erent integrative methods: one uses the complementary information of diļ¬€erent measurements for the 7 i i ā€œTemplateā€ ā€” 2017/6/9 ā€” 16:42 ā€” page 8 ā€” #8 i i i i i i study of complex phenotypes on the same samples (multi-view learning); the other tends to infer knowledge about the phenotype of interest by integrating and comparing the experiments relating to it with respect to those of diļ¬€erent phenotypes already known through comparative methods (meta-analysis). Meta-analysis can be thought as an integrative study of previous results, usually performed aggregating the summary statistics from diļ¬€erent studies. Due to its nature, meta-analysis usually involves homogeneous data. On the other hand, multi-view learning is a more ļ¬‚exible approach that considers the fusion of different data sources to get more stable and reliable estimates. Based on the type of data and the stage of integration, new methodologies have been developed spanning a landscape of techniques comprising graph theory, machine learning and statistics. Depending on the nature of the data and on the statistical problem to address, the integration of heterogeneous data can be performed at diļ¬€erent levels: early, intermediate and late. Early integration consists in concatenating data from diļ¬€erent views in a single feature space. Intermediate integration consists in transforming all the data sources in a common feature space before combining them. In the late integration methodologies, each view is analysed separately and the results are then combined. The purpose of this thesis is twofold: the former objective is the deļ¬nition of a data integration methodology for patient sub-typing (MVDA) and the latter is the development of a tool for phenotypic characterisation of nanomaterials (INSIdEnano). In this PhD thesis, I present the methodologies and the results of my research. MVDA is a multi-view methodology that aims to discover new statistically relevant patient sub-classes. Identify patient subtypes of a speciļ¬c diseases is a challenging task especially in the early diagnosis. This is a crucial point for the treatment, because not allthe patients aļ¬€ected bythe same diseasewill have the same prognosis or need the same drug treatment. This problem is usually solved by using transcriptomic data to identify groups of patients that share the same gene patterns. The main idea underlying this research work is that to combine more omics data for the same patients to obtain a better characterisation of their disease proļ¬le. The proposed methodology is a late integration approach i i ā€œTemplateā€ ā€” 2017/6/9 ā€” 16:42 ā€” page 9 ā€” #9 i i i i i i based on clustering. It works by evaluating the patient clusters in each single view and then combining the clustering results of all the views by factorising the membership matrices in a late integration manner. The eļ¬€ectiveness and the performance of our method was evaluated on six multi-view cancer datasets related to breast cancer, glioblastoma, prostate and ovarian cancer. The omics data used for the experiment are gene and miRNA expression, RNASeq and miRNASeq, Protein Expression and Copy Number Variation. In all the cases, patient sub-classes with statistical signiļ¬cance were found, identifying novel sub-groups previously not emphasised in literature. The experiments were also conducted by using prior information, as a new view in the integration process, to obtain higher accuracy in patientsā€™ classiļ¬cation. The method outperformed the single view clustering on all the datasets; moreover, it performs better when compared with other multi-view clustering algorithms and, unlike other existing methods, it can quantify the contribution of single views in the results. The method has also shown to be stable when perturbation is applied to the datasets by removing one patient at a time and evaluating the normalized mutual information between all the resulting clusterings. These observations suggest that integration of prior information with genomic features in sub-typing analysis is an eļ¬€ective strategy in identifying disease subgroups. INSIdE nano (Integrated Network of Systems bIology Eļ¬€ects of nanomaterials) is a novel tool for the systematic contextualisation of the eļ¬€ects of engineered nanomaterials (ENMs) in the biomedical context. In the recent years, omics technologies have been increasingly used to thoroughly characterise the ENMs molecular mode of action. It is possible to contextualise the molecular eļ¬€ects of diļ¬€erent types of perturbations by comparing their patterns of alterations. While this approach has been successfully used for drug repositioning, it is still missing to date a comprehensive contextualisation of the ENM mode of action. The idea behind the tool is to use analytical strategies to contextualise or position the ENM with the respect to relevant phenotypes that have been studied in literature, (such as diseases, drug treatments, and other chemical exposures) by comparing their patterns of molecular alteration. This could greatly increase the knowledge on the ENM molecular eļ¬€ects and in turn i i ā€œTemplateā€ ā€” 2017/6/9 ā€” 16:42 ā€” page 10 ā€” #10 i i i i i i contribute to the deļ¬nition of relevant pathways of toxicity as well as help in predicting the potential involvement of ENM in pathogenetic events or in novel therapeutic strategies. The main hypothesis is that suggestive patterns of similarity between sets of phenotypes could be an indication of a biological association to be further tested in toxicological or therapeutic frames. Based on the expression signature, associated to each phenotype, the strength of similarity between each pair of perturbations has been evaluated and used to build a large network of phenotypes. To ensure the usability of INSIdE nano, a robust and scalable computational infrastructure has been developed, to scan this large phenotypic network and a web-based eļ¬€ective graphic user interface has been built. Particularly, INSIdE nano was scanned to search for clique sub-networks, quadruplet structures of heterogeneous nodes (a disease, a drug, a chemical and a nanomaterial) completely interconnected by strong patterns of similarity (or anti-similarity). The predictions have been evaluated for a set of known associations between diseases and drugs, based on drug indications in clinical practice, and between diseases and chemical, based on literature-based causal exposure evidence, and focused on the possible involvement of nanomaterials in the most robust cliques. The evaluation of INSIdE nano conļ¬rmed that it highlights known disease-drug and disease-chemical connections. Moreover, disease similarities agree with the information based on their clinical features, as well as drugs and chemicals, mirroring their resemblance based on the chemical structure. Altogether, the results suggest that INSIdE nano can also be successfully used to contextualise the molecular eļ¬€ects of ENMs and infer their connections to other better studied phenotypes, speeding up their safety assessment as well as opening new perspectives concerning their usefulness in biomedicine. [edited by author]Lā€™avanzamento tecnologico delle tecnologie high-throughput, combinato con il costante decremento dei costi di memorizzazione, ha portato alla produzione di grandi quantit`a di dati provenienti da diversi esperimenti che caratterizzano le stesse entit`a di interesse. Queste informazioni possono essere relative a speciļ¬ci aspetti fenotipici (per esempio lā€™espressione genica), o possono includere misure globali e parallele di diversi aspetti molecolari (per esempio modiļ¬che del DNA, trascrizione dellā€™RNA e traduzione delle proteine) negli stessi campioni. Analizzare tali dati complessi `e utile nel campo della systems biology per costruire modelli capaci di spiegare fenotipi complessi. Ad esempio, lā€™uso di dati genome-wide nella ricerca legata al cancro, per lā€™identiļ¬cazione di gruppi di pazienti con caratteristiche molecolari simili, `e diventato un approccio standard per una prognosi precoce piu` accurata e per lā€™identiļ¬cazione di terapie speciļ¬che. Inoltre, lā€™integrazione di dati di espressione genica riguardanti il trattamento di cellule tramite farmaci ha permesso agli scienziati di ottenere accuratezze elevate per il drug repositioning. Purtroppo, esiste un grosso divario tra i dati prodotti, in seguito ai numerosi esperimenti, e lā€™informazione in cui essi sono tradotti. Quindi la comunit`a scientiļ¬ca ha una forte necessit`a di metodi computazionali per poter integrare e analizzate tali dati per riempire questo divario. La ricerca nel campo delle analisi multi-view, segue due diversi metodi di analisi integrative: uno usa le informazioni complementari di diverse misure per studiare fenotipi complessi su diversi campioni (multi-view learning); lā€™altro tende ad inferire conoscenza sul fenotipo di interesse di una entit`a confrontando gli esperimenti ad essi relativi con quelli di altre entit`a fenotipiche gi`a note in letteratura (meta-analisi). La meta-analisi pu`o essere pensata come uno studio comparativo dei risultati identiļ¬cati in un particolare esperimento, rispetto a quelli di studi precedenti. A causa della sua natura, la meta-analisi solitamente coinvolge dati omogenei. Dā€™altra parte, il multi-view learning `e un approccio piu` ļ¬‚essibile che considera la fusione di diverse sorgenti di dati per ottenere stime piu` stabili e aļ¬ƒdabili. In base al tipo di dati e al livello di integrazione, nuove metodologie sono state sviluppate a partire da tecniche basate sulla teoria dei graļ¬, machine learning e statistica. In base alla natura dei dati e al problema statistico da risolvere, lā€™integrazione di dati eterogenei pu`o essere eļ¬€ettuata a diversi livelli: early, intermediate e late integration. Le tecniche di early integration consistono nella concatenazione dei dati delle diverse viste in un unico spazio delle feature. Le tecniche di intermediate integration consistono nella trasformazione di tutte le sorgenti dati in un unico spazio comune prima di combinarle. Nelle tecniche di late integration, ogni vista `e analizzata separatamente e i risultati sono poi combinati. Lo scopo di questa tesi `e duplice: il primo obbiettivo `e la deļ¬nizione di una metodologia di integrazione dati per la sotto-tipizzazione dei pazienti (MVDA) e il secondo `e lo sviluppo di un tool per la caratterizzazione fenotipica dei nanomateriali (INSIdEnano). In questa tesi di dottorato presento le metodologie e i risultati della mia ricerca. MVDA `e una tecnica multi-view con lo scopo di scoprire nuove sotto tipologie di pazienti statisticamente rilevanti. Identiļ¬care sottotipi di pazienti per una malattia speciļ¬ca `e un obbiettivo con alto rilievo nella pratica clinica, soprattutto per la diagnosi precoce delle malattie. Questo problema `e generalmente risolto usando dati di trascrittomica per identiļ¬care i gruppi di pazienti che condividono gli stessi pattern di alterazione genica. Lā€™idea principale alla base di questo lavoro di ricerca `e quello di combinare piu` tipologie di dati omici per gli stessi pazienti per ottenere una migliore caratterizzazione del loro proļ¬lo. La metodologia proposta `e un approccio di tipo late integration basato sul clustering. Per ogni vista viene eļ¬€ettuato il clustering dei pazienti rappresentato sotto forma di matrici di membership. I risultati di tutte le viste vengono poi combinati tramite una tecnica di fattorizzazione di matrici per ottenere i metacluster ļ¬nali multi-view. La fattibilit`a e le performance del nostro metodo sono stati valutati su sei dataset multi-view relativi al tumore al seno, glioblastoma, cancro alla prostata e alle ovarie. I dati omici usati per gli esperimenti sono relativi alla espressione dei geni, espressione dei mirna, RNASeq, miRNASeq, espressione delle proteine e della Copy Number Variation. In tutti i dataset sono state identiļ¬cate sotto-tipologie di pazienti con rilevanza statistica, identiļ¬cando nuovi sottogruppi precedentemente non noti in letteratura. Ulteriori esperimenti sono stati condotti utilizzando la conoscenza a priori relativa alle macro classi dei pazienti. Tale informazione `e stata considerata come una ulteriore vista nel processo di integrazione per ottenere una accuratezza piu` elevata nella classiļ¬cazione dei pazienti. Il metodo proposto ha performance migliori degli algoritmi di clustering clussici su tutti i dataset. MVDA ha ottenuto risultati migliori in confronto a altri algoritmi di integrazione di tipo ealry e intermediate integration. Inoltre il metodo `e in grado di calcolare il contributo di ogni singola vista al risultato ļ¬nale. I risultati mostrano, anche, che il metodo `e stabile in caso di perturbazioni del dataset eļ¬€ettuate rimuovendo un paziente alla volta (leave-one-out). Queste osservazioni suggeriscono che lā€™integrazione di informazioni a priori e feature genomiche, da utilizzare congiuntamente durante lā€™analisi, `e una strategia vincente nellā€™identiļ¬cazione di sotto-tipologie di malattie. INSIdE nano (Integrated Network of Systems bIology Eļ¬€ects of nanomaterials) `e un tool innovativo per la contestualizzazione sistematica degli eļ¬€etti delle nanoparticelle (ENMs) in contesti biomedici. Negli ultimi anni, le tecnologie omiche sono state ampiamente applicate per caratterizzare i nanomateriali a livello molecolare. Eā€™ possibile contestualizzare lā€™eļ¬€etto a livello molecolare di diversi tipi di perturbazioni confrontando i loro pattern di alterazione genica. Mentre tale approccio `e stato applicato con successo nel campo del drug repositioning, una contestualizzazione estensiva dellā€™eļ¬€etto dei nanomateriali sulle cellule `e attualmente mancante. Lā€™idea alla base del tool `e quello di usare strategie comparative di analisi per contestualizzare o posizionare i nanomateriali in confronto a fenotipi rilevanti che sono stati studiati in letteratura (come ad esempio malattie dellā€™uomo, trattamenti farmacologici o esposizioni a sostanze chimiche) confrontando i loro pattern di alterazione molecolare. Questo potrebbe incrementare la conoscenza dellā€™eļ¬€etto molecolare dei nanomateriali e contribuire alla deļ¬nizione di nuovi pathway tossicologici oppure identiļ¬care eventuali coinvolgimenti dei nanomateriali in eventi patologici o in nuove strategie terapeutiche. Lā€™ipotesi alla base `e che lā€™identiļ¬cazione di pattern di similarit`a tra insiemi di fenotipi potrebbe essere una indicazione di una associazione biologica che deve essere successivamente testata in ambito tossicologico o terapeutico. Basandosi sulla ļ¬rma di espressione genica, associata ad ogni fenotipo, la similarit`a tra ogni coppia di perturbazioni `e stata valuta e usata per costruire una grande network di interazione tra fenotipi. Per assicurare lā€™utilizzo di INSIdE nano, `e stata sviluppata una infrastruttura computazionale robusta e scalabile, allo scopo di analizzare tale network. Inoltre `e stato realizzato un sito web che permettesse agli utenti di interrogare e visualizzare la network in modo semplice ed eļ¬ƒciente. In particolare, INSIdE nano `e stato analizzato cercando tutte le possibili clique di quattro elementi eterogenei (un nanomateriale, un farmaco, una malattia e una sostanza chimica). Una clique `e una sotto network completamente connessa, dove ogni elemento `e collegato con tutti gli altri. Di tutte le clique, sono state considerate come signiļ¬cative solo quelle per le quali le associazioni tra farmaco e malattia e farmaco e sostanze chimiche sono note. Le connessioni note tra farmaci e malattie si basano sul fatto che il farmaco `e prescritto per curare tale malattia. Le connessioni note tra malattia e sostanze chimiche si basano su evidenze presenti in letteratura del fatto che tali sostanze causano la malattia. Il focus `e stato posto sul possibile coinvolgimento dei nanomateriali con le malattie presenti in tali clique. La valutazione di INSIdE nano ha confermato che esso mette in evidenza connessioni note tra malattie e farmaci e tra malattie e sostanze chimiche. Inoltre la similarit`a tra le malattie calcolata in base ai geni `e conforme alle informazioni basate sulle loro informazioni cliniche. Allo stesso modo le similarit`a tra farmaci e sostanze chimiche rispecchiano le loro similarit`a basate sulla struttura chimica. Nellā€™insieme, i risultati suggeriscono che INSIdE nano pu`o essere usato per contestualizzare lā€™eļ¬€etto molecolare dei nanomateriali e inferirne le connessioni rispetto a fenotipi precedentemente studiati in letteratura. Questo metodo permette di velocizzare il processo di valutazione della loro tossicit`a e apre nuove prospettive per il loro utilizzo nella biomedicina. [a cura dell'autore]XV n.s

    Embodied Energy Versus Operational Energy in a Nearly Zero Energy Building Case Study

    Get PDF
    Currently in the NZEB energy demand calculation method the Embodied Energy is not included, despite the state-of-the-art recognizes a relevant energy impact caused by raw materials extraction as well as components manufacturing, product final assembly and transportation. Aim of this study was to assess the Embodied Energy in a NZEB case study along with the Operational Energy, pointing out the importance of taking into account both these aspects since the earliest design stage. Within the research activity here presented, for accounting the EE, a worksheet was developed and implemented with over 65 materials taken from a database carried out by the authors, in order to encourage designers to properly manage these issues

    Existence and multiplicity of peaked bound states for nonlinear Schr\"odinger equations on metric graphs

    Full text link
    We establish existence and multiplicity of one-peaked and multi-peaked positive bound states for nonlinear Schr\"odinger equations on general compact and noncompact metric graphs. Precisely, we construct solutions concentrating at every vertex of odd degree greater than or equal to 33. We show that these solutions are not minimizers of the associated action and energy functionals. To the best of our knowledge, this is the first work exhibiting solutions concentrating at vertices with degree different than 11. The proof is based on a suitable Ljapunov-Schmidt reduction.Comment: 25 pages, 2 figure

    Energy Evaluation of a {PV}-Based Test Facility for Assessing Future Self-Sufficient Buildings

    Get PDF
    In recent years, investigations on advanced technological solutions aiming to achieve high-energy performance in buildings have been carried out by research centers and universities, in accordance with the reduction in buildingsā€™ energy consumption required by European Union. However, even if the research and design of new technological solutions makes it possible to achieve the regulatory objectives, a buildingā€™s performance during operation deviates from simulations. To deepen this topic, interesting studies have focused on testing these solutions on full-scale facilities used for real-life activities. In this context, a test facility will be built in the university campus of Politecnico di Torino (Italy). The facility has been designed to be an all-electric nearly Zero Energy Building (nZEB), where heating and cooling demand will be fulfilled by an air-source heat pump and photovoltaic generators will meet the energy demand. In this paper, the facility energy performance is evaluated through a dynamic simulation model. To improve energy self-sufficiency, the integration of lithium-ion batteries in a HVAC system is investigated and their storage size is optimized. Moreover, the facility has been divided into three units equipped with independent electric systems with the aim of estimating the benefits of local energy sharing. The simulation results clarify that the facility meets the expected energy performance, and that it is consistent with a typical European nZEB. The results also demonstrate that the local use of photovoltaic energy can be enhanced thanks to batteries and local energy sharing, achieving a greater independence from the external electrical grid. Furthermore, the analysis of the impact of the local energy sharing makes the case study of particular interest, as it represents a simplified approach to the energy community concept. Thus, the results clarify the academic potential for this facility, in terms of both research and didactic purposes

    Network Analysis of Microarray Data

    Get PDF
    DNA microarrays are widely used to investigate gene expression. Even though the classical analysis of microarray data is based on the study of differentially expressed genes, it is well known that genes do not act individually. Network analysis can be applied to study association patterns of the genes in a biological system. Moreover, it finds wide application in differential coexpression analysis between different systems. Network based coexpression studies have for example been used in (complex) disease gene prioritization, disease subtyping, and patient stratification.Peer reviewe

    Family childhood experiences reports in depressed patients: comparison between 2 time points

    Get PDF
    Research has shown some discrepancies in the reports of experiences from childhood when an individual is depressed, because a depressed mood may have biasing effects on autobiographical memory. The present study sought to clarify this issue by examining whether there is temporal stability in the report of childhood experiences in depressed subjects, or rather, if these experiences are influenced by the mood at the time of report. The study therefore carries implications for the credibility of childhood reports of depressed adults, for the validity of the questionnaire used ā€“ Family Background Questionnaire (FBQ), adapted from Melchert (1991) ā€“ and for the conclusions it might yield. We hypothesized that the report of the same childhood family experiences across the year would not be influenced by the mood disorder. To test this prediction, we solicited reports of family experiences in 25 depressed subjects (76% women and 24% men) across the course of one year . The diagnosis of Major Depressive Episode at the outset of the study was confirmed in all subjects with the use of Structured Clinical Interview for DSM-IV Axis I and the Beck Depression Inventory (BDI) to quantify level of depressive mood (M = 19.80, SD = 10.68). The report of childhood and family experiences was collected with the FBQ (Melchert, 1991; Melchert & Sayger, 1998), which consists of 124 items comprising 14 subscales. As hypothesized, results demonstrated that the reporting of childhood experiences in dthe family after approximately 1 year was not influenced by mood state of depression, which reinforces the reliability of childhood reports and the adequate reliability of the FBQ. However, there were significant differences between the first and second moment in the mood of the subjects (BDI), with a significant mood improvement after one year. These results are consistent with those of other authors which confirms that the use of questionnaires with more objective and specific items reduce the risk of biased responses on self-reporting childhood experiences

    VOLTA : adVanced mOLecular neTwork Analysis

    Get PDF
    Motivation: Network analysis is a powerful approach to investigate biological systems. It is often applied to study gene co-expression patterns derived from transcriptomics experiments. Even though co-expression analysis is widely used, there is still a lack of tools that are open and customizable on the basis of different network types and analysis scenarios (e.g. through function accessibility), but are also suitable for novice users by providing complete analysis pipelines. Results: We developed VOLTA, a Python package suited for complex co-expression network analysis. VOLTA is designed to allow users direct access to the individual functions, while they are also provided with complete analysis pipelines. Moreover, VOLTA offers when possible multiple algorithms applicable to each analytical step (e.g. multiple community detection or clustering algorithms are provided), hence providing the user with the possibility to perform analysis tailored to their needs. This makes VOLTA highly suitable for experienced users who wish to build their own analysis pipelines for a wide range of networks as well as for novice users for which a 'plug and play' system is provided.Peer reviewe

    Manually curated and harmonised transcriptomics datasets of psoriasis and atopic dermatitis patients

    Get PDF
    We present manually curated transcriptomics data of psoriasis and atopic dermatitis patients retrieved from the NCBI Gene Expression Omnibus and EBI ArrayExpress repositories. We collected 39 transcriptomics datasets, deriving from DNA microarrays and RNA-Sequencing technologies, for a total of 1677 samples. We provide quality-checked, homogenised and preprocessed gene expression matrices and their corresponding metadata tables along with the estimated surrogate variables. These data represent a ready-made valuable source of knowledge for translational researchers in the dermatology field.Peer reviewe

    The potential of a data centred approach & knowledge graph data representation in chemical safety and drug design

    Get PDF
    Big Data pervades nearly all areas of life sciences, yet the analysis of large integrated data sets remains a major challenge. Moreover, the field of life sciences is highly fragmented and, consequently, so is its data, knowledge, and standards. This, in turn, makes integrated data analysis and knowledge gathering across sub-fields a demanding task. At the same time, the integration of various research angles and data types is crucial for modelling the complexity of organisms and biological processes in a holistic manner. This is especially valid in the context of drug development and chemical safety assessment where computational methods can provide solutions for the urgent need of fast, effective, and sustainable approaches. At the same time, such computational methods require the development of methodologies suitable for an inte-grated and data centred Big Data view. Here we discuss Knowledge Graphs (KG) as a solution to a data centred analysis approach for drug and chemical development and safety assessment. KGs are knowledge bases, data analysis engines, and knowledge discovery systems all in one, allowing them to be used from simple data retrieval, over meta-analysis to complex predictive and knowledge discovery systems. Therefore, KGs have immense potential to advance the data centred approach, the re-usability, and infor-mativity of data. Furthermore, they can improve the power of analysis, and the complexity of modelled processes, all while providing knowledge in a natively human understandable network data model. (c) 2022 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creative-commons.org/licenses/by-nc-nd/4.0/).Peer reviewe
    • ā€¦
    corecore