Search CORE

2,932 research outputs found

Statistical Methods in Integrative Genomics

Author: Richardson Sylvia
Sun Wei
Tseng George C.
Publication venue
Publication date: 01/01/2016
Field of study

Statistical methods in integrative genomics aim to answer important biology questions by jointly analyzing multiple types of genomic data (vertical integration) or aggregating the same type of data across multiple studies (horizontal integration). In this article, we introduce different types of genomic data and data resources, and then review statistical methods of integrative genomics, with emphasis on the motivation and rationale of these methods. We conclude with some summary points and future research directions

PubMed Central

Carolina Digital Repository

Development and evaluation of prostate cancer risk prediction models for use in the community

Author: Aladwani Mohammad
Publication venue
Publication date: 31/12/2022
Field of study

The University of Manchester - Institutional Repository

Recommended from our members

The biological embedding of early-life socioeconomic status and family adversity in children's genome-wide DNA methylation.

Author: Adler Nancy E
Boyce W Thomas
Bush Nicole R
Edgar Rachel D
Essex Marilyn J
Kobor Michael S
MacIsaac Julia L
McEwen Lisa M
Park Mina
Publication venue: eScholarship, University of California
Publication date: 01/11/2018
Field of study

AimTo examine variation in child DNA methylation to assess its potential as a pathway for effects of childhood social adversity on health across the life course.Materials & methodsIn a diverse, prospective community sample of 178 kindergarten children, associations between three types of social experience and DNA methylation within buccal epithelial cells later in childhood were examined.ResultsFamily income, parental education and family psychosocial adversity each associated with increased or decreased DNA methylation (488, 354 and 102 sites, respectively) within a unique set of genomic CpG sites. Gene ontology analyses pointed to genes serving immune and developmental regulation functions.ConclusionFindings provided support for DNA methylation as a biomarker linking early-life social experiences with later life health in humans

eScholarship - University of California

The Cancer Genome Atlas Clinical Explorer: a web and mobile interface for identifying clinical–genomic driver associations

Author
Publication venue: BioMed Central
Publication date: 27/10/2015
Field of study

Springer - Publisher Connector

Summaries of plenary, symposia, and oral sessions at the XXII World Congress of Psychiatric Genetics, Copenhagen, Denmark, 12-16 October 2014

Author: Aas Monica
Blokland Gabriëlla A.M.
Chawner Samuel J.R.A.
Choi Shing-Wan
Cormack Freida K.
DeLisi Lynn
Estrada Jose
Forsingdal Annika
Friedrich Maximilian
Ganesham Suhas
Hall Lynsey
Haslinger Denise
Huckins Laura
Loken Erik
Malan-Müller Stefanie
Martin Joanna
Misiewicz Zuzanna
Pagliaroli Luca
Pardiñas Antonio F.
Pisanu Claudia
Quadri Giorgia
Ranlund Siri
Santoro Marcos L.
Shaw Alex D.
Song Jie
Tesli Martin
Tropeano Maria
van der Voet Monique
Wolfe Kate
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/01/2016
Field of study

The XXII World Congress of Psychiatric Genetics, sponsored by the International Society of Psychiatric Genetics, took place in Copenhagen, Denmark, on 12-16 October 2014. A total of 883 participants gathered to discuss the latest findings in the field. The following report was written by student and postdoctoral attendees. Each was assigned one or more sessions as a rapporteur. This manuscript represents topics covered in most, but not all of the oral presentations during the conference, and contains some of the major notable new findings reported

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

IUPUIScholarWorks

Online Research @ Cardiff

Repositório Institucional UNIFESP

Computational models of gene expression regulation

Author: Behjati Ardakani Fatemeh
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2019
Field of study

Throughout the last several decades, many efforts have been put into elucidating the genetic or epigenetic defects that result in various diseases. Gene regulation, i.e., the process of how genes are turned on and off in the right place and at the right time, is a paramount and prevailing question for researchers. Thanks to the discoveries made by researchers in this field, our understanding of interactions between proteins and DNA or proteins with themselves, as well as the dynamics of chromatin structure under different conditions, have substantially advanced. Even though there has been a lot achieved through these discoveries, there are still many unknown aspects about gene regulation. For instance, proteins called transcription factors (TFs) recognize and bind to specific regions of DNA and recruit the transcriptional machinery, which is essential for gene regulation. As there have been more than 2000 TFs identified in the human genome, it is important to study where they bind to or which genes they target. Computational approaches are important, in particular, as the biological experiments are often very expensive and cannot be done for all TFs. In 2016, a competition named DREAM Challenge was held encouraging researchers to develop novel computational tools for predicting the binding sites of several TFs. The first chapter of this thesis describes our machine learning approach to address this challenge within the scope of the competition. Using ensembles of random forest classifiers, we formulated our framework such that it is able to benefit from the tissue specificity inherent in the data leading to better generalization. Also, our models were tailored for spotting cofactors involved in the binding of TFs of interest. Comparing the important TFs that our computational models suggested with protein-protein association networks revealed that the models preferentially select motifs of TFs that are potential interaction partners in those networks. Another important aspect beyond predicting TF binding is to link epigeneomics, such as histone modification (HM) data, with gene expression. We, particularly, concentrated on predicting expression in a subset of genes called bidirectional. Bidirectional genes are referred to as pairs of genes that are located on opposite strands of DNA close to each other. As the sequencing technologies advance, more such bidirectional configurations are being detected. This indicates that in order to understand the gene regulatory mechanisms, it would be beneficial to account for such promoter architectures. In the second and third chapters, we focused on genes having bidirectional promoter architectures utilizing high resolution epigenomic signatures and single cell RNA-seq data to dissect the complex epigenetic architecture at these promoters. Using single-cell RNA-seq data as the estimate of gene expression, we were able to generate a hypothetical model for gene regulation in bidirectional promoters. We showed that bidirectional promoters can be categorized into three architecture types with distinct characteristics. Each of these categories corresponds to a unique gene expression profile at single cell level. The single cell RNA-seq data proved to be a powerful means for studying gene regulation. Therefore, in the last chapter, we proposed a novel approach for predicting gene expression at the single cell level using cis-regulatory motifs as well as epigenetic features. To achieve this, we designed a tree-guided multi-task learning framework that considers each cell as a task. Through this framework we were able to explain the single cell gene expression values using either TF binding affinities or TF ChIP-seq data measured at specific genomic regions. This allowed us to identify distinct TFs that show cell-type specific regulation in induced pluripotent stem cells. Our approach does not only limit to TFs, rather it can take any type of data that can potentially be used in explaining gene expression at single cell level. We believe that our findings can be used in drug discovery and development that can regulate the presence of TFs or other regulatory factors, which lead the cell fate into abnormal states, to prevent or cure diseases.In den letzten Jahrzehnten wurden große Anstrengungen unternommen, um die genetischen oder epigenetischen Defekte aufzuklären, die zu verschiedenen Krankheiten führen. Die Genregulation, d.h. der Prozess der Ein- und Abschaltung der Gene am richtigen Ort und zur richtigen Zeit reguliert, ist für die Forscher eine Frage von zentraler Bedeutung. Dank der Entdeckungen von Forschern auf diesem Gebiet ist unser Verständnis der Wechselwirkungen zwischen zwischen den Proteinen und der DNA oder der Proteine untereinander sowie der Dynamik der Chromatinstruktur unter verschiedenen Bedingungen wesentlich fortgeschritten. Obwohl durch diese Entdeckungen viel erreicht wurde, gibt es noch viele unbekannte Aspekte der Genregulation. Beispielsweise erkennen Proteine, sogenannte Transkriptionsfaktoren (Transcription Factors, TFs), bestimmte Bereiche der DNA und binden an diese und rekrutieren die Transkriptionsmaschinerie, die für die Genregulation erforderlich ist. Da mehr als 2000 TFs im menschlichen Genom identifiziert wurden, ist es wichtig zu untersuchen, wo sie binden oder auf welche Gene sie abzielen. Rechnerische Ansätze sind insbesondere wichtig, da die biologischen Experimente oft sehr teuer sind und nicht für alle TFs durchgeführt werden können. Im Jahr 2016 fand ein Wettbewerb namens DREAM Challenge statt, bei dem Forscher aufgefordert wurden, neuartige Rechenwerkzeuge zur Vorhersage der Bindungsstellen mehrerer TFs zu entwickeln. Das erste Kapitel dieser Arbeit beschreibt unseren Ansatz des maschinellen Lernens, um diese Herausforderung im Rahmen des Wettbewerbs anzugehen. Unter Verwendung von Ensembles von Random Forest Klassifikatoren haben wir unser Framework so formuliert, dass es von der Gewebespezifität der Daten profitiert und damit zu einer besseren Generalisierung führt. Außerdem wurden unsere Modelle auf das Erkennen von Kofaktoren angepasst, die an der Bindung von TFs beteiligt sind, die für uns von Interesse sind. Der Vergleich der wichtigen TFs, die unsere Computermodelle mit Protein-Protein-Assoziationsnetzwerken vorschlugen, ergab, dass die Modelle bevorzugt Motive von TFs auswählen, die potenzielle Interaktionspartner in diesen Netzwerken sind. Ein weiterer wichtiger Aspekt, der über die Vorhersage der TF-Bindung hinausgeht, besteht darin, epigeneomische Faktoren wie Histonmodifikationsdaten (HM-Daten) mit der Genexpression zu verknüpfen. Wir konzentrierten uns insbesondere auf die Vorhersage der Expression in einer Untergruppe von Genen, die als bidirektional bezeichnet werden. Bidirektionale Gene werden als Paare von Genen bezeichnet, die sich auf gegenüberliegenden DNA-Strängen befinden und nahe beieinander liegen. Mit dem Fortschritt der Sequenzierungstechnologien werden immer mehr solche bidirektionalen Konfigurationen erkannt. Dies weist darauf hin, dass es zum Verständnis der Genregulationsmechanismen vorteilhaft wäre, solche Promotorarchitekturen zu berücksichtigen. Im zweiten und dritten Kapitel konzentrierten wir uns auf Gene mit bidirektionalen Promotorarchitekturen, um mit Hilfe von epigenomischen Signaturen und Einzelzell-RNA-Sequenzdaten die komplexe epigenetische Architektur an diesen Promotoren zu analysieren. Unter Verwendung von Einzelzell-RNA-Sequenzdaten als Schätzung der Genexpression konnten wir ein hypothetisches Modell für die Genregulation in bidirektionalen Promotoren aufstellen. Wir haben gezeigt, dass bidirektionale Promotoren in drei Architekturtypen mit unterschiedlichen Merkmalen eingeteilt werden können. Jede dieser Kategorien entspricht einem eindeutigen Genexpressionsprofil auf Einzelzellebene. Die Einzelzell-RNA-Sequenzdaten erwiesen sich als leistungsstarkes Mittel zur Untersuchung der Genregulation. Daher haben wir im letzten Kapitel einen neuen Ansatz zur Vorhersage der Genexpression auf Einzelzellebene unter Verwendung von cis-regulatorischen Motiven sowie epigenetischen Merkmalen vorgeschlagen. Um dies zu erreichen, haben wir ein baumgesteuertes Multitasking-Lernsystem entwickelt, das jede Zelle als eine Aufgabe betrachtet. Durch dieses Gerüst konnten wir die Einzelzellgenexpressionswerte entweder mit TF-Bindungsaffinitäten oder mit TF-ChIP-Sequenzdaten erklären, die in bestimmten Genomregionen gemessen wurden. Dies ermöglichte es uns, verschiedene TFs zu identifizieren, die eine zelltypspezifische Regulation in induzierten pluripotenten Stammzellen zeigen. Unser Ansatz beschränkt sich nicht nur auf TFs, sondern kann jede Art von Daten verwenden, die potentiell zur Erklärung der Genexpression auf Einzelzellebene verwendet werden können. Wir glauben, dass unsere Erkenntnisse für die Entdeckung und Entwicklung von Arzneimitteln verwendet werden können, die das Vorhandensein von TFs oder anderen regulatorischen Faktoren regulieren können, die die Zellen abnormal werden lassen, um Krankheiten zu verhindern oder zu heilen

Universaar

Acronym

MPG.PuRe

Systems genomics analysis of complex cognitive traits

Author: Freytag Virginie
Publication venue
Publication date: 01/01/2017
Field of study

The study of the genetic underpinnings of human cognitive traits is deemed an important tool to increase our understanding of molecular processes related to physiological and pathological cognitive functioning. The polygenic architecture of such complex traits implies that multiple naturally occurring genetic variations, each of small effect size, are likely to influence jointly the biological processes underlying cognitive ability. Genetic association results are yet devoid of biological context, thus limiting both the identification and functional interpretation of susceptibility variants. This biological gap can be reduced by the integrative analysis of intermediate molecular traits, as mediators of genomic action. In this thesis, I present results from two such systems genomics analyses, as attempts to identify molecular patterns underlying cognitive trait variability. In the first study, we adopted a system-level approach to investigate the relationship between global age-related patterns of epigenetic variation and cortical thickness, a brain morphometric measure that is linked to cognitive functioning. The integration of both genome-wide methylomic and genetic profiles allowed the identification of a peripheral molecular signature that showed association with both cortical thickness and episodic memory performance. In the second study, we explicitly modeled the interdependencies between local genetic markers and peripherally measured epigenetic variations. We thus generated robust estimators of epigenetic regulation and showed that these estimators resulted in the identification of epigenetic underpinnings of schizophrenia, a common genetically complex disorder. These results underscore the potential of systems genomics approaches, capitalizing on the integration of high-dimensional multi-layered molecular data, for the study of brain- related complex traits

edoc