Search CORE

3,750 research outputs found

A robust mean and variance test with application to high-dimensional phenotypes

Author: Davey Smith George
Lyon Matt S
Staley James R
Suderman Matthew J
Tilling Kate M
Windmeijer Frank
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Most studies of continuous health-related outcomes examine differences in mean levels (location) of the outcome by exposure. However, identifying effects on the variability (scale) of an outcome, and combining tests of mean and variability (location-and-scale), could provide additional insights into biological mechanisms. A joint test could improve power for studies of high-dimensional phenotypes, such as epigenome-wide association studies of DNA methylation at CpG sites. One possible cause of heterogeneity of variance is a variable interacting with exposure in its effect on outcome, so a joint test of mean and variability could help in the identification of effect modifiers. Here, we review a scale test, based on the Brown-Forsythe test, for analysing variability of a continuous outcome with respect to both categorical and continuous exposures, and develop a novel joint location-and-scale score (JLSsc) test. These tests were compared to alternatives in simulations and used to test associations of mean and variability of DNA methylation with gender and gestational age using data from the Accessible Resource for Integrated Epigenomics Studies (ARIES). In simulations, the Brown-Forsythe and JLSsc tests retained correct type I error rates when the outcome was not normally distributed in contrast to the other approaches tested which all had inflated type I error rates. These tests also identified > 7500 CpG sites for which either mean or variability in cord blood methylation differed according to gender or gestational age. The Brown-Forsythe test and JLSsc are robust tests that can be used to detect associations not solely driven by a mean effect

Oxford University Research Archive

Analysing multiple types of molecular profiles simultaneously: Connecting the needles in the haystack

Author: Boer J.M. (Judith)
Goeman J. (Jelle)
Menezes R.X. (Renée)
Mohammadi L. (Leila)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/02/2016
Field of study

Background: It has been shown that a random-effects framework can be used to test the association between a gene's expression level and the number of DNA copies of a set of genes. This gene-set modelling framework was later applied to find associations between mRNA expression and microRNA expression, by defining the gene sets using target prediction information. Methods and results: Here, we extend the model introduced by Menezes et al. 2009 to consider the effect of not just copy number, but also of other molecular profiles such as methylation changes and loss-of-heterozigosity (LOH), on gene expression levels. We will consider again sets of measurements, to improve robustness of results and increase the power to find associations. Our approach can be used genome-wide to find associations and yields a test to help separate true associations from noise. We apply our method to colon and to breast cancer samples, for which genome-wide copy number, methylation and gene expression profiles are available. Our findings include interesting gene expression-regulating mechanisms, which may involve only one of copy number or methylation, or both for the same samples. We even are able to find effects due to different molecular mechanisms in different samples. Conclusions: Our method can equally well be applied to cases where other types of molecular (high-dimensional) data are collected, such as LOH, SNP genotype and microRNA expression data. Computationally efficient, it represents a flexible and powerful tool to study associations between high-dimensional datasets. The method is freely available via the SIM BioConductor package

Erasmus University Digital Repository

FigShare

Statistical Inference for High-Dimensional Genetic Data

Author: Li Xuan
Publication venue
Publication date: 05/03/2019
Field of study

This dissertation focuses on three types of high-dimensional genetic data: protein sequences, DNA methylation data, and microRNA expression data. The four major parts are presented in Chapters 2-5, respectively. In Chapter 2, we develop a new clustering method for protein sequences. First, we reduce the dimensionality based on entropy. Second, the sequences are clustered using the Hamming distance vectors of chosen sites. We apply this new method to an influenza A H3N2 HA data set, which consists of 1960 viral sequences. Our method aggregates these sequences into 23 clusters. Based on the temporal evolution pattern of these clusters, we find that the dominant clusters change from time to time and are often different from the clusters housing vaccine strains. In Chapter 3, we conduct systematic simulation studies and real data analysis to compare the performance of seven statistical tests for equal-variance hypothesis. Our results show that Brown-Forsythe test and trimmed-mean-based-Levene's test have better performance on DNA methylation data in comparison with other tests. Detection of differential DNA methylation and differential variability have received a lot of attention in the literature. In Chapter 4, we derive the asymptotic distribution of a joint score test (AW), proposed by Anh and Wang (2013). Furthermore, we propose three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM. Systematic simulation studies show that at least one of the proposed tests performs better than the existing tests for data with outliers or from non-normal distributions. The real data analyses demonstrate that the three proposed tests have higher true validation rates than the existing tests. Besides DNA methylation, microRNA regulation is another important epigenetic mechanism. In Chapter 5, we propose a novel model-based clustering method to detect differentially variable (DV) miRNAs. We impose biologically meaningful structures on covariance matrices for each cluster of miRNAs. Simulation studies show that the proposed method performs better than other model-based methods when miRNA expression levels are from a multivariate normal distribution. In real data analysis, the proposed method has a higher validation rate than other methods

Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

Author: Abdel-Rahman Mohamed H.
Abeshouse Adam
Adebamowo Clement
Adebamowo Sally N.
Agnew Kathy
Ahn Keunsoo
Ajani Jaffer A.
Akbani Rehan
Akbani Rehan
Akeredolu Teniola
Al-Ahmadie Hikmat
Al-Ahmadie Hikmat
Albert Monique
Alexopoulou Iakovina
Ally Adrian
Alvaro Domenico
Anderson Matthew L.
Andry Chris
Antenucci Anna
Anur Pavana
Anur Pavana
Appelbaum Elizabeth L.
Aredes Natália D.
Armenia Joshua
Arnaout Angel
Asa Sylvia L.
Auman J. Todd
Aymerich Marta
Aziz Dina
Bailey Matthew
Balasundaram Miruna
Balu Saianand
Barnett Gene
Barnholtz-Sloan Jill S.
Barrett Wendi
Bartlett John
Bathe Oliver
Baudin Eric
Baylin Stephen
Becker Karl-Friedrich
Beer David
Behera Madhusmita
Behrens Carmen
Bell Debra
Bell Sue
Bellair Michelle
Bennett Joseph
Benz Christopher
Benz Christopher
Berchuck Andrew
Berger Ashton C.
Bergeron Alain
Berkowitz Ross
Bernard Brady
Beroukhim Rameen
Beuschlein Felix
Bifulco Carlo
Bigner Darell
Birrer Michael
Birsoy Kivanc
Bocklage Therese
Bodenheimer Tom
Bondaruk Jolanta
Bootwalla Moiz S.
Borad Mitesh
Borgia Jeffrey A.
Bossler Aaron
Botnariuc Natalia
Boussioutas Alex
Bowen Jay
Bowlby Reanne
Bowlby Reanne
Bowman Rayleen
Bradford Carol
Bragazzi Maria Consiglia
Brat Daniel J.
Breggia Anne
Brennan Kevin
Brewer Cathy
Brimo Fadi
Broaddus Russell
Brooks Denise
Broom Bradley M.
Broudy Thomas
Bryce Alan H.
Bubley Glenn
Bueno Raphael
Bullman Susan
Burnette Andrew
Byers Lauren Averett
Caesar-Johnson Samantha J.
Calatozzolo Chiara
Campbell Joshua D.
Campo Elias
Campos Benito
Caraman Irina
Cardinale Vincenzo
Carey Francis
Carey Thomas
Carlotti Carlos Gilberto
Carlsen Rebecca
Carney Michael
Carpino Guido
Carroll Peter R.
Carter Candace
Carvalho Andre L.
Castle Erik
Castro Patricia D.
Catto James
Cebulla Colleen M.
Cernat Mircea
Chabot John
Chakravarty Debyani
Chambwe Nyasha
Chan June M.
Chan Timothy A.
Chandan Vishal
Chang Kyle
Chatila Walid K.
Chemencedji Inga
Chen Amy
Chen Chu
Chen Jianhong
Chen Ting-Wen
Chen Zhong
Cheng Feixiong
Cheng Hui
Cherniack Andrew D.
Cherniack Andrew D.
Chesla David
Chevalier Simone
Cheville John
Chiu Hua-Sheng
Cho Juok
Chuah Eric
Chudamani Sudha
Chung Ki
Cibulskis Carrie
Clipca Adrian
Coarfa Cristian
Colman Howard
Cope Leslie
Copland John A.
Corcoran Niall
Cordes Matthew G.
Costello Tony
Cottingham Sandra
Couce Marta
Covington Kyle
Crain Daniel
Cramer Daniel
Creaney Jenette
Creighton Chad J.
Creighton Chad J.
Cuppini Lucia
Curley Erin
Cuzzubbo Stefania
Czerniak Bogdan
Danilova Ludmila
Davis Amy
de Bruijn Ino
de Carvalho Ana C.
de Krijger Ronald
De Rienzo Assunta
De Rose Agostino
Defreitas Timothy
Delman Keith
Demchok John A.
Desjardins Laurence
Devine Karen
Deyarmin Brenda
Dhalla Noreen
Dhanasekaran Renumathy
Dhankani Varsha
Dhir Rajiv
Diao Lixia
Dimeco Francesco
Ding Li
Dinh Huyen
Dinkin Mikhail
Dipersio John
Disaia Philip
Doddapaneni Harshavardhan
Donehower Larry
Donehower Lawrence A.
Doruc Serghei
Dos Santos Jose Sebastião
Dottino Peter R.
Drake Bettina
Drill Esther
Drummond Jennifer
Drwiega Paul
Dubina Michael
Duell Rebecca
Duffy Elizabeth R.
Eckman John
Edenfield W. Jeffrey
Eijckenboom Wil
Elder J. Bradley
Engel Jay
Eschbacher Jennifer
Esmaeli Bita
Evason Kimberley
Facciolo Francesco
Fan Cheng
Fan Huihui
Fan Huihui
Fantacone-Campbell J. Leigh
Farnell Michael
Farver Carol
Fassnacht Martin
Fehrenbach Ashley
Felau Ina
Feldman Michael
Feltmate Colleen
Ferguson Martin L.
Finocchiaro Gaetano
Flores Elsa R.
Flotte Thomas
Fong Kwun M.
Force Seth
Forgie Ian
Frazer Scott
Fregnani José H.
Fronick Catrina C.
Fujimoto Junya
Fulop Jordonna
Fulton Lucinda A.
Fulton Robert S.
Gabra Hani
Gabriel Stacey B.
Galbraith Joseph
Gao Galen F.
Gao Jianjiong
Gardner Johanna
Gastier-Foster Julie M.
Gaudio Eugenio
Gay Carl M.
Gehlenborg Nils
Gerken Mark
Gershenwald Jeffrey
Getz Gad
Gevaert Olivier
Ghossein Ronald
Giama Nasra
Gibbs Richard A.
Gilbert Sebastien
Gillis Ad
Gimenez-Roqueplo Anne-Paule
Giné Eva
Giordano Thomas
Girard Nicolas
Giuliante Felice
Glenn Pat
Glenn Robert
Godwin Andrew K.
Godwin Eryn M.
Gonzalez Ana Maria Angulo
Goodman Marc
Gopalan Anuradha
Goparaju Chandra
Gorincioi Ghenadie
Govindan Ramaswamy
Graefen Markus
Grazi Gianluca
Grizzle William E.
Gross Benjamin E.
Guillermo Armando López
Gunaratne Preethi
Guo Charles
Ha Gavin
Haddad Andrea
Hagedorn Curt H.
Hale Walker
Han Yi
Hanh Phan Thi
Hansen Paul
Harr Jodi
Hartmann Arndt
Haydu Lauren
Hayes D. Neil
Hayes D. Neil
Hayward Nicholas
Heath Sharon
Hegde Apurva
Hegde Apurva M.
Heiman David I.
Heins Zachary J.
Henderson Joel
Hermes Beth
Hernandez Brenda
Herold-Mende Christel
Hersey Peter
Hess Julian
Hibshoosh Hanina
Hilty Joe
Hinoue Toshinori
Ho Thai
Hoadley Katherine A.
Hoadley Katherine A.
Holt Robert
Hooke Jeffrey A.
Hoon Dave
Horowitz Neil
Houck John
Hovens Christopher
Hoyle Alan P.
Hu Hai
Hu Jianhong
Huland Hartwig
Hung Nguyen Phi
Huntsman David
Hutter Carolyn M.
Iacocca Mary
Ittmann Michael
Jacobus Laura
Jakrot Valerie
Janssen Klaus-Peter
Jefferys Stuart R.
Jimeno Antonio
Jones Corbin D.
Jones Steven J. M.
Ju Zhenlin
Juhl Hartmut
Jungk Christin
Junker Kerstin
Kakavand Hojabr
Kalkanis Steven
Kanchi Rupa S.
Kanchi Rupa S.
Kandoth Cyriac
Kang Koo Jeong
Karlan Beth Y.
Kasaian Katayoon
Kasapi Melpomeni
Kastl Alison
Kebebew Electron
Kefford Richard
Kelley Robin K.
Kemp Rafael
Kendall Sara
Kendler Ady
Kendrick Michael
Khuri Fadlo
Kibel Adam
Kim Jaegil
Knijnenburg Theo
Knudson Michael
Knutson Tina
Kocher Jean-Pierre
Kohl Bernard
Kopp Karla
Korchina Viktoriya
Korkut Anil
Korpershoek Esther
Korst Robert
Kovatich Albert J.
Kramer Roger
Kucherlapati Melanie H.
Kucherlapati Raju S.
Kumar Bahavna
Kundra Ritika
Kvecher Leonid
Kycler Witold
La Konnor
Lacombe Louis
Ladanyi Marc
Lai Phillip H.
Laird Peter W.
Laird Peter W.
Landen Charles N.
Landrum Lisa
Lang James
Larson Caroline
Latour Mathieu
Lau Kevin
Lawrence Michael S.
Lazar Alexander J.
Lazar Alexander J.
Le Xuan
Lechan Ronald
Lee Darlene
Lee Jung Il
Lee Kenneth
Lee Sandra
Lehman Norman L.
Leinonen Kalle
Leraas Kristen M.
Levine Douglas A.
Lewis Lora
Ley Timothy
Li Jun
Li Wei
Liang Han
Lichtenberg Tara M.
Lin Pei
Linehan W. Marston
Ling Shiyun
Lipp Eric
Liptay Michael J.
Liu Jia
Liu Wenbin
Liu Xiuping
Liu Yuexin
Liu Yuexin
Logothetis Christopher
Lohavanichbutr Pawadee
Lolla Laxmi
Long Georgina
Longatto-Filho Adhemar
Looijenga Leendert
Lu Yiling
Luketich James
Luna Augustin
Lyadov Vladimir
Ma Deqin
Ma Wencai
Ma Yussanne
Madan Rashna
Maglinte Dennis T.
Magliocca Kelly
Maithel Shishir
Mallery David
Malykh Andrei
Mandt Randy
Manikhas George
Mann Graham
Mannel Robert
Mannelli Massimo
Mardis Elaine R.
Mariamidze Armaz
Mariani Odette
Marino Mirella
Marks Jeffrey
Marra Marco A.
Martignetti John A.
Martin Julie
Mattei Luca
Mayo Michael
Mccall Shannon
Mcgraw Mary
Mckercher Ginette
Mclellan Michael D.
Mclendon Roger
Mcpherson Christopher
Meier Sam
Melamed Jonathan
Meng Shaowu
Meric-Bernstam Funda
Merola Roberta
Mes-Masson Anne-Marie
Metwalli Adam R.
Meyerson Matthew
Mieczkowski Piotr A.
Mikkelsen Tom
Milhem Mohammed
Miller Christopher A.
Miller Judy
Miller Michael
Mills Gordon B.
Mirsaidi Cyrus
Moiseenko Fedor
Moncrieff Marc
Moore Kathleen
Moore Richard A.
Moran Cesar
Morgan Margaret
Morris Scott
Morrison Carl
Morton Donna
Mose Lisle E.
Moser Catherine
Moxley Katherine
Moyer Jeffey
Mungall Andrew J.
Mungall Karen
Mungall Karen L.
Mura Sergiu
Mural Richard J.
Murawa Dawid
Muto Michael
Muzny Donna
Myers Jerome
Nagorney David
Nair Praveen
Naresh Rashi
Naska Theresa
Nelson Mark
Ng Kwok-Shing
Nguyen Phuong
Nissan Moriah G.
Noble Michael S.
Noss Ardene
Noushmehr Houtan
O'Brien Daniel
O'Neill Brian Patrick
Ochoa Angelica
Ojesina Akinyemi I.
Olabode Oluwole
Olson Jeffrey J.
Omberg Larsson
Oosterhuis Wolter
Ostrom Quinn T.
Owonikoko Taofeek
Pacak Karel
Paklina Oxana
Parfitt Jeremy
Park Joong-Won
Parker Joel S.
Pass Harvey
Patel Tushar
Paulauskis Joseph
Pedamallu Chandra Sekhar
Pennell Nathan A.
Penny Robert
Perin Alessandro
Perou Amy H.
Perou Charles M.
Petersen Gloria
Peterson Lisa
Peto Myron
Peto Myron
Phillips Joanna
Phillips Sarah M.
Phu Bui Duc
Piché Alain
Pickens Alan
Pickering Curtis R.
Pihl Todd
Pilarski Robert
Pinero Edna M. Mora
Pinto Peter A.
Pirtac Maria
Pollo Bianca
Pool Mark
Porten Sima
Postier Russel
Potapova Olga
Powers James
Prados Michael
Prince Mark
Prunello Marcos
Que Florencia
Quinn Michael
Quintero-Aguilo Mario
Rabeno Brenda
Rader Janet S.
Rai Karan
Ramalingam Suresh
Ramirez Nilsa C.
Ramondetta Lois
Rao Arvind
Rassl Doris M.
Rathmell W. Kimryn
Raut Chandrajit P.
Raymond Daniel
Reis Rui M.
Reuter Victor
Reynolds Sheila
Reznik Ed
Rice David
Richards William G.
Rintoul Robert C.
Rivera Michael
Roach Jeffrey
Roberts Lewis
Robertson A. Gordon
Robertson A. Gordon
Robinson Bruce
Roggin Kevin
Roman-Roman Sergio
Rosenthal Howard G.
Rozek Laura
Rubin Mark A.
Rustgi Anil K.
Ryan Michael
Saad Fred
Sadeghi Sara
Sadeghi Sara
Saksena Gordon
Sanchez-Vega Francisco
Sander Chris
Sankarankuty Ajith
Santibanez Jireh
Sastre Xavier
Sauter Guido
Saw Robyn
Scapulatempo-Neto Cristovam
Scarpace Lisa
Schadendorf Dirk
Schein Jacqueline E.
Schiffman Mark
Schilero Cathy
Schlomm Thorsten
Schmidt Heather K.
Schmidt Laura S.
Schoenfield Lynn
Schultz Andre
Schultz Nikolaus
Schumacher Steven E.
Scolyer Richard
Secord Angeles
Seder Christopher W.
Sekhon Harman
Senecal Kelly
Sepulveda Antonia
Setdikova Galiya
Sexton Katherine C.
Shabunin Alexey
Shannon Kerwin
Sharp Alexis
Shelley Carl Simon
Shelton Candace
Shelton Troy
Shen Hui
Shen Hui
Shen Ronglei
Sheridan Robert
Sherman Mark
Sheth Margi
Shi Yan
Shih Juliann
Shih Juliann
Shimmel Kristen
Shin Dong M.
Shinbrot Eve
Shipman Cassaundra
Shmulevich Ilya
Shriver Craig D.
Sica Gabriel
Sifri Suzanne
Sigmund Rita
Signoretti Sabina
Silveira Henrique C. S.
Simko Jeffry
Simon Ronald
Simons Janae V.
Singer Samuel
Singh Bhuvanesh
Singh Rosy
Sipahimalani Payal
Skelly Tara
Sloan Andrew E.
Slotta-Huspenina Julia
Smallridge Robert
Smith Jennifer
Smith-McCune Karen
Smolenski Kathy
Smyrk Thomas
Sofia Heidi J.
Soloway Matthew G.
Somiari Stella
Sood Anil
Spellman Paul
Spillane Andrew
Stancul Irina
Stanton Melissa
Staugaitis Susan M.
Steele Ruth
Stepa Serghei
Stern Marc-Henri
Stoehr Christine
Stoehr Robert
Stoop Hans
Stretch Onathan
Stuart Joshua M.
Stuart Joshua M.
Su Tao
Sumazin Pavel
Sumer S. Onur
Sun Qiang
Sun Yichao
Swanson Patricia
Swisher Elizabeth
Synott Maria
Tam Angela
Tamakawa Raina
Tamboli Pheroze
Tan Donghui
Tang Yufang
Tarnuzzer Roy
Taubert Helge
Tavobilov Mikhail
Taylor Alison M.
Taylor Barry S.
Tcaciuc Diana
Tennstedt Pierre
Thiessen Nina
Thomas George
Thompson Eric
Thompson John
Thompson R. Houston
Thompson Timothy
Thorp Richard
Thorsson Vesteinn
Tien Nguyen Viet
Timmers Henri
Tirapelli Daniela
Tischler Arthur
Torbenson Michael
Troncoso Patricia
Tsai Kenneth Y.
Tsao Anne
Tse Kane
Tucker Kelinda
Têtu Bernard
Unterberg Andreas
Urba Walter
Valdivieso Federico
Van Bang Nguyen
Van Den Berg David J.
van Kessel Kim E.
Van Meir Erwin G.
Van Tine Brian
Van Waes Carter
Vandenberg Scott
Veluvolu Umadevi
Vicha Ales
Vidal Daniel O.
Vocke Cathy D.
Voet Doug
von Deimling Andreas
Voronina Olga
Wach Sven
Wakely Paul
Waldmann Jens
Walker Joan
Walter Vonn
Wan Yunhu
Wang Chen
Wang Jing
Wang Jing
Wang Jioajiao
Wang Linghua
Wang Min
Wang Timothy
Wang Zhining
Warnick Ronald
Weinstein John N.
Weinstein John N.
Weisenberger Daniel J.
Wentzensen Nicolas
Westervelt Peter
Wheeler David A.
Wilkerson Matthew D.
Williams Felicia
Wilmott James
Wilson Richard K.
Wise Lisa
Wistuba Ignacio
Wiznerowicz Maciej
Wolf Gregory
Wolinsky Yingli
Wong Christopher K.
Wong Christopher K.
Wong Tina
Worrell Robert
Wu Ye
Wullich Bernd
Xi Liu
Yang Hannah
Yang Ian
Yang Ju Dong
Yang Liming
Yau Christina
Yau Christina
Yena Peggy
Zach Leigh Anne
Zaren Howard
Zelinka Tomas
Zenklusen Jean C.
Zhang Hailei
Zhang Hongxin
Zhang Hongzheng
Zhang Jiashan (Julia)
Zhang Jiexin
Zhang Lizhi
Zhang Wei
Zhao Fengmei
Zhou Jane H.
Zhou Wanding
Zmuda Erik
Zuna Rosemary
Zuna Rosemary
Zwarthoff Ellen C.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

Directory of Open Access Journals

Archivio della ricerca- Università di Roma La Sapienza

Association Analysis Using Set-Based Approaches in the Post-GWAS Era

Author: Yasmeen Summaira
Publication venue: University Goettingen Repository
Publication date: 21/07/2021
Field of study

Genotyping arrays have greatly facilitated genetic epidemiological studies into genetic risk factors for numerous complex diseases such as psychiatric disorders. The use of genome-wide association analysis (GWAS) is unequivocally established. More recently, DNA methylation arrays have enabled genome-wide profiling of the methylome, in addition to contemporary genetic epidemiology study design. An example of one such study is the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) Lipidomics Study, which identified methylation markers (CpG markers) and single nucleotide polymorphisms (SNPs), associated with the change in triglyceride levels after drug intervention. Genotyping and methylation arrays assay several hundred thousand markers; however, single-marker association analysis suffers greatly from the burden of multiple testing. Set-based (SNP or CpG set) association approaches offer great flexibility, thus allowing the joint testing of a set of variants. For instance, a polygenic risk score (PRS) is a set-based approach, which, in addition to the strongly associated SNPs identified by large-scale GWAS, recruits SNPs with moderate to weak effects. The genotype information of the SNP set in the PRS is taken from an independent sample (target sample) and is then weighted by individual SNP effects derived from a relevant GWAS performed on a separate sample (discovery sample) into a cumulative score for each individual in the target sample. The resulting score, based on a SNP set or the PRS, is then regressed on the target phenotype. Such a regression model is evaluated by the amount of variance explained (R2) by the PRS in the target phenotype. Another strategy of set-based association analysis is kernel machine regression (KMR): a semi-parametric regression approach, in which the effects of markers within a set (CpG set or SNP set) are modelled via a kernel function and thus evaluated by a single-component variance test. A kernel function computes pairwise genomic similarity between the individuals, that is, the inner product of a set of variants under analysis, maybe comprising a gene or a biological pathway. For my first article, I performed a simulation study to evaluate the performance of PRS in correlated discovery and target traits by considering various sample sizes of the target sample, namely n=200, 500, and 1000. The PRS for correlated traits can be viewed as a situation of calculating schizophrenia-PRS for psychosocial endophenotypes such as global assessment functioning (GAF) score or positive and negative syndrome scale (PANSS) score. Considering such a situation, I simulated four correlated target traits that had varying degrees of correlation (r2) with the discovery trait, i.e., r2= 1.00, 0.8, 0.6, and 0.4. The results demonstrated that the average R2 estimates by the PRS roughly decreased by the square of the correlation between the target traits. In addition, the range of estimated R2 is most inflated in the sample size of the target trait n=200. Thus, the simulation findings alert researchers conducting clinical studies with endophenotypes to the fact that they need to pay attention to two important factors: first, the sample size of the target trait and secondly, the shared amount of genetic correlation between the target and discovery traits. In my second article, I implemented a KMR approach for set-based association testing of a CpG set. KMR has been successfully employed on SNP sets. In preparation of the second article, I used real and simulated datasets (based on a real dataset) provided by the Genetic Analysis Workshop 20 (GAW20) from the GOLDN study. GOLDN is a longitudinal study with individuals recruited from pedigrees. In my analysis, I only used independent individuals, which restricted the sample size in the real and simulated datasets to n<200. CpG sets were devised using the evidence of association reported by the GOLDN study in the real data set. For simulated datasets, true causal CpGs were provided by GAW20. Thus, I formulated candidate genomic regions of varying lengths while keeping the associated CpG(s) inside the region. The results replicated the evidence of association reported by GOLDN in the real data, and in simulated datasets albeit nominally. Moreover, in the simulated data, causal SNPs exert their full effect on the phenoytpes given when the causal CpG loci had no methylation (B-value=0). Thus, I also considered modelling an interaction term along with the main effects. The results yielded significant association. As part of the discussion, simulation results on the performance of the linear kernel for a CpG set with original (B-values) and logit transformed methylation values (M-values) indicated that logit transformation results in a loss of power. There, I also considered analysing an additive kernel that combines the genotype kernel and the methylation kernel and then tests for association with the phenotype. The initial simulations suggest that an additive kernel with a CpG set including hypo, semi, and hypermethylated sites simultaneously might not improve the model over only including a SNP set. However, it appears fruitful to investigate further the situation in which only one type of methylation state is present in a CpG set