119 research outputs found

    Statistical methods for gene selection and genetic association studies

    Get PDF
    This dissertation includes five Chapters. A brief description of each chapter is organized as follows. In Chapter One, we propose a signed bipartite genotype and phenotype network (GPN) by linking phenotypes and genotypes based on the statistical associations. It provides a new insight to investigate the genetic architecture among multiple correlated phenotypes and explore where phenotypes might be related at a higher level of cellular and organismal organization. We show that multiple phenotypes association studies by considering the proposed network are improved by incorporating the genetic information into the phenotype clustering. In Chapter Two, we first illustrate the proposed GPN to GWAS summary statistics. Then, we assess contributions to constructing a well-defined GPN with a clear representation of genetic associations by comparing the network properties with a random network, including connectivity, centrality, and community structure. The network topology annotations based on the sparse representations of GPN can be used to understand the disease heritability for the highly correlated phenotypes. In applications of phenome-wide association studies, the proposed GPN can identify more significant pairs of genetic variant and phenotype categories. In Chapter Three, a powerful and computationally efficient gene-based association test is proposed, aggregating information from different gene-based association tests and also incorporating expression quantitative trait locus information. We show that the proposed method controls the type I error rates very well and has higher power in the simulation studies and can identify more significant genes in the real data analyses. In Chapter Four, we develop six statistical selection methods based on the penalized regression for inferring target genes of a transcription factor (TF). In this study, the proposed selection methods combine statistics, machine learning , and convex optimization approach, which have great efficacy in identifying the true target genes. The methods will fill the gap of lacking the appropriate methods for predicting target genes of a TF, and are instrumental for validating experimental results yielding from ChIP-seq and DAP-seq, and conversely, selection and annotation of TFs based on their target genes. In Chapter Five, we propose a gene selection approach by capturing gene-level signals in network-based regression into case-control association studies with DNA sequence data or DNA methylation data, inspired by the popular gene-based association tests using a weighted combination of genetic variants to capture the combined effect of individual genetic variants within a gene. We show that the proposed gene selection approach have higher true positive rates than using traditional dimension reduction techniques in the simulation studies and select potentially rheumatoid arthritis related genes that are missed by existing methods

    Low-frequency elastic properties of glasses at low temperatures : investigations with double-paddle oscillators based on a dc-SQUID readout

    Get PDF
    Im Rahmen dieser Arbeit wurden niederfrequente Messungen der elastischen Eigenschaften von Gläsern (Quarzglas und BK7) bei tiefen Temperaturen mittels mechanischer Oszillatoren durchgeführt. Hauptschwerpunkt lag hierbei auf der experimentellen Entwicklung einer neuartigen Auslesetechnik für sogenannte Double Paddle Oszillatoren innerhalb eines Verdünnungskryostaten. Der induktive Detektionsmechanismus basiert hierbei auf der hohen Sensitivität von kommerziell erhältlichen dc-SQUIDs. Die Überlegenheit dieser Technik gegenüber der konventionellen kapazitiven Methode wurde durch Messungen an verschiedenen Gläsern demonstriert. Bereits in diesen ersten Messungen konnte die Sensitivität um mehr als eine Größenordnung gegenüber der kapazitiven Technik verbessert werden. Mit Hilfe des neuen Detektionsmechanismus wurden die relative Schallgeschwindigkeitsänderung und die innere Reibung von amorphem SiO2 und BK7 im Temperaturbereich zwischen 5mK und 1K für mehrere Frequenzen untersucht. Die Ergebnisse an Quarzglas stehen in guter Übereinstimmung mit konventionell durchgeführten Messungen, während für BK7 leichte Abweichungen zu früheren Messungen bei tiefsten Temperaturen beobachtet wurden

    Gene-based association tests using GWAS summary statistics and incorporating eQTL

    Get PDF
    Although genome-wide association studies (GWAS) have been successfully applied to a variety of complex diseases and identified many genetic variants underlying complex diseases via single marker tests, there is still a considerable heritability of complex diseases that could not be explained by GWAS. One alternative approach to overcome the missing heritability caused by genetic heterogeneity is gene-based analysis, which considers the aggregate effects of multiple genetic variants in a single test. Another alternative approach is transcriptome-wide association study (TWAS). TWAS aggregates genomic information into functionally relevant units that map to genes and their expression. TWAS is not only powerful, but can also increase the interpretability in biological mechanisms of identified trait associated genes. In this study, we propose a powerful and computationally efficient gene-based association test, called Overall. Using extended Simes procedure, Overall aggregates information from three types of traditional gene-based association tests and also incorporates expression quantitative trait locus (eQTL) information into a gene-based association test using GWAS summary statistics. We show that after a small number of replications to estimate the correlation among the integrated gene-based tests, the p values of Overall can be calculated analytically. Simulation studies show that Overall can control type I error rates very well and has higher power than the tests that we compared with. We also apply Overall to two schizophrenia GWAS summary datasets and two lipids GWAS summary datasets. The results show that this newly developed method can identify more significant genes than other methods we compared with

    A clustering linear combination method for multiple phenotype association studies based on GWAS summary statistics

    Get PDF
    There is strong evidence showing that joint analysis of multiple phenotypes in genome-wide association studies (GWAS) can increase statistical power when detecting the association between genetic variants and human complex diseases. We previously developed the Clustering Linear Combination (CLC) method and a computationally efficient CLC (ceCLC) method to test the association between multiple phenotypes and a genetic variant, which perform very well. However, both of these methods require individual-level genotypes and phenotypes that are often not easily accessible. In this research, we develop a novel method called sCLC for association studies of multiple phenotypes and a genetic variant based on GWAS summary statistics. We use the LD score regression to estimate the correlation matrix among phenotypes. The test statistic of sCLC is constructed by GWAS summary statistics and has an approximate Cauchy distribution. We perform a variety of simulation studies and compare sCLC with other commonly used methods for multiple phenotype association studies using GWAS summary statistics. Simulation results show that sCLC can control Type I error rates well and has the highest power in most scenarios. Moreover, we apply the newly developed method to the UK Biobank GWAS summary statistics from the XIII category with 70 related musculoskeletal system and connective tissue phenotypes. The results demonstrate that sCLC detects the most number of significant SNPs, and most of these identified SNPs can be matched to genes that have been reported in the GWAS catalog to be associated with those phenotypes. Furthermore, sCLC also identifies some novel signals that were missed by standard GWAS, which provide new insight into the potential genetic factors of the musculoskeletal system and connective tissue phenotypes

    Integrating External Controls by Regression Calibration for Genome-Wide Association Study

    Get PDF
    Genome-wide association studies (GWAS) have successfully revealed many disease-associated genetic variants. For a case-control study, the adequate power of an association test can be achieved with a large sample size, although genotyping large samples is expensive. A cost-effective strategy to boost power is to integrate external control samples with publicly available genotyped data. However, the naive integration of external controls may inflate the type I error rates if ignoring the systematic differences (batch effect) between studies, such as the differences in sequencing platforms, genotype-calling procedures, population stratification, and so forth. To account for the batch effect, we propose an approach by integrating External Controls into the Association Test by Regression Calibration (iECAT-RC) in case-control association studies. Extensive simulation studies show that iECAT-RC not only can control type I error rates but also can boost statistical power in all models. We also apply iECAT-RC to the UK Biobank data for M72 Fibroblastic disorders by considering genotype calling as the batch effect. Four SNPs associated with fibroblastic disorders have been detected by iECAT-RC and the other two comparison methods, iECAT-Score and Internal. However, our method has a higher probability of identifying these significant SNPs in the scenario of an unbalanced case-control association study

    Properties of localization in silicon-based lattice periodicity breaking photonic crystal waveguides

    Get PDF
    The light localization effects in silicon photonic crystal cavities at different disorder degrees have been studied using the finite difference time domain (FDTD) method in this paper. Numerical results showed that localization occurs and enhancement can be gained in the region of the cavity under certain conditions. The stabilities of the localization effects due to the structural perturbations have been investigated too. Detailed studies showed that when the degree of structural disorder is small(about 10%), the localization effects are stable, the maximum enhancement factor can reach 16.5 for incident wavelength of 785 nm and 23 for 850 nm in the cavity, with the degree of disorder about 8%. The equivalent diameter of the localized spot is almost constant at different disorder degrees, approximating to {\lambda \mathord{/ {\vphantom {\lambda 7}} \kern-\nulldelimiterspace} 7}λ/7, which turned out to be independent on the structural perturbation

    Efficient sunlight promoted nitrogen fixation from air under room temperature and ambient pressure via Ti/Mo composites

    Full text link
    Photocatalytic nitrogen fixation is an important pathway for carbon neutralization and sustainable development. Inspired by nitrogenase, the participation of molybdenum can effectively activate nitrogen. A novel Ti/Mo composites photocatalyst is designed by sintering the molybdenum acetylacetonate precursor with TiO2_{2}. The special carbon-coated hexagonal photocatalyst is obtained which photocatalytic nitrogen fixation performance is enhanced 16 times compared to pure TiO2_{2} at room temperature and ambient pressure. The abundant surface defects in this composite were confirmed to be the key factor for nitrogen fixation. The 15^{15}N2_{2} isotope labeling experiment was used to demonstrate the feasibility of nitrogen to ammonia conversion. Also, modelling on the interactions between light and the synthesized photocatalyst particle was examined for the light absorption. The optimum nitrogen fixation conditions have been examined, and the nitrogen fixation performance can reach up to 432 μ{\mu}g⋅\cdotgcat−1⋅_{\text{cat}}^{-1}\cdoth−1^{-1}. Numerical simulations via the field-only surface integral method were also carried out to study the interactions between light and the photocatalytic particles to further confirm that it can be a useful material for photocatalyst. This newly developed Ti/Mo composites provide a simple and effective strategy for photocatalytic nitrogen fixation from air directly under ambient conditions

    A New Hip Fracture Risk Index Derived from FEA-Computed Proximal Femur Fracture Loads and Energies-to-Failure

    Full text link
    Hip fracture risk assessment is an important but challenging task. Quantitative CT-based patient specific finite element analysis (FEA) computes the force (fracture load) to break the proximal femur in a particular loading condition. It provides different structural information about the proximal femur that can influence a subject overall fracture risk. To obtain a more robust measure of fracture risk, we used principal component analysis (PCA) to develop a global FEA computed fracture risk index that incorporates the FEA-computed yield and ultimate failure loads and energies to failure in four loading conditions (single-limb stance and impact from a fall onto the posterior, posterolateral, and lateral aspects of the greater trochanter) of 110 hip fracture subjects and 235 age and sex matched control subjects from the AGES-Reykjavik study. We found that the first PC (PC1) of the FE parameters was the only significant predictor of hip fracture. Using a logistic regression model, we determined if prediction performance for hip fracture using PC1 differed from that using FE parameters combined by stratified random resampling with respect to hip fracture status. The results showed that the average of the area under the receive operating characteristic curve (AUC) using PC1 was always higher than that using all FE parameters combined in the male subjects. The AUC of PC1 and AUC of the FE parameters combined were not significantly different than that in the female subjects or in all subjectsComment: 27 pages, 4 figure
    • …
    corecore