1,048 research outputs found

    A Unified Approach to Robust Inference for Genetic Covariance

    Full text link
    Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits. Many complex traits are found to have shared genetic etiology. Genetic covariance is defined as the underlying covariance of genetic values and can be used to measure the shared genetic architecture. The data of two outcomes may be collected from the same group or different groups of individuals and the outcomes can be of different types or collected based on different study designs. This paper proposes a unified approach to robust estimation and inference for genetic covariance of general outcomes that may be associated with genetic variants nonlinearly. We provide the asymptotic properties of the proposed estimator and show that our proposal is robust under certain model mis-specification. Our method under linear working models provides robust inference for the narrow-sense genetic covariance, even when both linear models are mis-specified. Various numerical experiments are performed to support the theoretical results. Our method is applied to an outbred mice GWAS data set to study the overlapping genetic effects between the behavioral and physiological phenotypes. The real data results demonstrate the robustness of the proposed method and reveal interesting genetic covariance among different mice developmental traits

    Inference Of Shared Genetic Architecture With Genome-Wide Association Data

    Get PDF
    Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with complex traits. Many complex traits and diseases share common genetic architecture. Studying the shared genetic architecture provides valuable insights into the underlying disease mechanisms. In this dissertation, we develop several statistical methods for investigating the shared genetic architecture based on GWAS data. We first discuss the quantification and estimation of the shared genetic architecture based on genetic covariance, which is defined as the underlying covariance of the genetic effects. We develop a unified approach to robust estimation and inference for genetic covariance of general outcomes that can be associated with genetic variants nonlinearly. The theoretical analysis shows that the proposed estimator is robust under certain model mis-specification. Various numerical experiments are performed to support the theoretical results. Application of this method to an outbred mice GWAS data set reveals interesting genetic covariance among different mice developmental traits. We then consider a practical challenge when the raw genotype data are unavailable, but only the GWAS summary association statistics are available. We develop a method of moments estimator of genetic correlation between two traits in the framework of high dimensional linear models. Theoretical properties of the estimator in terms of consistency and asymptotic normality are provided. Simulations and real data analysis results show that the proposed estimator is more robust and has better interpretability than the LD score regression method under different genetic architectures. Finally, in chapter 4 we discuss the problem of genome-wide detection and identification of shared genetic association, which is a global assessment of the existence of shared genetic architecture. The challenge is that the linkage disequilibrium (LD) between the SNPs makes test statistics highly dependent, which complicates the detection and identification. To account for such a dependency, an eigenvector-projected score statistic is proposed and a max-type test statistic (max-block) is developed for the genome-wide detection of shared associations. The max-block method is easy to calculate and is shown to control the genome-wide error rate. The method is applied to study shared cross-trait associations in 10 pediatric autoimmune diseases, leading to several regions that explain the genetic sharing between diseases

    Numerical Simulation on the Influence of Bridge Construction on River Flood Control in a Bottleneck Reach

    Get PDF
    Bottleneck reach regions with narrow and deep cross sections prevent sediment transport and weaken flood control capacity. In addition, bridge constructions can exacerbate the risk of flooding in these areas. In this study, the Longhai Railway Extension Project at the Xianyang reach of the Weihe River in China was selected as a typical object. A horizontal 2-D numerical model was used to assess the effects of three engineering plans on flood discharge capacity under three flood frequencies. Plan 1 was designed to include building a new bridge, demolishing the three original bridges and dredging a single section of the channel. Plan 2 was the same as Plan 1, except for the compound sections. Plan 3 was designed with the four bridges coexisting and no dredging projects carried out. The results indicated that Plan 3 will increase the water level by 0.2-0.3 m in the upstream reach. The cross-sectional area was approximately 370 m2 larger under Plan 1 than under Plan 2. Water levels of 300-, 100-, and 5-year flooding around the bridge were reduced by 0.9, 0.9, and 0.6 m, respectively. To improve flood control capacity, an effective dredging project must be executed to widen the river and reduce the water stage in the bottleneck reach where the bridge is constructed

    NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation

    Full text link
    In this paper, we propose a Chinese multi-turn topic-driven conversation dataset, NaturalConv, which allows the participants to chat anything they want as long as any element from the topic is mentioned and the topic shift is smooth. Our corpus contains 19.9K conversations from six domains, and 400K utterances with an average turn number of 20.1. These conversations contain in-depth discussions on related topics or widely natural transition between multiple topics. We believe either way is normal for human conversation. To facilitate the research on this corpus, we provide results of several benchmark models. Comparative results show that for this dataset, our current models are not able to provide significant improvement by introducing background knowledge/topic. Therefore, the proposed dataset should be a good benchmark for further research to evaluate the validity and naturalness of multi-turn conversation systems. Our dataset is available at https://ai.tencent.com/ailab/nlp/dialogue/#datasets.Comment: Accepted as a main track paper at AAAI 202

    Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

    Full text link
    Recent advances in large-scale pre-training provide large models with the potential to learn knowledge from the raw text. It is thus natural to ask whether it is possible to leverage these large models as knowledge bases for downstream tasks. In this work, we answer the aforementioned question in unsupervised knowledge-grounded conversation. We explore various methods that best elicit knowledge from large models. Our human study indicates that, though hallucinations exist, large models post the unique advantage of being able to output common sense and summarize facts that cannot be directly retrieved from the search engine. To better exploit such generated knowledge in dialogue generation, we treat the generated knowledge as a noisy knowledge source and propose the posterior-based reweighing as well as the noisy training strategy. Empirical results on two benchmarks show advantages over the state-of-the-art methods.Comment: Accepted to EMNLP 2022 Main Conference. The code is publicly available at https://github.com/lyy1994/PLM_as_KB/tree/main/projects/plm_as_k

    Leaching resistance of hazardous waste cement solidification after accelerated carbonation

    Get PDF
    When cement-based materials are carbonated, some of their physicochemical properties are changed, which includes reductions of porosity by 20% and pH from 12-13 to 8–9. These changes can enhance the retention ability of cementitious solids containing hazard waste. This research studied the effect of carbonation on the leaching resistance of hazardous waste cement solidification. The finite element software COMSOL Multiphysics was used to simulate the process of accelerated carbonation and the effect of carbonation on leaching. Laboratory tests were conducted to validate the numerical models. Parametric studies from the numerical simulations revealed that carbonation could significantly improve leaching retention capabilities of cementitious solids containing hazardous wastes

    Abnormal traffic detection system in SDN based on deep learning hybrid models

    Full text link
    Software defined network (SDN) provides technical support for network construction in smart cities, However, the openness of SDN is also prone to more network attacks. Traditional abnormal traffic detection methods have complex algorithms and find it difficult to detect abnormalities in the network promptly, which cannot meet the demand for abnormal detection in the SDN environment. Therefore, we propose an abnormal traffic detection system based on deep learning hybrid model. The system adopts a hierarchical detection technique, which first achieves rough detection of abnormal traffic based on port information. Then it uses wavelet transform and deep learning techniques for fine detection of all traffic data flowing through suspicious switches. The experimental results show that the proposed detection method based on port information can quickly complete the approximate localization of the source of abnormal traffic. the accuracy, precision, and recall of the fine detection are significantly improved compared with the traditional method of abnormal traffic detection in SDN
    • …
    corecore