103 research outputs found

    Assessment of OvineSNP50 in Nigerian and Kenyan sheep populations

    Get PDF
    Deciphering genomic information requires markers that are polymorphic and sufficient enough to capture its vast array of genetic data. Polymorphic loci can differ greatly between breeds of the same species and the exclusion of the Nigerian and some African sheep breeds during the development of the OvineSNP50 chip necessitated the validation of SNPs included on the chip to allow for genomic applications of the excluded breeds. A total of sixty sheep samples were genotyped [10 each of the Balami, Uda, West African Dwarf and Yankasa from Nigeria, Dorper and Red Maasai from Kenya (East Africa)] using the Ovine 50k Illumina SNP bead chip. Results revealed that 33,994 SNPs (97.47%) of the called 34,876 SNPs were validated for downstream analysis. Mean heterozygosity values of 0.154 and 0.153 were obtained for polymorphic SNPs on sex and autosomal chromosomes respectively, while the values of 0.662 and 0.054 were obtained on the sex and autosomal chromosomes respectively for the mean identity-bystate (IBS). Six and three individuals violated the per ID and identity-by-state (IBS) thresholds, respectively. It was observed that the Ovine 50k Illumina SNP bead chip wasinformative in the Nigerian and East African sheep that were studied, and should be useful in examining the underlying genetic variation.Keywords: Sheep, OvineSNP50, genome-wide, call rate, minor allele frequenc

    Bayesian nonparametric models for name disambiguation and supervised learning

    Get PDF
    This thesis presents new Bayesian nonparametric models and approaches for their development, for the problems of name disambiguation and supervised learning. Bayesian nonparametric methods form an increasingly popular approach for solving problems that demand a high amount of model flexibility. However, this field is relatively new, and there are many areas that need further investigation. Previous work on Bayesian nonparametrics has neither fully explored the problems of entity disambiguation and supervised learning nor the advantages of nested hierarchical models. Entity disambiguation is a widely encountered problem where different references need to be linked to a real underlying entity. This problem is often unsupervised as there is no previously known information about the entities. Further to this, effective use of Bayesian nonparametrics offer a new approach to tackling supervised problems, which are frequently encountered. The main original contribution of this thesis is a set of new structured Dirichlet process mixture models for name disambiguation and supervised learning that can also have a wide range of applications. These models use techniques from Bayesian statistics, including hierarchical and nested Dirichlet processes, generalised linear models, Markov chain Monte Carlo methods and optimisation techniques such as BFGS. The new models have tangible advantages over existing methods in the field as shown with experiments on real-world datasets including citation databases and classification and regression datasets. I develop the unsupervised author-topic space model for author disambiguation that uses free-text to perform disambiguation unlike traditional author disambiguation approaches. The model incorporates a name variant model that is based on a nonparametric Dirichlet language model. The model handles both novel unseen name variants and can model the unknown authors of the text of the documents. Through this, the model can disambiguate authors with no prior knowledge of the number of true authors in the dataset. In addition, it can do this when the authors have identical names. I use a model for nesting Dirichlet processes named the hybrid NDP-HDP. This model allows Dirichlet processes to be clustered together and adds an additional level of structure to the hierarchical Dirichlet process. I also develop a new hierarchical extension to the hybrid NDP-HDP. I develop this model into the grouped author-topic model for the entity disambiguation task. The grouped author-topic model uses clusters to model the co-occurrence of entities in documents, which can be interpreted as research groups. Since this model does not require entities to be linked to specific words in a document, it overcomes the problems of some existing author-topic models. The model incorporates a new method for modelling name variants, so that domain-specific name variant models can be used. Lastly, I develop extensions to supervised latent Dirichlet allocation, a type of supervised topic model. The keyword-supervised LDA model predicts document responses more accurately by modelling the effect of individual words and their contexts directly. The supervised HDP model has more model flexibility by using Bayesian nonparametrics for supervised learning. These models are evaluated on a number of classification and regression problems, and the results show that they outperform existing supervised topic modelling approaches. The models can also be extended to use similar information to the previous models, incorporating additional information such as entities and document titles to improve prediction

    The Non-linear Dynamics of Meaning-Processing in Social Systems

    Full text link
    Social order cannot be considered as a stable phenomenon because it contains an order of reproduced expectations. When the expectations operate upon one another, they generate a non-linear dynamics that processes meaning. Specific meaning can be stabilized, for example, in social institutions, but all meaning arises from a horizon of possible meanings. Using Luhmann's (1984) social systems theory and Rosen's (1985) theory of anticipatory systems, I submit equations for modeling the processing of meaning in inter-human communication. First, a self-referential system can use a model of itself for the anticipation. Under the condition of functional differentiation, the social system can be expected to entertain a set of models; each model can also contain a model of the other models. Two anticipatory mechanisms are then possible: one transversal between the models, and a longitudinal one providing the modeled systems with meaning from the perspective of hindsight. A system containing two anticipatory mechanisms can become hyper-incursive. Without making decisions, however, a hyper-incursive system would be overloaded with uncertainty. Under this pressure, informed decisions tend to replace the "natural preferences" of agents and an order of cultural expectations can increasingly be shaped

    The Physics of Star Cluster Formation and Evolution

    Get PDF
    © 2020 Springer-Verlag. The final publication is available at Springer via https://doi.org/10.1007/s11214-020-00689-4.Star clusters form in dense, hierarchically collapsing gas clouds. Bulk kinetic energy is transformed to turbulence with stars forming from cores fed by filaments. In the most compact regions, stellar feedback is least effective in removing the gas and stars may form very efficiently. These are also the regions where, in high-mass clusters, ejecta from some kind of high-mass stars are effectively captured during the formation phase of some of the low mass stars and effectively channeled into the latter to form multiple populations. Star formation epochs in star clusters are generally set by gas flows that determine the abundance of gas in the cluster. We argue that there is likely only one star formation epoch after which clusters remain essentially clear of gas by cluster winds. Collisional dynamics is important in this phase leading to core collapse, expansion and eventual dispersion of every cluster. We review recent developments in the field with a focus on theoretical work.Peer reviewe

    New insights into the genetic etiology of Alzheimer's disease and related dementias

    Get PDF
    Characterization of the genetic landscape of Alzheimer's disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/'proxy' AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE ε4 allele

    The Physics of the B Factories

    Get PDF
    corecore