6,293 research outputs found

    Information in the Context of Philosophy and Cognitive Sciences

    Get PDF
    This textbook briefly maps as many as possible areas and contexts in which information plays an important role. It attempts an approach that also seeks to explore areas of research that are not commonly associated, such as informatics, information and library science, information physics, or information ethics. Given that the text is intended especially for students of the Master's Degree in Cognitive Studies, emphasis is placed on a humane, philosophical and interdisciplinary approach. It offers rather directions of thought, questions, and contexts than a complete theory developed into mathematical and technical details

    Predicting Health Impacts of the World Trade Center Disaster: 1. Halogenated hydrocarbons, symptom syndromes, secondary victimization, and the burdens of history

    Get PDF
    The recent attack on the World Trade Center, in addition to direct injury and psychological trauma, has exposed a vast population to dioxins, dibenzofurans, related endocrine disruptors, and a multitude of other physiologically active chemicals arising from the decomposition of the massive quantities of halogenated hydrocarbons and other plastics within the affected buildings. The impacts of these chemical species have been compounded by exposure to asbestos, fiberglass, crushed glass, concrete, plastic, and other irritating dusts. To address the manifold complexities of this incident we combine recent theoretical perspectives on immune, CNS, and sociocultural cognition with empirical studies on survivors of past large toxic fires, other community-scale chemical exposure incidents, and the aftereffects of war. Our analysis suggests the appearance of complex, but distinct and characteristic, spectra of synergistically linked social, psychosocial, psychological and physical symptoms among the 100,000 or so persons most directly affected by the WTC attack. The different 'eigenpatterns' should become increasingly comorbid as a function of exposure. The expected outcome greatly transcends a simple 'Post Traumatic Stress Disorder' model, and may resemble a particularly acute form of Gulf War Syndrome. We explore the role of external social factors in subsequent exacerbation of the syndrome -- secondary victimization -- and study the path-dependent influence of individual and community-level historical patterns of stress. We suggest that workplace and other organizations can act as ameliorating intermediaries. Those without acess to such buffering structures appear to face a particularly bleak future

    Parsimonious Time Series Clustering

    Full text link
    We introduce a parsimonious model-based framework for clustering time course data. In these applications the computational burden becomes often an issue due to the number of available observations. The measured time series can also be very noisy and sparse and a suitable model describing them can be hard to define. We propose to model the observed measurements by using P-spline smoothers and to cluster the functional objects as summarized by the optimal spline coefficients. In principle, this idea can be adopted within all the most common clustering frameworks. In this work we discuss applications based on a k-means algorithm. We evaluate the accuracy and the efficiency of our proposal by simulations and by dealing with drosophila melanogaster gene expression data

    Reconciliation between operational taxonomic units and species boundaries

    Get PDF
    The development of high-throughput sequencing technologies has revolutionised the field of microbial ecology via 16S rRNA gene amplicon sequencing approaches. Clustering those amplicon sequencing reads into operational taxonomic units (OTUs) using a fixed cut-off is a commonly used approach to estimate microbial diversity. A 97% threshold was chosen with the intended purpose that resulting OTUs could be interpreted as a proxy for bacterial species. Our results show that the robustness of such a generalised cut-off is questionable when applied to short amplicons only covering one or two variable regions of the 16S rRNA gene. It will lead to biases in diversity metrics and makes it hard to compare results obtained with amplicons derived with different primer sets. The method introduced within this work takes into account the differential evolutional rates of taxonomic lineages in order to define a dynamic and taxonomic-dependent OTU clustering cut-off score. For a taxonomic family consisting of species showing high evolutionary conservation in the amplified variable regions, the cut-off will be more stringent than 97%. By taking into consideration the amplified variable regions and the taxonomic family when defining this cut-off, such a threshold will lead to more robust results and closer correspondence between OTUs and species. This approach has been implemented in a publicly available software package called DynamiC

    Systematic comparison of ranking aggregation methods for gene lists in experimental results

    Get PDF
    MOTIVATION: A common experimental output in biomedical science is a list of genes implicated in a given biological process or disease. The gene lists resulting from a group of studies answering the same, or similar, questions can be combined by ranking aggregation methods to find a consensus or a more reliable answer. Evaluating a ranking aggregation method on a specific type of data before using it is required to support the reliability since the property of a dataset can influence the performance of an algorithm. Such evaluation on gene lists is usually based on a simulated database because of the lack of a known truth for real data. However, simulated datasets tend to be too small compared to experimental data and neglect key features, including heterogeneity of quality, relevance and the inclusion of unranked lists. RESULTS: In this study, a group of existing methods and their variations that are suitable for meta-analysis of gene lists are compared using simulated and real data. Simulated data were used to explore the performance of the aggregation methods as a function of emulating the common scenarios of real genomic data, with various heterogeneity of quality, noise level and a mix of unranked and ranked data using 20 000 possible entities. In addition to the evaluation with simulated data, a comparison using real genomic data on the SARS-CoV-2 virus, cancer (non-small cell lung cancer) and bacteria (macrophage apoptosis) was performed. We summarize the results of our evaluation in a simple flowchart to select a ranking aggregation method, and in an automated implementation using the meta-analysis by information content algorithm to infer heterogeneity of data quality across input datasets. AVAILABILITY AND IMPLEMENTATION: The code for simulated data generation and running edited version of algorithms: https://github.com/baillielab/comparison_of_RA_methods. Code to perform an optimal selection of methods based on the results of this review, using the MAIC algorithm to infer the characteristics of an input dataset, can be downloaded here: https://github.com/baillielab/maic. An online service for running MAIC: https://baillielab.net/maic. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

    Assumption 0 analysis: comparative phylogenetic studies in the age of complexity

    Get PDF
    Darwin's panoramic view of biology encompassed two metaphors: the phylogenetic tree, pointing to relatively linear (and divergent) complexity, and the tangled bank, pointing to reticulated (and convergent) complexity. The emergence of phylogenetic systematics half a century ago made it possible to investigate linear complexity in biology. Assumption 0, first proposed in 1986, is not needed for cases of simple evolutionary patterns, but must be invoked when there are complex evolutionary patterns whose hallmark is reticulated relationships. A corollary of Assumption 0, the duplication convention, was proposed in 1990, permitting standard phylogenetic systematic ontology to be used in discovering reticulated evolutionary histories. In 2004, a new algorithm, phylogenetic analysis for comparing trees (PACT), was developed specifically for use in analyses invoking Assumption 0. PACT can help discern complex evolutionary explanations for historical biogeographical, coevolutionary, phylogenetic, and tokogenetic processe

    Heritable clustering and pathway discovery in breast cancer integrating epigenetic and phenotypic data

    Get PDF
    BACKGROUND: In order to recapitulate tumor progression pathways using epigenetic data, we developed novel clustering and pathway reconstruction algorithms, collectively referred to as heritable clustering. This approach generates a progression model of altered DNA methylation from tumor tissues diagnosed at different developmental stages. The samples act as surrogates for natural progression in breast cancer and allow the algorithm to uncover distinct epigenotypes that describe the molecular events underlying this process. Furthermore, our likelihood-based clustering algorithm has great flexibility, allowing for incomplete epigenotype or clinical phenotype data and also permitting dependencies among variables. RESULTS: Using this heritable clustering approach, we analyzed methylation data obtained from 86 primary breast cancers to recapitulate pathways of breast tumor progression. Detailed annotation and interpretation are provided to the optimal pathway recapitulated. The result confirms the previous observation that aggressive tumors tend to exhibit higher levels of promoter hypermethylation. CONCLUSION: Our results indicate that the proposed heritable clustering algorithms are a useful tool for stratifying both methylation and clinical variables of breast cancer. The application to the breast tumor data illustrates that this approach can select meaningful progression models which may aid the interpretation of pathways having biological and clinical significance. Furthermore, the framework allows for other types of biological data, such as microarray gene expression or array CGH data, to be integrated

    Improving the family orientation process in Cuban Special Schools trough Nearest Prototype classification

    Get PDF
    Cuban Schools for children with Affective – Behavioral Maladies (SABM) have as goal to accomplish a major change in children behavior, to insert them effectively into society. One of the key elements in this objective is to give an adequate orientation to the children’s families; due to the family is one of the most important educational contexts in which the children will develop their personality. The family orientation process in SABM involves clustering and classification of mixed type data with non-symmetric similarity functions. To improve this process, this paper includes some novel characteristics in clustering and prototype selection. The proposed approach uses a hierarchical clustering based on compact sets, making it suitable for dealing with non-symmetric similarity functions, as well as with mixed and incomplete data. The proposal obtains very good results on the SABM data, and over repository databases
    • 

    corecore