89 research outputs found

    Minimum Density Hyperplanes

    Get PDF
    Associating distinct groups of objects (clusters) with contiguous regions of high probability density (high-density clusters), is central to many statistical and machine learning approaches to the classification of unlabelled data. We propose a novel hyperplane classifier for clustering and semi-supervised classification which is motivated by this objective. The proposed minimum density hyperplane minimises the integral of the empirical probability density function along it, thereby avoiding intersection with high density clusters. We show that the minimum density and the maximum margin hyperplanes are asymptotically equivalent, thus linking this approach to maximum margin clustering and semi-supervised support vector classifiers. We propose a projection pursuit formulation of the associated optimisation problem which allows us to find minimum density hyperplanes efficiently in practice, and evaluate its performance on a range of benchmark datasets. The proposed approach is found to be very competitive with state of the art methods for clustering and semi-supervised classification

    Real Time Sentiment Change Detection of Twitter Data Streams

    Full text link
    In the past few years, there has been a huge growth in Twitter sentiment analysis having already provided a fair amount of research on sentiment detection of public opinion among Twitter users. Given the fact that Twitter messages are generated constantly with dizzying rates, a huge volume of streaming data is created, thus there is an imperative need for accurate methods for knowledge discovery and mining of this information. Although there exists a plethora of twitter sentiment analysis methods in the recent literature, the researchers have shifted to real-time sentiment identification on twitter streaming data, as expected. A major challenge is to deal with the Big Data challenges arising in Twitter streaming applications concerning both Volume and Velocity. Under this perspective, in this paper, a methodological approach based on open source tools is provided for real-time detection of changes in sentiment that is ultra efficient with respect to both memory consumption and computational cost. This is achieved by iteratively collecting tweets in real time and discarding them immediately after their process. For this purpose, we employ the Lexicon approach for sentiment characterizations, while change detection is achieved through appropriate control charts that do not require historical information. We believe that the proposed methodology provides the trigger for a potential large-scale monitoring of threads in an attempt to discover fake news spread or propaganda efforts in their early stages. Our experimental real-time analysis based on a recent hashtag provides evidence that the proposed approach can detect meaningful sentiment changes across a hashtags lifetime

    Coupled Analysis of In Vitro and Histology Tissue Samples to Quantify Structure-Function Relationship

    Get PDF
    The structure/function relationship is fundamental to our understanding of biological systems at all levels, and drives most, if not all, techniques for detecting, diagnosing, and treating disease. However, at the tissue level of biological complexity we encounter a gap in the structure/function relationship: having accumulated an extraordinary amount of detailed information about biological tissues at the cellular and subcellular level, we cannot assemble it in a way that explains the correspondingly complex biological functions these structures perform. To help close this information gap we define here several quantitative temperospatial features that link tissue structure to its corresponding biological function. Both histological images of human tissue samples and fluorescence images of three-dimensional cultures of human cells are used to compare the accuracy of in vitro culture models with their corresponding human tissues. To the best of our knowledge, there is no prior work on a quantitative comparison of histology and in vitro samples. Features are calculated from graph theoretical representations of tissue structures and the data are analyzed in the form of matrices and higher-order tensors using matrix and tensor factorization methods, with a goal of differentiating between cancerous and healthy states of brain, breast, and bone tissues. We also show that our techniques can differentiate between the structural organization of native tissues and their corresponding in vitro engineered cell culture models

    The Indian cobra reference genome and transcriptome enables comprehensive identification of venom toxins

    Get PDF
    Snakebite envenoming is a serious and neglected tropical disease that kills ~100,000 people annually. High-quality, genome-enabled comprehensive characterization of toxin genes will facilitate development of effective humanized recombinant antivenom. We report a de novo near-chromosomal genome assembly of Naja naja, the Indian cobra, a highly venomous, medically important snake. Our assembly has a scaffold N50 of 223.35 Mb, with 19 scaffolds containing 95% of the genome. Of the 23,248 predicted protein-coding genes, 12,346 venom-gland-expressed genes constitute the \u27venom-ome\u27 and this included 139 genes from 33 toxin families. Among the 139 toxin genes were 19 \u27venom-ome-specific toxins\u27 (VSTs) that showed venom-gland-specific expression, and these probably encode the minimal core venom effector proteins. Synthetic venom reconstituted through recombinant VST expression will aid in the rapid development of safe and effective synthetic antivenom. Additionally, our genome could serve as a reference for snake genomes, support evolutionary studies and enable venom-driven drug discovery

    Epidemiology, practice of ventilation and outcome for patients at increased risk of postoperative pulmonary complications

    Get PDF
    BACKGROUND Limited information exists about the epidemiology and outcome of surgical patients at increased risk of postoperative pulmonary complications (PPCs), and how intraoperative ventilation was managed in these patients. OBJECTIVES To determine the incidence of surgical patients at increased risk of PPCs, and to compare the intraoperative ventilation management and postoperative outcomes with patients at low risk of PPCs. DESIGN This was a prospective international 1-week observational study using the ‘Assess Respiratory Risk in Surgical Patients in Catalonia risk score’ (ARISCAT score) for PPC for risk stratification. PATIENTS AND SETTING Adult patients requiring intraoperative ventilation during general anaesthesia for surgery in 146 hospitals across 29 countries. MAIN OUTCOME MEASURES The primary outcome was the incidence of patients at increased risk of PPCs based on the ARISCAT score. Secondary outcomes included intraoperative ventilatory management and clinical outcomes. RESULTS A total of 9864 patients fulfilled the inclusion criteria. The incidence of patients at increased risk was 28.4%. The most frequently chosen tidal volume (VT) size was 500 ml, or 7 to 9 ml kg1 predicted body weight, slightly lower in patients at increased risk of PPCs. Levels of positive end-expiratory pressure (PEEP) were slightly higher in patients at increased risk of PPCs, with 14.3% receiving more than 5 cmH2O PEEP compared with 7.6% in patients at low risk of PPCs (P < 0.001). Patients with a predicted preoperative increased risk of PPCs developed PPCs more frequently: 19 versus 7%, relative risk (RR) 3.16 (95% confidence interval 2.76 to 3.61), P < 0.001) and had longer hospital stays. The only ventilatory factor associated with the occurrence of PPCs was the peak pressure. CONCLUSION The incidence of patients with a predicted increased risk of PPCs is high. A large proportion of patients receive high VT and low PEEP levels. PPCs occur frequently in patients at increased risk, with worse clinical outcome

    Intelligent mining of large-scale bio-data: bioinformatics applications

    Get PDF
    Today, there is a collection of a tremendous amount of bio-data because of the computerized applications worldwide. Therefore, scholars have been encouraged to develop effective methods to extract the hidden knowledge in these data. Consequently, a challenging and valuable area for research in artificial intelligence has been created. Bioinformatics creates heuristic approaches and complex algorithms using artificial intelligence and information technology in order to solve biological problems. Intelligent implication of the data can accelerate biological knowledge discovery. Data mining, as biology intelligence, attempts to find reliable, new, useful and meaningful patterns in huge amounts of data. Hence, there is a high potential to raise the interaction between artificial intelligence and bio-data mining. The present paper argues how artificial intelligence can assist bio-data analysis and gives an up-to-date review of different applications of bio-data mining. It also highlights some future perspectives of data mining in bioinformatics that can inspire further developments of data mining instruments. Important and new techniques are critically discussed for intelligent knowledge discovery of different types of row datasets with applicable examples in human, plant and animal sciences. Finally, a broad perception of this hot topic in data science is given

    Epidemiology, practice of ventilation and outcome for patients at increased risk of postoperative pulmonary complications: LAS VEGAS - An observational study in 29 countries

    Get PDF
    BACKGROUND Limited information exists about the epidemiology and outcome of surgical patients at increased risk of postoperative pulmonary complications (PPCs), and how intraoperative ventilation was managed in these patients. OBJECTIVES To determine the incidence of surgical patients at increased risk of PPCs, and to compare the intraoperative ventilation management and postoperative outcomes with patients at low risk of PPCs. DESIGN This was a prospective international 1-week observational study using the ‘Assess Respiratory Risk in Surgical Patients in Catalonia risk score’ (ARISCAT score) for PPC for risk stratification. PATIENTS AND SETTING Adult patients requiring intraoperative ventilation during general anaesthesia for surgery in 146 hospitals across 29 countries. MAIN OUTCOME MEASURES The primary outcome was the incidence of patients at increased risk of PPCs based on the ARISCAT score. Secondary outcomes included intraoperative ventilatory management and clinical outcomes. RESULTS A total of 9864 patients fulfilled the inclusion criteria. The incidence of patients at increased risk was 28.4%. The most frequently chosen tidal volume (V T) size was 500 ml, or 7 to 9 ml kg−1 predicted body weight, slightly lower in patients at increased risk of PPCs. Levels of positive end-expiratory pressure (PEEP) were slightly higher in patients at increased risk of PPCs, with 14.3% receiving more than 5 cmH2O PEEP compared with 7.6% in patients at low risk of PPCs (P ˂ 0.001). Patients with a predicted preoperative increased risk of PPCs developed PPCs more frequently: 19 versus 7%, relative risk (RR) 3.16 (95% confidence interval 2.76 to 3.61), P ˂ 0.001) and had longer hospital stays. The only ventilatory factor associated with the occurrence of PPCs was the peak pressure. CONCLUSION The incidence of patients with a predicted increased risk of PPCs is high. A large proportion of patients receive high V T and low PEEP levels. PPCs occur frequently in patients at increased risk, with worse clinical outcome.</p

    Horse immunization with short-chain consensus α-neurotoxin generates antibodies against broad spectrum of elapid venomous species

    Get PDF
    Antivenoms are fundamental in the therapy for snakebites. In elapid venoms, there are toxins, e.g. short-chain α-neurotoxins, which are quite abundant, highly toxic, and consequently play a major role in envenomation processes. The core problem is that such α-neurotoxins are weakly immunogenic, and many current elapid antivenoms show low reactivity towards them. We have previously developed a recombinant consensus short-chain α-neurotoxin (ScNtx) based on sequences from the most lethal elapid venoms from America, Africa, Asia, and Oceania. Here we report that an antivenom generated by immunizing horses with ScNtx can successfully neutralize the lethality of pure recombinant and native short-chain α-neurotoxins, as well as whole neurotoxic elapid venoms from diverse genera such as Micrurus, Dendroaspis, Naja, Walterinnesia, Ophiophagus and Hydrophis. These results provide a proof-ofprinciple for using recombinant proteins with rationally designed consensus sequences as universal immunogens for developing next-generation antivenoms with higher effectiveness and broader neutralizing capacity.Universidad de Costa Rica/[741-B7-608]/UCR/Costa RicaDireccion General de Asuntos del Personal Academico/[IN203118]/DGAPA/MéxicoDireccion General de Asuntos del Personal Academico/[IN207218]/DGAPA/MéxicoUCR::Vicerrectoría de Investigación::Unidades de Investigación::Ciencias de la Salud::Instituto Clodomiro Picado (ICP
    corecore