14 research outputs found

    Comparison of Statistical Learning and Predictive Models on Breast Cancer Data and King County Housing Data

    In this study, we evaluate the predictive performance of popular statistical learning methods, such as discriminant analysis, random forests, support vector machines, and neural networks, through real data analysis. Two datasets, Breast Cancer Diagnosis in Wisconsin and House Sales in King County, are each analyzed to obtain the best models for prediction. Linear and Quadratic Discriminant Analysis are applied to the WDBC dataset. Linear Regression and Elastic Net are applied to the KC house dataset. Random Forest, Gradient Boosting, Support Vector Machines, and Neural Networks are applied to both datasets. Individual models and stacked models are trained based on accuracy or R-squared from repeated cross-validation on the training sets. The final models are evaluated on the test sets.

    Genome of lethal Lepiota venenata and insights into the evolution of toxin-biosynthetic genes

    Background: Genomes of lethal Amanita and Galerina mushrooms have gradually become available over the past ten years; in contrast, no genome has been available for Lepiota, the other known amanitin-producing genus. A fatal mushroom poisoning case in China led to the acquisition of fresh L. venenata fruiting bodies, from which a draft genome was obtained using the PacBio and Illumina sequencing platforms. The toxin-biosynthetic MSDIN family and prolyl oligopeptidase B (POPB) genes were mined from the genome and used for phylogenetic and statistical studies to gain insights into the evolution of the biosynthetic pathway. Results: Analysis of the genome data showed that only one MSDIN, named LvAMA1, exists in the genome, along with a POPB gene. No POPA homolog was identified by direct homology searching; however, one additional POP gene, named LvPOPC, was cloned and its gene structure determined. Similar to ApAMA1 in A. phalloides and GmAMA1 in G. marginata, LvAMA1 directly encodes α-amanitin. The two toxin genes were mapped to the draft genome, and their structures were analyzed. Furthermore, phylogenetic and statistical analyses were conducted to study the evolutionary history of the POPB genes. Compared to our previous report, the phylogenetic trees unambiguously showed that a monophyletic POPB lineage clearly conflicted with the species phylogeny. In contrast, the phylogeny of the POPA genes resembled the species phylogeny. Topology and divergence tests showed that the POPB lineage was robust and that these genes exhibited significantly shorter genetic distances than those of the house-keeping rbp2, a characteristic feature of genes with a horizontal gene transfer (HGT) background. Consistently, the same scenario applied to the only MSDIN in the genome, LvAMA1. Conclusions: To the best of our knowledge, this is the first reported genome of Lepiota. The analyses of the toxin genes indicate that the cyclic peptides are synthesized through a ribosomal mechanism. The toxin genes, LvAMA1 and LvPOPB, are not in the vicinity of each other. Phylogenetic and evolutionary studies suggest that HGT is the underlying cause of the occurrence of POPB and MSDIN in Amanita, Galerina and Lepiota, which belong to three distantly related families.

    Development of DNA Vaccine Candidate against SARS-CoV-2

    Despite the existence of various types of vaccines and the involvement of the world’s leading pharmaceutical companies, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) remains the most challenging health threat of this century. Along with increased transmissibility, new strains continue to emerge, creating a need for more vaccines that are protective and safe against the new strains of the virus. Nucleic acid vaccines seem to be the most effective approach in the case of a sudden outbreak of infection or the emergence of a new strain, as they require less development time than any conventional vaccine. Hence, in the current study, a DNA vaccine encoding the trimeric prefusion-stabilized ectodomain (S1+S2) of the SARS-CoV-2 S-protein was designed by introducing six additional proline mutations, termed HexaPro. A three-dose immunization regimen with the designed DNA vaccine in mice generated protective antibodies.