17 research outputs found

    Some Submodular Data-Poisoning Attacks on Machine Learners

    No full text
    We study data-poisoning attacks using a machine teaching framework. For a family of NP-hard attack problems we pose them as submodular function maximization, thereby inheriting efficient greedy algorithms with theoretical guarantees. We demonstrate some attacks with experiments

    Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

    No full text
    We investigate a problem at the intersection of machine learning and security: training-set attacks on machine learners. In such attacks an attacker contaminates the training data so that a specific learning algorithm would produce a model profitable to the attacker. Understanding training-set attacks is important as more intelligent agents (e.g. spam filters and robots) are equipped with learning capability and can potentially be hacked via data they receive from the environment. This paper identifies the optimal training-set attack on a broad family of machine learners. First we show that optimal training-set attack can be formulated as a bilevel optimization problem. Then we show that for machine learners with certain Karush-Kuhn-Tucker conditions we can solve the bilevel problem efficiently using gradient methods on an implicit function. As examples, we demonstrate optimal trainingset attacks on Support Vector Machines, logistic regression, and linear regression with extensive experiments. Finally, we discuss potential defenses against such attacks

    Encoding Low-Rank and Sparse Structures Simultaneously in Multi-task Learning

    Get PDF
    Multi-task learning (MTL) aims to improve the performance of each task by borrowing the knowledge learned from other related tasks. Identifying the underlying structures among tasks is crucial for MTL to understand the relationship among tasks. In this paper, we propose a novel multi-task learning model to simultaneously consider low-rank structure and sparse structure. Combining these two types of structures could not only improve the learner's performance, but also make the interpretation of learned structures easier. However, the standard subgradient optimization method for solving this problem could only achieve a rate of convergence O(1/ √ k). We propose a novel optimization method combining the Moreau approximation and an accelerated proximal method to achieve a rate of convergence O(1/k). We conduct experiments on synthetic data and several real-world data sets and the results show the gains of our model in comparison with state-of-the-art baselines

    The characteristics of the complete chloroplast genome of Staurogyne concinnula (Hance) O. Kuntze (Acanthaceae)

    No full text
    Staurogyne concinnula (Hance) O. Kuntze (Acanthaceae) is an important ornamental herb mainly distributed in the southern region of China, including Fujian, Guangdong, Hainan, and Taiwan provinces. However, the complete chloroplast genome of S. concinnula, which could serve as a genetic resource for studies on its taxonomy and evolution, is poorly studied at present. In this study, we reported the complete chloroplast genome of S. concinnula that was assembled using high-throughput sequencing data. The chloroplast genome was 153,783 bp long, with a typical quadripartite structure containing a small single-copy region (SSC; 17,855 bp), a large single-copy region (LSC; 84,636 bp) and a pair of inverted repeats (IRs; each 25,646 bp). The overall GC content of the chloroplast genome was 38.04%. A total of 86 protein-coding genes (PCGs), 8 rRNA genes, and 37 tRNA genes were predicted. Phylogenetic analysis based on the combined sequences of 86 PCGs with the other 16 closely related species of Acanthaceae indicated that S. concinnula is closely related to Avicennia marina. The genomic data and finding from the phylogenetic studies of S. concinnula could provide useful information and give light to in-depth studies on the evolution pattern of the understudied species, as well as Staurogyne

    The characteristics of the complete chloroplast genome of <i>Staurogyne concinnula</i> (Hance) O. Kuntze (Acanthaceae)

    No full text
    Staurogyne concinnula (Hance) O. Kuntze (Acanthaceae) is an important ornamental herb mainly distributed in the southern region of China, including Fujian, Guangdong, Hainan, and Taiwan provinces. However, the complete chloroplast genome of S. concinnula, which could serve as a genetic resource for studies on its taxonomy and evolution, is poorly studied at present. In this study, we reported the complete chloroplast genome of S. concinnula that was assembled using high-throughput sequencing data. The chloroplast genome was 153,783 bp long, with a typical quadripartite structure containing a small single-copy region (SSC; 17,855 bp), a large single-copy region (LSC; 84,636 bp) and a pair of inverted repeats (IRs; each 25,646 bp). The overall GC content of the chloroplast genome was 38.04%. A total of 86 protein-coding genes (PCGs), 8 rRNA genes, and 37 tRNA genes were predicted. Phylogenetic analysis based on the combined sequences of 86 PCGs with the other 16 closely related species of Acanthaceae indicated that S. concinnula is closely related to Avicennia marina. The genomic data and finding from the phylogenetic studies of S. concinnula could provide useful information and give light to in-depth studies on the evolution pattern of the understudied species, as well as Staurogyne.</p

    The complete chloroplast genome sequence of Vernonia amygdalina Delile

    No full text
    Vernonia amygdalina Delile is widely used in folkloric medicine for the treatment of various diseases. In this study, the complete chloroplast genome of V. amygdalina Delile was reported, which was assembled and annotated base on genome high-throughput sequencing data. This work provided the clues for the taxonomy of the herb and the potential to utilize the chloroplast genome sequence as a new study target. The length of V. amygdalina Delile chloroplast genome was 153,133bp, with two single-copy regions, each has the length of 84,245bp and 13,152bp respectively. This region were separated by two inverted repeat regions with 27,868bp in length. It was predicted to consist of 131 genes in total, including 86 protein-coding genes, 37 tRNA genes, 8 rRNA genes with GC content at 37.68%. Phylogenetic analysis by RAxML (Random Axelerated Maximum Likelikhood) showed V. amygdalina Delile is closest to Sonchus webbii in Compositae

    The complete chloroplast genome sequence of medicinal plant Alpinia chinensis (Retz.) Rosc

    No full text
    Alpinia chinensis (Retz.) Rosc is one of Chinese tradition herbal medicine and edible plant in China. In this report, we sequenced the complete chloroplast genome of A. chinensis. Through the assembly annotation of genome with high-throughput sequencing data, which help us to research the evolution. The length of chloroplast sequences was 163,590 bp with a large single-copy region (LSC) and a small single-copy region (SSC), also, two inverted repeat region A (IR), whose length was 88,951, 15,299, and 29,670 bp, respectively. A total of 138 genes were predicted in the complete chloroplast genome, with 36.4% GC content, including 93 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. From the phylogenetic analysis, we could conclude that A. chinensis (Retz.) Rosc. was close to Alpinia oxyphylla in Zingiberaceae

    The complete chloroplast genome sequence of Sauropus spatulifolius Beille

    No full text
    Sauropus spatulifolius Beille is one kind of Chinese herbal medicine with anti-inflammatory and analgesic activities. In this study, we reported the complete chloroplast genome of S. spatulifolius Beille, and assembled and annotated with high-throughput sequencing data, which would provide help for its taxonomy research. The chloroplast sequence was 154,707 bp , with two of 87,438 bp and 19,427 bp single-copy regions, which were separated by two inverted repeat regions with 23,921 bp . A total of 129 genes were predicted, with GC content of 36.61%. Phylogenetic analysis showed that S. spatulifolius Beille closest to Glochidion chodoense in Malpighiales

    The complete chloroplast genome sequence of Acanthopanax trifoliatus (Linn.) Merr.

    No full text
    Acanthopanax trifoliatus (Linn.) Merr. is an edible vegetables and medicinal plant from Asian countries. In this study, the complete chloroplast genome of A. trifoliatus was assembled and annotated by high-throughput sequencing. The total chloroplast genome size of A. trifoliatus was 156,716 bp, containing a large single-copy (LSC) region of 86,672 bp, a small single-copy (SSC) region of 18,174 bp, and a pair of inverted repeat regions of 25,935 bp. A total of 134 genes were predicted in the chloroplast genome of A. trifoliatus, including 89 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Phylogenetic analysis showed that A. trifoliatus was closely related to Eleutherococcus gracilistylus
    corecore