5 research outputs found

    Alchemical and structural distribution based representation for improved QML

    Full text link
    We introduce a representation of any atom in any chemical environment for the generation of efficient quantum machine learning (QML) models of common electronic ground-state properties. The representation is based on scaled distribution functions explicitly accounting for elemental and structural degrees of freedom. Resulting QML models afford very favorable learning curves for properties of out-of-sample systems including organic molecules, non-covalently bonded protein side-chains, (H$_2$O)$_{40}$-clusters, as well as diverse crystals. The elemental components help to lower the learning curves, and, through interpolation across the periodic table, even enable "alchemical extrapolation" to covalent bonding between elements not part of training, as evinced for single, double, and triple bonds among main-group elements.

    Image_2_Genome-Wide Characterization of Endogenous Retroviruses in Bombyx mori Reveals the Relatives and Activity of env Genes.TIF

    No full text
    Endogenous retroviruses (ERVs) are retroviral sequences that remain fixed in the host genome, where they could play an important role. Some ERVs have been identified in insects and proven to have infectious properties. However, no information is available regarding Bombyx mori ERVs (BmERVs) to date. Here, we systematically identified 256 potential BmERVs in the silkworm genome via a whole-genome approach. BmERVs were relatively evenly distributed across the chromosomes and accounted for about 25% of the silkworm genome. All BmERVs were classified as young ERVs, with insertion times estimated to be less than 10 million years. Seven BmERVs possessing the env genes were identified. With the exception of the Orf133 Helicoverpa armigera nuclear polyhedrosis virus, the env sequences of BmERVs were distantly related to genes encoding F (Fa and Fb) and GP64 proteins from Group I and Group II NPVs. In addition, only the amino acid sequence of the BmERV-21 envelope protein shared a similar putative furin-like cleavage site and fusion peptide with Group II baculoviruses. All of the env genes in the seven BmERVs were verified to exist in the genome and to be expressed in the midgut and fat bodies, which suggests that BmERVs might play an important role in the host biology.

    Data_Sheet_3_Genome-Wide Characterization of Endogenous Retroviruses in Bombyx mori Reveals the Relatives and Activity of env Genes.XLSX

    No full text
    Endogenous retroviruses (ERVs) are retroviral sequences that remain fixed in the host genome, where they could play an important role. Some ERVs have been identified in insects and proven to have infectious properties. However, no information is available regarding Bombyx mori ERVs (BmERVs) to date. Here, we systematically identified 256 potential BmERVs in the silkworm genome via a whole-genome approach. BmERVs were relatively evenly distributed across the chromosomes and accounted for about 25% of the silkworm genome. All BmERVs were classified as young ERVs, with insertion times estimated to be less than 10 million years. Seven BmERVs possessing the env genes were identified. With the exception of the Orf133 Helicoverpa armigera nuclear polyhedrosis virus, the env sequences of BmERVs were distantly related to genes encoding F (Fa and Fb) and GP64 proteins from Group I and Group II NPVs. In addition, only the amino acid sequence of the BmERV-21 envelope protein shared a similar putative furin-like cleavage site and fusion peptide with Group II baculoviruses. All of the env genes in the seven BmERVs were verified to exist in the genome and to be expressed in the midgut and fat bodies, which suggests that BmERVs might play an important role in the host biology.

    Image_1_Genome-Wide Characterization of Endogenous Retroviruses in Bombyx mori Reveals the Relatives and Activity of env Genes.TIF

    No full text
    Endogenous retroviruses (ERVs) are retroviral sequences that remain fixed in the host genome, where they could play an important role. Some ERVs have been identified in insects and proven to have infectious properties. However, no information is available regarding Bombyx mori ERVs (BmERVs) to date. Here, we systematically identified 256 potential BmERVs in the silkworm genome via a whole-genome approach. BmERVs were relatively evenly distributed across the chromosomes and accounted for about 25% of the silkworm genome. All BmERVs were classified as young ERVs, with insertion times estimated to be less than 10 million years. Seven BmERVs possessing the env genes were identified. With the exception of the Orf133 Helicoverpa armigera nuclear polyhedrosis virus, the env sequences of BmERVs were distantly related to genes encoding F (Fa and Fb) and GP64 proteins from Group I and Group II NPVs. In addition, only the amino acid sequence of the BmERV-21 envelope protein shared a similar putative furin-like cleavage site and fusion peptide with Group II baculoviruses. All of the env genes in the seven BmERVs were verified to exist in the genome and to be expressed in the midgut and fat bodies, which suggests that BmERVs might play an important role in the host biology.

    Data_Sheet_2_Genome-Wide Characterization of Endogenous Retroviruses in Bombyx mori Reveals the Relatives and Activity of env Genes.XLSX

    No full text
    Endogenous retroviruses (ERVs) are retroviral sequences that remain fixed in the host genome, where they could play an important role. Some ERVs have been identified in insects and proven to have infectious properties. However, no information is available regarding Bombyx mori ERVs (BmERVs) to date. Here, we systematically identified 256 potential BmERVs in the silkworm genome via a whole-genome approach. BmERVs were relatively evenly distributed across the chromosomes and accounted for about 25% of the silkworm genome. All BmERVs were classified as young ERVs, with insertion times estimated to be less than 10 million years. Seven BmERVs possessing the env genes were identified. With the exception of the Orf133 Helicoverpa armigera nuclear polyhedrosis virus, the env sequences of BmERVs were distantly related to genes encoding F (Fa and Fb) and GP64 proteins from Group I and Group II NPVs. In addition, only the amino acid sequence of the BmERV-21 envelope protein shared a similar putative furin-like cleavage site and fusion peptide with Group II baculoviruses. All of the env genes in the seven BmERVs were verified to exist in the genome and to be expressed in the midgut and fat bodies, which suggests that BmERVs might play an important role in the host biology.