9 research outputs found

    Q–Q plots of the Hurst exponents.

    No full text
    <p>A, B, and C: <i>Escherichia coli</i> MG1655 I; D, E, and F: <i>Salmonella enterica subsp</i>. <i>enterica serovar Typhimurium str</i>. 14028S. A and D show Q–Q plots of the Hurst exponents of the full-gene-sets of the two objects from respective organisms. B and E show Q–Q plots of the Hurst exponents of the essential genes of the two objects from respective organisms. C and F show Q–Q plots of the Hurst exponents of the nonessential genes of the two objects from respective organisms.</p

    Information of the analyzed objects.

    No full text
    <p><sup>a</sup> NO nonessential genes information provided in DEG.</p><p>Information of the analyzed objects.</p

    Significance levels of 33 objects in a normal distribution based on the hurstSpec method in smoothed mode.

    No full text
    <p>Significance levels of 33 objects in a normal distribution based on the hurstSpec method in smoothed mode.</p

    Selection of key sequence-based features for prediction of essential genes in 31 diverse bacterial species

    No full text
    <div><p>Genes that are indispensable for survival are essential genes. Many features have been proposed for computational prediction of essential genes. In this paper, the least absolute shrinkage and selection operator method was used to screen key sequence-based features related to gene essentiality. To assess the effects, the selected features were used to predict the essential genes from 31 bacterial species based on a support vector machine classifier. For all 31 bacterial objects (21 Gram-negative objects and ten Gram-positive objects), the features in the three datasets were reduced from 57, 59, and 58, to 40, 37, and 38, respectively, without loss of prediction accuracy. Results showed that some features were redundant for gene essentiality, so could be eliminated from future analyses. The selected features contained more complex (or key) biological information for gene essentiality, and could be of use in related research projects, such as gene prediction, synthetic biology, and drug design.</p></div

    Comparison of the prediction performance.

    No full text
    <p>Comparison of the prediction performance.</p

    Comparison of the classification results of original and selected features.

    No full text
    <p>Comparison of the classification results of original and selected features.</p

    Three ROC curves for predicting essential genes based on the original and selected features.

    No full text
    <p>(A) ROC curves for Gram-negative dataset. (B) ROC curves for Gram-positive dataset. (C) ROC curves for Full dataset.</p

    Information on the 31 bacterial species.

    No full text
    <p>Information on the 31 bacterial species.</p
    corecore