4,355 research outputs found

    Microsatellites reveal a strong subdivision of genetic structure in Chinese populations of the mite Tetranychus urticae Koch (Acari: Tetranychidae)

    Get PDF
    Contact: [email protected] audienceBACKGROUND: Two colour forms of the two-spotted spider mite (Tetranychus urticae Koch) coexist in China: a red (carmine) form, which is considered to be native and a green form which is considered to be invasive. The population genetic diversity and population genetic structure of this organism were unclear in China, and there is a controversy over whether they constitute distinct species. To address these issues, we genotyped a total of 1,055 individuals from 18 red populations and 7 green populations in China using eight microsatellite loci. RESULTS: We identified 109 alleles. We found a highly significant genetic differentiation among the 25 populations (global F-ST = 0.506, global F-ST({ENA}) = 0.473) and a low genetic diversity in each population. In addition, genetic diversity of the red form mites was found to be higher than the green form. Pearson correlations between statistics of variation (AR and H-E) and geographic coordinates (latitude and longitude) showed that the genetic diversity of the red form was correlated with latitude. Using Bayesian clustering, we divided the Chinese mite populations into five clades which were well congruent with their geographic distributions. CONCLUSIONS: Spider mites possess low levels of genetic diversity, limit gene flow between populations and significant and IBD (isolation by distance) effect. These factors in turn contribute to the strong subdivision of genetic structure. In addition, population genetic structure results don't support the separation of the two forms of spider mite into two species. The morphological differences between the two forms of mites may be a result of epigenetic effects

    Unsupervised Text Topic-Related Gene Extraction for Large Unbalanced Datasets

    Get PDF
    There is a common notion that traditional unsupervised feature extraction algorithms follow the assumption that the distribution of the different clusters in a dataset is balanced. However, feature selection is guided by the calculation of similarities among features when topic keywords are extracted from a large number of unmarked, unbalanced text datasets. As a result, the selected features cannot truly reflect the information of the original data set, which thus affects the subsequent performance of classifiers. To solve this problem, a new method of extracting unsupervised text topic-related genes is proposed in this paper. Firstly, a sample cluster group is obtained by factor analysis and a density peak algorithm, based on which the dataset is marked. Then, considering the influence of the unbalanced distribution of sample clusters on feature selection, the CHI statistical matrix feature selection method, which combines average local density and information entropy together, is used to strengthen the features of low-density small-sample clusters. Finally, a related gene extraction method based on the exploration of high-order relevance in multidimensional statistical data is described, which uses independent component analysis to enhance the generalisability of the selected features. In this way, unsupervised text topic-related genes can be extracted from large unbalanced datasets. The results of experiments suggest that the proposed method of extracting unsupervised text topic-related genes is better than existing methods in extracting text subject terms from low-density small-sample clusters, and has higher prematurity and feature dimension-reduction ability

    Prediction models for solitary pulmonary nodules based on curvelet textural features and clinical parameters

    Get PDF
    Lung cancer, one of the leading causes of cancer-related deaths, usually appears as solitary pulmonary nodules (SPNs) which are hard to diagnose using the naked eye. In this paper, curvelet-based textural features and clinical parameters are used with three prediction models [a multilevel model, a least absolute shrinkage and selection operator (LASSO) regression method, and a support vector machine (SVM)] to improve the diagnosis of benign and malignant SPNs. Dimensionality reduction of the original curvelet-based textural features was achieved using principal component analysis. In addition, non-conditional logistical regression was used to find clinical predictors among demographic parameters and morphological features. The results showed that, combined with 11 clinical predictors, the accuracy rates using 12 principal components were higher than those using the original curvelet-based textural features. To evaluate the models, 10-fold cross validation and back substitution were applied. The results obtained, respectively, were 0.8549 and 0.9221 for the LASSO method, 0.9443 and 0.9831 for SVM, and 0.8722 and 0.9722 for the multilevel model. All in all, it was found that using curvelet-based textural features after dimensionality reduction and using clinical predictors, the highest accuracy rate was achieved with SVM. The method may be used as an auxiliary tool to differentiate between benign and malignant SPNs in CT images

    Ethyl 1-(6-chloro-3-pyridylmeth­yl)-5-ethoxy­methyl­eneamino-1H-1,2,3-triazole-4-carboxyl­ate

    Get PDF
    In the title compound, C14H16ClN5O3, there is evidence for significant electron delocalization in the triazolyl system. Intra­molecular C—H⋯O and inter­molecular C—H⋯O and C—H⋯N hydrogen bonds stabilize the structure
    corecore