580 research outputs found

    Comparison of Similarity Coefficients used for Cluster Analysis with Amplified Fragment Length Polymorphism Markers in the Silkworm, Bombyx mori

    Get PDF
    Establishing accurate genetic similarity and dissimilarity between individuals is an essential and decisive point for clustering and analyzing inter and intra population diversity because different similarity and dissimilarity indices may yield contradictory outcomes. We assessed the variations caused by three commonly used similarity coefficients including Jaccard, Sorensen-Dice and Simple matching in the clustering and ordination of seven Iranian native silkworm, Bombyx mori L. (Lepidoptera: Bombycidae), strains analyzed by amplified fragment length polymorphism markers. Comparisons among the similarity coefficients were made using the Spearman correlation analysis, dendrogram evaluation (visual inspection and consensus fork index - CIC), projection efficiency in a two-dimensional space, and groups formed by the Tocher optimization procedure. The results demonstrated that for almost all methodologies, the Jaccard and Sorensen-Dice coefficients revealed extremely close results, because both of them exclude negative co-occurrences. Due to the fact that there is no guarantee that the DNA regions with negative cooccurrences between two strains are indeed identical, the use of coefficients such as Jaccard and Sorensen-Dice that do not include negative co-occurrences was imperative for closely related organisms

    Genetic variation of wild and hatchery populations of the catla Indian major carp (Catla catla Hamilton 1822: Cypriniformes, Cyprinidae) revealed by RAPD markers

    Get PDF
    Genetic variation is a key component for improving a stock through selective breeding programs. Randomly amplified polymorphic DNA (RAPD) markers were used to assess genetic variation in three wild population of the catla carp (Catla catla Hamilton 1822) in the Halda, Jamuna and Padma rivers and one hatchery population in Bangladesh. Five decamer random primers were used to amplify RAPD markers from 30 fish from each population. Thirty of the 55 scorable bands were polymorphic, indicating some degree of genetic variation in all the populations. The proportion of polymorphic loci and gene diversity values reflected a relatively higher level of genetic variation in the Halda population. Sixteen of the 30 polymorphic loci showed a significant (p < 0.05, p < 0.01, p < 0.001) departure from homogeneity and the FST values in the different populations indicated some degree of genetic differentiation in the population pairs. Estimated genetic distances between populations were directly correlated with geographical distances. The unweighted pair group method with averages (UPGMA) dendrogram showed two clusters, the Halda population forming one cluster and the other populations the second cluster. Genetic variation of C. catla is a useful trait for developing a good management strategy for maintaining genetic quality of the species

    New resampling method for evaluating stability of clusters

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Hierarchical clustering is a widely applied tool in the analysis of microarray gene expression data. The assessment of cluster stability is a major challenge in clustering procedures. Statistical methods are required to distinguish between real and random clusters. Several methods for assessing cluster stability have been published, including resampling methods such as the bootstrap.</p> <p>We propose a new resampling method based on continuous weights to assess the stability of clusters in hierarchical clustering. While in bootstrapping approximately one third of the original items is lost, continuous weights avoid zero elements and instead allow non integer diagonal elements, which leads to retention of the full dimensionality of space, i.e. each variable of the original data set is represented in the resampling sample.</p> <p>Results</p> <p>Comparison of continuous weights and bootstrapping using real datasets and simulation studies reveals the advantage of continuous weights especially when the dataset has only few observations, few differentially expressed genes and the fold change of differentially expressed genes is low.</p> <p>Conclusion</p> <p>We recommend the use of continuous weights in small as well as in large datasets, because according to our results they produce at least the same results as conventional bootstrapping and in some cases they surpass it.</p

    CATCHprofiles: Clustering and Alignment Tool for ChIP Profiles

    Get PDF
    Chromatin Immuno Precipitation (ChIP) profiling detects in vivo protein-DNA binding, and has revealed a large combinatorial complexity in the binding of chromatin associated proteins and their post-translational modifications. To fully explore the spatial and combinatorial patterns in ChIP-profiling data and detect potentially meaningful patterns, the areas of enrichment must be aligned and clustered, which is an algorithmically and computationally challenging task. We have developed CATCHprofiles, a novel tool for exhaustive pattern detection in ChIP profiling data. CATCHprofiles is built upon a computationally efficient implementation for the exhaustive alignment and hierarchical clustering of ChIP profiling data. The tool features a graphical interface for examination and browsing of the clustering results. CATCHprofiles requires no prior knowledge about functional sites, detects known binding patterns “ab initio”, and enables the detection of new patterns from ChIP data at a high resolution, exemplified by the detection of asymmetric histone and histone modification patterns around H2A.Z-enriched sites. CATCHprofiles' capability for exhaustive analysis combined with its ease-of-use makes it an invaluable tool for explorative research based on ChIP profiling data
    corecore