528 research outputs found

    Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>DNA microarray technology allows for the measurement of genome-wide expression patterns. Within the resultant mass of data lies the problem of analyzing and presenting information on this genomic scale, and a first step towards the rapid and comprehensive interpretation of this data is gene clustering with respect to the expression patterns. Classifying genes into clusters can lead to interesting biological insights. In this study, we describe an iterative clustering approach to uncover biologically coherent structures from DNA microarray data based on a novel clustering algorithm EP_GOS_Clust.</p> <p>Results</p> <p>We apply our proposed iterative algorithm to three sets of experimental DNA microarray data from experiments with the yeast <it>Saccharomyces cerevisiae </it>and show that the proposed iterative approach improves biological coherence. Comparison with other clustering techniques suggests that our iterative algorithm provides superior performance with regard to biological coherence. An important consequence of our approach is that an increasing proportion of genes find membership in clusters of high biological coherence and that the average cluster specificity improves.</p> <p>Conclusion</p> <p>The results from these clustering experiments provide a robust basis for extracting motifs and trans-acting factors that determine particular patterns of expression. In addition, the biological coherence of the clusters is iteratively assessed independently of the clustering. Thus, this method will not be severely impacted by functional annotations that are missing, inaccurate, or sparse.</p

    Dynamic probe of the interface in lamellar forming non-linear block copolymers of the (BA) 3 B and (BA) 3 B(AB) 3 type. A dielectric spectroscopy study

    Get PDF
    Abstract Dielectric spectroscopy is employed in lamellar forming non-linear block copolymers of the type (BA) 3 B and (BA) 3 B(AB) 3 based on polyisoprene (A) and polystyrene (B), at temperatures well below the order-to-disorder transition temperature and below the glass transition temperature of the hard phase (polystyrene). We show here that dielectric spectroscopy can be used as a tool to probe the interface in ordered block copolymers with a basic triblock unit. Our estimate of the interfacial width is based on the mobility of the junction points at the interface and compares favorably with the estimated thickness from thermodynamics.

    Stationary probability density of stochastic search processes in global optimization

    Full text link
    A method for the construction of approximate analytical expressions for the stationary marginal densities of general stochastic search processes is proposed. By the marginal densities, regions of the search space that with high probability contain the global optima can be readily defined. The density estimation procedure involves a controlled number of linear operations, with a computational cost per iteration that grows linearly with problem size

    Identifying genetic interactions associated with late-onset Alzheimer's disease

    Get PDF
    Background: Identifying genetic interactions in data obtained from genome-wide association studies (GWASs) can help in understanding the genetic basis of complex diseases. The large number of single nucleotide polymorphisms (SNPs) in GWASs however makes the identification of genetic interactions computationally challenging. We developed the Bayesian Combinatorial Method (BCM) that can identify pairs of SNPs that in combination have high statistical association with disease. Results: We applied BCM to two late-onset Alzheimer's disease (LOAD) GWAS datasets to identify SNPs that interact with known Alzheimer associated SNPs. We also compared BCM with logistic regression that is implemented in PLINK. Gene Ontology analysis of genes from the top 200 dataset SNPs for both GWAS datasets showed overrepresentation of LOAD-related terms. Four genes were common to both datasets: APOE and APOC1, which have well established associations with LOAD, and CAMK1D and FBXL13, not previously linked to LOAD but having evidence of involvement in LOAD. Supporting evidence was also found for additional genes from the top 30 dataset SNPs. Conclusion: BCM performed well in identifying several SNPs having evidence of involvement in the pathogenesis of LOAD that would not have been identified by univariate analysis due to small main effect. These results provide support for applying BCM to identify potential genetic variants such as SNPs from high dimensional GWAS datasets

    Optimization of minimum set of protein–DNA interactions: a quasi exact solution with minimum over-fitting

    Get PDF
    Motivation: A major limitation in modeling protein interactions is the difficulty of assessing the over-fitting of the training set. Recently, an experimentally based approach that integrates crystallographic information of C2H2 zinc finger–DNA complexes with binding data from 11 mutants, 7 from EGR finger I, was used to define an improved interaction code (no optimization). Here, we present a novel mixed integer programming (MIP)-based method that transforms this type of data into an optimized code, demonstrating both the advantages of the mathematical formulation to minimize over- and under-fitting and the robustness of the underlying physical parameters mapped by the code

    Simplicity of condensed matter at its core: Generic definition of a Roskilde-simple system

    Get PDF
    The theory of isomorphs is reformulated by defining Roskilde-simple systems (those with isomorphs) by the property that the order of the potential energies of configurations at one density is maintained when these are scaled uniformly to a different density. Isomorphs remain curves in the thermodynamic phase diagram along which structure, dynamics, and excess entropy are invariant, implying that the phase diagram is effectively one-dimensional with respect to many reduced-unit properties. In contrast to the original formulation of the isomorph theory, however, the density-scaling exponent is not exclusively a function of density and the isochoric heat capacity is not an exact isomorph invariant. A prediction is given for the latter quantity's variation along the isomorphs. Molecular dynamics simulations of the Lennard-Jones and Lennard-Jones Gaussian systems validate the new approach

    Hidden scale invariance of metals

    Get PDF
    Density functional theory (DFT) calculations of 58 liquid elements at their triple point show that most metals exhibit near proportionality between thermal fluctuations between virial and potential-energy in the isochoric ensemble. This demonstrates a general "hidden" scale invariance of metals making the dense part of the thermodynamic phase diagram effectively one dimensional with respect to structure and dynamics. DFT computed density scaling exponents, related to the Gr{\"u}neisen parameter, are in good agreement with experimental values for 16 elements where reliable data were available. Hidden scale invariance is demonstrated in detail for magnesium by showing invariance of structure and dynamics. Computed melting curves of period three metals follow curves with invariance (isomorphs). The experimental structure factor of magnesium is predicted by assuming scale invariant inverse power-law (IPL) pair interactions. However, crystal packings of several transition metals (V, Cr, Mn, Fe, Nb, Mo, Ta, W and Hg), most post-transition metals (Ga, In, Sn, and Tl) and the metalloids Si and Ge cannot be explained by the IPL assumption. Thus, hidden scale invariance can be present even when the IPL-approximation is inadequate. The virial-energy correlation coefficient of iron and phosphorous is shown to increase at elevated pressures. Finally, we discuss how scale invariance explains the Gr{\"u}neisen equation of state and a number of well-known empirical melting and freezing rules.Comment: 12 pages, 11 figure
    corecore