1,245 research outputs found

    Spherical distance metrics applied to protein structure classification

    Get PDF
    Observed protein structures usually represent energetically favorable conformations. While not all observed structures are necessarily functional, it is generally agreed that protein structure is closely related to protein function. Given a collection of proteins sharing a common global structure, variations in their local structures at specific, critical locations may result in different biological functions. Structural relationships among proteins are important in the study of the evolution of proteins as well as in drug design and development. Analysis of geometrical 3D protein structure has been shown to be effective with respect to classifying proteins. Prior work has shown that the Double Centroid Reduced Representation (DCRR) model is a useful geometric representation for protein structure with respect to visual models, reducing the quantity of modeled information for each amino acid, yet retaining the most important geometrical and chemical features of each: the centroids of the backbone and of the side-chain. DCRR has not yet been applied in the calculation of geometric structural similarity. Meanwhile, multi-dimensional indexing (MDI) of protein structure combines protein structural analysis with distance metrics to facilitate structural similarity queries and is also used for clustering protein structures into related groups. In this respect, the combination of geometric models with MDI has been shown to be effective. Prior work, notably Distance and Density-based Protein Indexing (DDPIn), applies MDI to protein models based on the geometry of the C-alpha backbone. DDPIn\u27s distance metrics are based on radial and density functions that incorporate spherical-based metrics, and the indices are built from metric-tree (M-tree) structures. This work combines DCRR with DDPIn for the development of new DCRR centroid-based metrics: spherical binning distance and inter-centroid spherical distance. The use of DCRR models will provide additional significant structural information via the inclusion of side-chain centroids. Additionally, the newly developed distance metric functions combined with DCRR and M-tree indexing attempt to improve upon the performance of prior work (DDPIn), given the same data set, with respect to both individual k-nearest neighbor (kNN) search queries as well as clustering all proteins in the index

    Neighborhood Selection for Thresholding-based Subspace Clustering

    Full text link
    Subspace clustering refers to the problem of clustering high-dimensional data points into a union of low-dimensional linear subspaces, where the number of subspaces, their dimensions and orientations are all unknown. In this paper, we propose a variation of the recently introduced thresholding-based subspace clustering (TSC) algorithm, which applies spectral clustering to an adjacency matrix constructed from the nearest neighbors of each data point with respect to the spherical distance measure. The new element resides in an individual and data-driven choice of the number of nearest neighbors. Previous performance results for TSC, as well as for other subspace clustering algorithms based on spectral clustering, come in terms of an intermediate performance measure, which does not address the clustering error directly. Our main analytical contribution is a performance analysis of the modified TSC algorithm (as well as the original TSC algorithm) in terms of the clustering error directly.Comment: ICASSP 201

    On sphere-filling ropes

    Get PDF
    What is the longest rope on the unit sphere? Intuition tells us that the answer to this packing problem depends on the rope's thickness. For a countably infinite number of prescribed thickness values we construct and classify all solution curves. The simplest ones are similar to the seamlines of a tennis ball, others exhibit a striking resemblance to Turing patterns in chemistry, or to ordered phases of long elastic rods stuffed into spherical shells.Comment: 15 pages, 8 figure

    The exponential map is chaotic: An invitation to transcendental dynamics

    Get PDF
    We present an elementary and conceptual proof that the complex exponential map is "chaotic" when considered as a dynamical system on the complex plane. (This result was conjectured by Fatou in 1926 and first proved by Misiurewicz 55 years later.) The only background required is a first undergraduate course in complex analysis.Comment: 22 pages, 4 figures. (Provisionally) accepted for publication the American Mathematical Monthly. V2: Final pre-publication version. The article has been revised, corrected and shortened by 14 pages; see Version 1 for a more detailed discussion of further properties of the exponential map and wider transcendental dynamic
    • …
    corecore