Search CORE

294 research outputs found

On the origin of distribution patterns of motifs in biological networks

Author: Konagurthu Arun S
Lesk Arthur M
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Inventories of small subgraphs in biological networks have identified commonly-recurring patterns, called motifs. The inference that these motifs have been selected for function rests on the idea that their occurrences are significantly more frequent than random. Results Our analysis of several large biological networks suggests, in contrast, that the frequencies of appearance of common subgraphs are similar in natural and corresponding random networks. Conclusion Indeed, certain topological features of biological networks give rise naturally to the common appearance of the motifs. We therefore question whether frequencies of occurrences are reasonable evidence that the structures of motifs have been selected for their functional contribution to the operation of networks.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

On Universal Codes for Integers: Wallace Tree, Elias Omega and Variations

Author: Allison Lloyd
Konagurthu Arun
Schmidt Daniel
Publication venue
Publication date: 12/06/2019
Field of study

A universal code for the (positive) integers can be used to store or compress a sequence of integers. Every universal code implies a probability distribution on integers. This implied distribution may be a reasonable choice when the true distribution of a source of integers is unknown. Wallace Tree Code (WTC) is a universal code for integers based on binary trees. We give the encoding and decoding routines for WTC and analyse the properties of the code in comparison to two well-known codes, the Fibonacci and Elias omega codes. Some improvements on the Elias omega code are also described and examined.Comment: 8 pages, 8 figures (3 figure image files

arXiv.org e-Print Archive

A fast indexing approach for protein structure comparison

Author: Arun S Konagurthu
James Bailey
Kotagiri Ramamohanarao
Lei Zhang
Publication venue: Springer Nature
Publication date: 01/01/2010
Field of study

BACKGROUND: Protein structure comparison is a fundamental task in structural biology. While the number of known protein structures has grown rapidly over the last decade, searching a large database of protein structures is still relatively slow using existing methods. There is a need for new techniques which can rapidly compare protein structures, whilst maintaining high matching accuracy. RESULTS: We have developed IR Tableau, a fast protein comparison algorithm, which leverages the tableau representation to compare protein tertiary structures. IR tableau compares tableaux using information retrieval style feature indexing techniques. Experimental analysis on the ASTRAL SCOP protein structural domain database demonstrates that IR Tableau achieves two orders of magnitude speedup over the search times of existing methods, while producing search results of comparable accuracy. CONCLUSION: We show that it is possible to obtain very significant speedups for the protein structure comparison problem, by employing an information retrieval style approach for indexing proteins. The comparison accuracy achieved is also strong, thus opening the way for large scale processing of very large protein structure databases

Springer - Publisher Connector

PubMed Central

University of Melbourne Institutional Repository

How precise are reported protein coordinate data?

Author: Abramson David
Allison Lloyd
Konagurthu Arun S.
Lesk Arthur M.
Stuckey Peter J.
Publication venue: 'International Union of Crystallography (IUCr)'
Publication date: 01/03/2014
Field of study

Atomic coordinates in the Worldwide Protein Data Bank (wwPDB) are generally reported to greater precision than the experimental structure determinations have actually achieved. By using information theory and data compression to study the compressibility of protein atomic coordinates, it is possible to quantify the amount of randomness in the coordinate data and thereby to determine the realistic precision of the reported coordinates. On average, the value of each Cα coordinate in a set of selected protein structures solved at a variety of resolutions is good to about 0.1 Å

University of Queensland eSpace

A Generalization of the Convex Kakeya Problem

Author: A. Konagurthu
A.S. Besicovitch
A.S. Besicovitch
A.S. Besicovitch
B. Fisher
D. Ohmann
G. Pál
G.D. Chakerian
I. Laba
I.J. Schoenberg
K. Bezdek
K. Bezdek
L. Fejes Tóth
O. Perron
S. Kakeya
T. Tao
Publication venue
Publication date: 01/01/2012
Field of study

Given a set of line segments in the plane, not necessarily finite, what is a convex region of smallest area that contains a translate of each input segment? This question can be seen as a generalization of Kakeya's problem of finding a convex region of smallest area such that a needle can be rotated through 360 degrees within this region. We show that there is always an optimal region that is a triangle, and we give an optimal \Theta(n log n)-time algorithm to compute such a triangle for a given set of n segments. We also show that, if the goal is to minimize the perimeter of the region instead of its area, then placing the segments with their midpoint at the origin and taking their convex hull results in an optimal solution. Finally, we show that for any compact convex figure G, the smallest enclosing disk of G is a smallest-perimeter region containing a translate of every rotated copy of G.Comment: 14 pages, 9 figure

arXiv.org e-Print Archive

Crossref

ScholarWorks@UNIST

포항공과대학교

The divergence time of protein structures modelled by Markov matrices and its relation to the divergence of sequences

Author: Allison Lloyd
de la Banda Maria Garcia
Konagurthu Arun S.
Rajapaksa Sandun
Stuckey Peter J.
Publication venue
Publication date: 10/08/2023
Field of study

A complete time-parameterized statistical model quantifying the divergent evolution of protein structures in terms of the patterns of conservation of their secondary structures is inferred from a large collection of protein 3D structure alignments. This provides a better alternative to time-parameterized sequence-based models of protein relatedness, that have clear limitations dealing with twilight and midnight zones of sequence relationships. Since protein structures are far more conserved due to the selection pressure directly placed on their function, divergence time estimates can be more accurate when inferred from structures. We use the Bayesian and information-theoretic framework of Minimum Message Length to infer a time-parameterized stochastic matrix (accounting for perturbed structural states of related residues) and associated Dirichlet models (accounting for insertions and deletions during the evolution of protein domains). These are used in concert to estimate the Markov time of divergence of tertiary structures, a task previously only possible using proxies (like RMSD). By analyzing one million pairs of homologous structures, we yield a relationship between the Markov divergence time of structures and of sequences. Using these inferred models and the relationship between the divergence of sequences and structures, we demonstrate a competitive performance in secondary structure prediction against neural network architectures commonly employed for this task. The source code and supplementary information are downloadable from \url{http://lcb.infotech.monash.edu.au/sstsum}.Comment: 12 pages, 6 figure

arXiv.org e-Print Archive

Genome-Wide Survey of MicroRNA - Transcription Factor Feed-Forward Regulatory Circuits in Human

Author: Alvarez-Garcia
Angela Re
Calin
Chan
Chen
Corà
Corà
Corà
Daniela Taverna
Davide Corá
Elnitski
Esquela-Kerscher
Filipowicz
Gershengom
Griffiths-Jones
Gudmundsson
He
Hornstein
Hubbard
Iorio
Joglekar
John
Konagurthu
Krek
Ladd
Lai
Landgraf
Lee
Lewis
Lewis
Liu
Loots
Martinez
Matys
Mazurie
Michele Caselle
Milo
Nielsen
O’Donnell
Pan
Pesole
Phan
Saini
Shalgi
Shen-Orr
Tsang
Wagner
Wilkerson
Xie
Zeller
Zhang
Zhao
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/01/2009
Field of study

In this work, we describe a computational framework for the genome-wide identification and characterization of mixed transcriptional/post-transcriptional regulatory circuits in humans. We concentrated in particular on feed-forward loops (FFL), in which a master transcription factor regulates a microRNA, and together with it, a set of joint target protein coding genes. The circuits were assembled with a two step procedure. We first constructed separately the transcriptional and post-transcriptional components of the human regulatory network by looking for conserved over-represented motifs in human and mouse promoters, and 3'-UTRs. Then, we combined the two subnetworks looking for mixed feed-forward regulatory interactions, finding a total of 638 putative (merged) FFLs. In order to investigate their biological relevance, we filtered these circuits using three selection criteria: (I) GeneOntology enrichment among the joint targets of the FFL, (II) independent computational evidence for the regulatory interactions of the FFL, extracted from external databases, and (III) relevance of the FFL in cancer. Most of the selected FFLs seem to be involved in various aspects of organism development and differentiation. We finally discuss a few of the most interesting cases in detail.Comment: 51 pages, 5 figures, 4 tables. Supporting information included. Accepted for publication in Molecular BioSystem

arXiv.org e-Print Archive

Crossref

A fast indexing approach for protein structure comparison

Author: A Lesk
A Stivala
A Tversky
AG Murzin
AM Lesk
AP Kamat
Arun S Konagurthu
AS Konagurthu
AS Konagurthu
CA Orengo
E Krissinel
ES Shih
ES Shih
ESC Shih
FM Richards
HM Berman
I Michalopoulos
J Shapiro
James Bailey
JF Gibrat
Kotagiri Ramamohanarao
L Holm
Lei Zhang
M Carpentier
O Carugo
P Jaccard
S Kirillova
SE Brenner
SF Altschul
T Madej
W Lo
W Lo
W Lo
WL Delano
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref