Search CORE

1,599 research outputs found

Optimal Embeddings of Distance Regular Graphs into Euclidean Spaces

Author: Bannai
Bourgain
Brouwer
Enflo
Frank Vallentin
Goemans
Indyk
Linial
Linial
Linial
Matousek
Vandenberghe
Publication venue: 'Elsevier BV'
Publication date: 24/05/2007
Field of study

In this paper we give a lower bound for the least distortion embedding of a distance regular graph into Euclidean space. We use the lower bound for finding the least distortion for Hamming graphs, Johnson graphs, and all strongly regular graphs. Our technique involves semidefinite programming and exploiting the algebra structure of the optimization problem so that the question of finding a lower bound of the least distortion is reduced to an analytic question about orthogonal polynomials.Comment: 10 pages, (v3) some corrections, accepted in Journal of Combinatorial Theory, Series

arXiv.org e-Print Archive

Crossref

CWI's Institutional Repository

On the number of 4-cycles in a tournament

Author: Linial Nati
Morgenstern Avraham
Publication venue
Publication date: 01/01/2015
Field of study

T

is an

n

-vertex tournament with a given number of

3

-cycles, what can be said about the number of its

4

-cycles? The most interesting range of this problem is where

T

is assumed to have

c\cdot n^3

cyclic triples for some

c>0

and we seek to minimize the number of

4

-cycles. We conjecture that the (asymptotic) minimizing

T

is a random blow-up of a constant-sized transitive tournament. Using the method of flag algebras, we derive a lower bound that almost matches the conjectured value. We are able to answer the easier problem of maximizing the number of

4

-cycles. These questions can be equivalently stated in terms of transitive subtournaments. Namely, given the number of transitive triples in

T

, how many transitive quadruples can it have? As far as we know, this is the first study of inducibility in tournaments.Comment: 11 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX

Safe Functional Inference for Uncharacterized Viral Proteins

Author: Michal Linial
Yaniv Loewenstein
Publication venue
Publication date: 14/08/2008
Field of study

The explosive growth in the number of sequenced genomes has created a flood of protein sequences with unknown structure and function. A routine protocol for functional inference on an input query sequence is based on a database search for homologues. Searching a query against a non-redundant database using BLAST (or more advanced methods, e.g. PSI-BLAST) suffers from several drawbacks: (i) a local alignment often dominates the results; (ii) the reported statistical score (i.e. E-value) is often misleading; (iii) incorrect annotations may be falsely propagated. 
Several systematic methods are commonly used to assign sequences with functions on a genomic scale. In Pfam (1) and resources alike, statistical profiles (HMMs) are built from semi-manual multiple alignments of seed homologous sequences. The profiles are then used to scan genomic sequences for additional family members. The drawbacks of this scheme are: (i) only families with a predetermined seed are considered; (ii) the query must have a detectable sequence similarity to seed sequences; (iii) attention to internal relationships among the family members or the relations to other families is lacking; (iv) family membership is often set by pre-determined thresholds.
An alternative to profile or model based methods for functional inference relies on a hierarchical clustering of the protein space, as implemented in the ProtoNet approach (2). The fundamental principle is the creation of a tree that captures evolutionary relatedness among protein families. The tree construction is fully automatic, and is based only on reported BLAST similarities among clustered sequences. The tree provides protein groupings in continuous evolutionary granularities, from closely related to distant superfamilies. Clusters in the ProtoNet tree show high correspondence with homologous sequence (i.e. Pfam and InterPro), functional (i.e. E.C. classification) and structural (i.e., SCOP) families (3). A new clustering scheme (4) has provided an extensive update to the ProtoNet process, which is now based on direct clustering of all detectable sequence similarities. 
Herein, we use the ProtoNet resource to develop a methodology for a consistent and safe functional inference for remote families. We illustrate the success of our approach towards clusters of poorly characterized viral proteins. Viral sequences are characterized by a rapid evolutionary rate which drives viral families to be even more remote (sequence-similarity-wise). Thus, functional inference for viral families is apparently an unsolved task. Despite this inherent difficulty, the new ProtoNet tree scaffold reliably captures weak evolutionary connections for viral families, which were previously overlooked. We take advantage of this, and propose new functional assignments for viral protein families.&#xa

Crossref

Nature Precedings