Search CORE

9 research outputs found

Author index

Author
Publication venue: Published by Elsevier B.V.
Publication date
Field of study

Screening synteny blocks in pairwise genome comparisons through integer programming

Author: Andrew H Paterson
BJ Haas
Brent Pedersen
C Simillion
C Simillion
C Soderlund
E Lyons
E Lyons
Eric Lyons
G Tesler
H Tang
H Tang
Haibao Tang
HW Six
James C Schnable
JE Bowers
JM Aury
JM Catchen
K Yogeeswaran
L Cui
M Kellis
Michael Freeling
O Jaillon
O Jaillon
P Pevzner
Q Peng
R Warren
RM Karp
S Schwartz
SF Altschul
W Miller
WJ Kent
X Wang
Y Van de Peer
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events. Results We have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP), which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes). Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes in the rosid superorder that the quota-based screening can eliminate ambiguous synteny blocks and focus on specific genomic evolutionary events, like the divergence of lineages (in cross-species comparisons) and the most recent WGD (in self comparisons). Conclusions The QUOTA-ALIGN algorithm screens a set of synteny blocks to retain only those compatible with a user specified ploidy relationship between two genomes. These blocks, in turn, may be used for additional downstream analyses such as identifying true orthologous regions in interspecific comparisons. There are two major contributions of QUOTA-ALIGN: 1) reducing the block screening task to a BIP problem, which is novel; 2) providing an efficient software pipeline starting from all-against-all BLAST to the screened synteny blocks with dot plot visualizations. Python codes and full documentations are publicly available <url>http://github.com/tanghaibao/quota-alignment</url>. QUOTA-ALIGN program is also integrated as a major component in SynMap <url>http://genomevolution.com/CoGe/SynMap.pl</url>, offering easier access to thousands of genomes for non-programmers.</p

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Arizona

eScholarship - University of California

Local improvement algorithms for a path packing problem: A performance analysis based on linear programming

Author: Bontridder K.M.J. de
Halldórsson B.V.
Halldórsson M.M.
Hurkens C.A.J. (Cor)
Lenstra J.K. (Jan Karel)
Ravi R.
Stougie L. (Leen)
Publication venue: 'Elsevier BV'
Publication date: 01/01/2021
Field of study

Given a graph, we wish to find a maximum number of vertex-disjoint paths of length 2. We propose a series of local improvement algorithms for this problem, and present a linear-programming based method for analyzing their performance

VU Research Portal

CWI's Institutional Repository

Pure OAI Repository

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Nonoverlapping Local Alignments (Weighted Independent Sets of Axis Parallel Rectangles)

Author: Babu Narayanan
R. Ravi
Vineet Bafna
Publication venue
Publication date: 01/01/1995
Field of study

We consider the following problem motivated by an application in computational molecular biology. We are given a set of weighted axis-parallel rectangles such that for any pair of rectangles and either axis, the projection of one rectangle does not enclose that of the other. Define a pair to be independent if their projections in both axes are disjoint. The problem is to find a maximum-weight independent subset of rectangles. We show that the problem is NP-hard even in the uniform case when all the weights are the same. We analyze the performance of a natural local-improvement heuristic for the general problem and prove a performance ratio of 3.25. We extend the heuristic to the problem of finding a maximum-weight independent set in (d + 1)-claw free graphs, and show a tight performance ratio of d \Gamma 1 + 1 d . A performance ratio of d 2 was known for the heuristic when applied to the uniform case. Our contributions are proving the hardness of the problem and providing a tight..

CiteSeerX

Elsevier - Publisher Connector

Nonoverlapping local alignments (weighted independent sets of axis parallel rectangles)

Author: F. R. K. Chung
J. Kececioglu
S. B. Needleman
T. F. Smith
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Nonoverlapping local alignments (weighted independent sets of axis-parallel rectangles)

Author: Arora
Babu Narayanan
Bafna
Bafna
Berman
Bollobas
Chung
Halldórsson
Hannenhalli
Hannenhalli
Kececioglu
Kececioglu
Kececioglu
Marathe
Needleman
R. Ravi
Smith
Vineet Bafna
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Interval scheduling and colorful independent sets

Author: Mathias Weller
Matthias Mnich
Matthias Mnich
Rene Bevern
Rolf Niedermeier
Rolf Niedermeier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/07/2014
Field of study

Numerous applications in scheduling, such as resource allocation or steel manufacturing, can be modeled using the NP-hard Independent Set problem (given an undirected graph and an integer k, find a set of at least k pairwise non-adjacent vertices). Here, one encounters special graph classes like 2-union graphs (edge-wise unions of two interval graphs) and strip graphs (edge-wise unions of an interval graph and a cluster graph), on which Independent Set remains NP-hard but admits constant-ratio approximations in polynomial time. We study the parameterized complexity of Independent Set on 2-union graphs and on subclasses like strip graphs. Our investigations significantly benefit from a new structural "compactness" parameter of interval graphs and novel problem formulations using vertex-colored interval graphs. Our main contributions are: 1. We show a complexity dichotomy: restricted to graph classes closed under induced subgraphs and disjoint unions, Independent Set is polynomial-time solvable if both input interval graphs are cluster graphs, and is NP-hard otherwise. 2. We chart the possibilities and limits of effective polynomial-time preprocessing (also known as kernelization). 3. We extend Halld\'orsson and Karlsson (2006)'s fixed-parameter algorithm for Independent Set on strip graphs parameterized by the structural parameter "maximum number of live jobs" to show that the problem (also known as Job Interval Selection) is fixed-parameter tractable with respect to the parameter k and generalize their algorithm from strip graphs to 2-union graphs. Preliminary experiments with random data indicate that Job Interval Selection with up to fifteen jobs and 5*10^5 intervals can be solved optimally in less than five minutes.Comment: This revision does not contain Theorem 7 of the first revision, whose proof contained an erro

arXiv.org e-Print Archive

CiteSeerX

Maastricht University Research Portal

Crossref

INRIA a CCSD electronic archive server

Discovery of Unconventional Patterns for Sequence Analysis: Theory and Algorithms

Author: BATTAGLIA GIOVANNI
Publication venue: 'Pisa University Press'
Publication date: 19/12/2011
Field of study

The biology community is collecting a large amount of raw data, such as the genome sequences of organisms, microarray data, interaction data such as gene-protein interactions, protein-protein interactions, etc. This amount is rapidly increasing and the process of understanding the data is lagging behind the process of acquiring it. An inevitable first step towards making sense of the data is to study their regularities focusing on the non-random structures appearing surprisingly often in the input sequences: patterns. In this thesis we discuss three incarnations of the pattern discovery task, exploring three types of patterns that can model different regularities of the input dataset. While mask patterns have been designed to model short repeated biological sequences, showing a high conservation of their content at some specific positions, permutation patterns have been designed to detect repeated patterns whose parts maintain their physical adjacency but not their ordering in all the pattern occurrences. Transposons, instead, model mobile sequences in the input dataset, which can be discovered by comparing different copies of the same input string, detecting large insertions and deletions in their alignment

Electronic Thesis and Dissertation Archive - Università di Pisa