11 research outputs found

A Maximum Parsimony Principle for Multichromosomal Complex Genome Rearrangements

Author: Raphael Benjamin J.
Simonaitis Pijus
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)
Publication date: 01/01/2022

Motivation. Complex genome rearrangements, such as chromothripsis and chromoplexy, are common in cancer and have also been reported in individuals with various developmental and neurological disorders. These mutations are proposed to involve simultaneous breakage of the genome at many loci and rejoining of these breaks that produce highly rearranged genomes. Since genome sequencing measures only the novel adjacencies present at the time of sequencing, determining whether a collection of novel adjacencies resulted from a complex rearrangement is a complicated and ill-posed problem. Current heuristics for this problem often result in the inference of complex rearrangements that affect many chromosomes. Results. We introduce a model for complex rearrangements that builds upon the methods developed for analyzing simple genome rearrangements such as inversions and translocations. While nearly all of these existing methods use a maximum parsimony assumption of minimizing the number of rearrangements, we propose an alternative maximum parsimony principle based on minimizing the number of chromosomes involved in a rearrangement scenario. We show that our model leads to inference of more plausible sequences of rearrangements that better explain a complex congenital rearrangement in a human genome and chromothripsis events in 22 cancer genomes

Princeton University Open Access Repository

Dagstuhl Research Online Publication Server

Weighted Minimum-Length Rearrangement Scenarios

Author: Chateau Annie
Simonaitis Pijus
Swenson Krister M.
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 19th International Workshop on Algorithms in Bioinformatics (WABI 2019)
Publication date: 01/01/2019

We present the first known model of genome rearrangement with an arbitrary real-valued weight function on the rearrangements. It is based on the dominant model for the mathematical and algorithmic study of genome rearrangement, Double Cut and Join (DCJ). Our objective function is the sum or product of the weights of the DCJs in an evolutionary scenario, and the function can be minimized or maximized. If the likelihood of observing an independent DCJ was estimated based on biological conditions, for example, then this objective function could be the likelihood of observing the independent DCJs together in a scenario. We present an O(n^4)-time dynamic programming algorithm solving the Minimum Cost Parsimonious Scenario (MCPS) problem for co-tailed genomes with n genes (or syntenic blocks). Combining this with our previous work on MCPS yields a polynomial-time algorithm for general genomes. The key theoretical contribution is a novel link between the parsimonious DCJ (or 2-break) scenarios and quadrangulations of a regular polygon. To demonstrate that our algorithm is fast enough to treat biological data, we run it on syntenic blocks constructed for Human paired with Chimpanzee, Gibbon, Mouse, and Chicken. We argue that the Human and Gibbon pair is a particularly interesting model for the study of weighted genome rearrangements
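The objective described in this abstract (a sum or product of per-DCJ weights that is then minimized or maximized) can be illustrated with a minimal sketch. The function names `scenario_score` and `best_scenario` are hypothetical, and the brute-force comparison over an explicit list of candidate scenarios is only an illustration of the objective, not the O(n^4) dynamic programming algorithm from the paper.

```python
import math
from typing import Sequence


def scenario_score(weights: Sequence[float], combine: str = "sum") -> float:
    """Combine the weights of the DCJs in one scenario into a single value.

    `weights` holds one real-valued weight per DCJ operation (assumed to be
    given; how they are estimated is outside this sketch).
    """
    if combine == "sum":
        return math.fsum(weights)
    if combine == "product":
        return math.prod(weights)
    raise ValueError("combine must be 'sum' or 'product'")


def best_scenario(
    scenarios: Sequence[Sequence[float]],
    combine: str = "sum",
    maximize: bool = False,
) -> Sequence[float]:
    """Pick the candidate scenario with the smallest (or largest) score.

    This simply compares explicitly listed candidates; it does not
    enumerate or search the space of parsimonious scenarios.
    """
    pick = max if maximize else min
    return pick(scenarios, key=lambda s: scenario_score(s, combine))
```

With weights read as independent likelihoods, `combine="product"` and `maximize=True` would select the most likely of the listed scenarios; with weights read as costs, the defaults select the cheapest one.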

HAL-ENS-LYON

Dagstuhl Research Online Publication Server

(Re)introducing regular graph languages

Author: Gilroy Sorcha
Lopez Adam
Maneth Sebastian
Simonaitis Pijus
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2017

Crossref

Edinburgh Research Explorer

Scénarios évolutifs pondérés de réarrangements génomiques (Weighted evolutionary scenarios of genome rearrangements)

Author: Simonaitis Pijus
Publication venue: HAL CCSD
Publication date: 10/07/2020

Recent advances in sequencing technologies revealed the ubiquity of genome rearrangements between each and every one of us. These large-scale mutations rearrange segments of chromosomes and have a profound impact on genetic variation, disease, and evolution. The study of the consequences of rearrangements along with their molecular mechanisms, however, is still in its infancy. Given extant genomes, we are interested in tracing back the evolutionary rearrangement scenarios that transformed their least common ancestor into the genomes that we observe today. This helps not only to reveal evolutionary relationships between organisms, but also provides a window for the study of genome rearrangements themselves. The central computational problem in this subfield of comparative genomics is that of finding optimal rearrangement scenarios transforming one genome into another. Historically all rearrangements were treated as being equally possible, and optimal scenarios were those that contained the minimum number of rearrangements. Recent advances in biology, however, allow us to devise much more sophisticated models. We present a short survey of the existing work on using biological constraints for genome rearrangements, and argue that a much more flexible approach is necessary to accompany the influx of newly available biological data. In this work we propose an extremely general framework for genome rearrangements with biological constraints. Our main contribution is a polynomial-time algorithm that, for an arbitrary cost function, finds a minimum cost scenario among those of minimum length. Along the way we establish a number of novel links between sorting genomes with double cut and join rearrangements, sorting graphs with 2-breaks or edge swaps, sorting permutations with mathematical transpositions, sorting strings with interchanges, and token swapping on graphs.
A genome rearrangement is a mutation that modifies the structure of the chromosomes in a genome, or even their number. Besides chromosome fusions and fissions, these rearrangements include deletions, insertions, and inversions of chromosomal segments. Two ends of different chromosomes can also be exchanged during a translocation. Together, these mutations constitute an evolutionary rearrangement scenario between species. We were interested in reconstructing rearrangement scenarios between animal species. Our project combines mathematical and algorithmic tools with the current biological understanding of genome rearrangements. From a biological point of view, our objective is to link genetics and epigenetics to rearrangements in both directions: (1) we develop a methodology for studying genetic and epigenetic features associated with rearrangements, and (2) conversely, for finding rearrangement scenarios guided by such genetic and epigenetic features. The main contribution of this thesis is the following. We present a framework built on the double cut and join rearrangement model with arbitrary weights. In this framework, a minimum-weight scenario can be found in polynomial time among the scenarios of minimum length for two genomes with identical gene content and no duplicates. In addition, we establish a number of new correspondences between various sorting problems. These problems include sorting genomes with Double Cut and Join rearrangements, sorting graphs with 2-breaks or edge swaps, sorting permutations with transpositions, sorting strings with interchanges, and token swapping on graphs.

Thèses en Ligne

Theses.fr

Finding local genome rearrangements

Author: Simonaitis Pijus
Swenson Krister
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

The double cut and join (DCJ) model of genome rearrangement is well studied due to its mathematical simplicity and power to account for the many events that transform gene order. These studies have mostly been devoted to the understanding of minimum length scenarios transforming one genome into another. In this paper we search instead for rearrangement scenarios that minimize the number of rearrangements whose breakpoints are unlikely due to some biological criteria. One such criterion has recently become accessible due to the advent of the Hi-C experiment, facilitating the study of 3D spatial distance between breakpoint regions.

HAL-ENS-LYON

Directory of Open Access Journals

INRIA a CCSD electronic archive server

Dagstuhl Research Online Publication Server

A General Framework for Genome Rearrangement with Biological Constraints

Author: Chateau Annie
Simonaitis Pijus
Swenson Krister
Publication venue: Springer Science and Business Media LLC
Publication date: 09/10/2018

This paper generalizes previous studies on genome rearrangement under biological constraints, using double cut and join (DCJ). We propose a model for weighted DCJ, along with a family of optimization problems called ϕ-MCPS (Minimum Cost Parsimonious Scenario), that are based on labeled graphs. We show how to compute solutions to general instances of ϕ-MCPS, given an algorithm to compute ϕ-MCPS on a circular genome with exactly one occurrence of each gene. These general instances can have an arbitrary number of circular and linear chromosomes, and arbitrary gene content. The practicality of the framework is displayed by presenting polynomial-time algorithms that generalize the results of Bulteau, Fertin, and Tannier on the Sorting by wDCJs and Indels in Intergenes problem, and that generalize previous results on the Minimum Local Parsimonious Scenario problem.

HAL-ENS-LYON

Models and algorithms for genome rearrangement with positional constraints

Author: Blanchette Mathieu
Simonaitis Pijus
Swenson Krister
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Background. Traditionally, the merit of a rearrangement scenario between two gene orders has been measured based on a parsimony criterion alone; two scenarios with the same number of rearrangements are considered equally good. In this paper, we acknowledge that each rearrangement has a certain likelihood of occurring based on biological constraints, e.g. physical proximity of the DNA segments implicated or repetitive sequences. Results. We propose optimization problems with the objective of maximizing overall likelihood, by weighting the rearrangements. We study a binary weight function suitable to the representation of sets of genome positions that are most likely to have swapped adjacencies. We give a polynomial-time algorithm for the problem of finding a minimum weight double cut and join scenario among all minimum length scenarios. In the process we solve an optimization problem on colored noncrossing partitions, which is a generalization of the Maximum Independent Set problem on circle graphs. Conclusions. We introduce a model for weighting genome rearrangements and show that under simple yet reasonable conditions, a fundamental distance can be computed in polynomial time. This is achieved by solving a generalization of the Maximum Independent Set problem on circle graphs. Several variants of the problem are also mentioned.

HAL-ENS-LYON

Crossref

Springer - Publisher Connector

INRIA a CCSD electronic archive server

PubMed Central

Rearrangement Scenarios Guided by Chromatin Structure

Author: Pulicani Sylvain
Rivals Eric
Simonaitis Pijus
Swenson Krister
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

Genome architecture can be drastically modified through a succession of large-scale rearrangements. In the quest to infer these rearrangement scenarios, it is often the case that the parsimony principle alone does not impose enough constraints. In this paper we make an initial effort towards computing scenarios that respect chromosome conformation, by using Hi-C data to guide our computations. We confirm the validity of a model, along with the optimization problems Minimum Local Scenario and Minimum Local Parsimonious Scenario, developed in previous work that is based on a partition into equivalence classes of the adjacencies between syntenic blocks. To accomplish this we show that the quality of a clustering of the adjacencies based on Hi-C data is directly correlated to the quality of a rearrangement scenario that we compute between Drosophila melanogaster and Drosophila yakuba. We evaluate a simple greedy strategy to choose the next rearrangement based on Hi-C, and motivate the study of the solution space of Minimum Local Parsimonious Scenario.

HAL-ENS-LYON

Crossref

INRIA a CCSD electronic archive server

Co-evolution of AR gene copy number and structural complexity in endocrine therapy resistant prostate cancer

Author: Corey Eva
Dehm Scott M.
Feng Felix Y.
Henzler Christine
Knutson Todd P.
Li Yingming
Lynch Molly
Miller Jeffrey T.
Morrissey Colm
Munro Sarah A.
Oseth LeAnn
Passow Courtney N.
Raphael Benjamin J.
Simonaitis Pijus
Wikström Pernilla
Zhao Shuang G.
Zivanovic Andrej
Publication date: 01/01/2023

Androgen receptor (AR) inhibition is standard of care for advanced prostate cancer (PC). However, efficacy is limited by progression to castration-resistant PC (CRPC), usually due to AR re-activation via mechanisms that include AR amplification and structural rearrangement. These two classes of AR alterations often co-occur in CRPC tumors, but it is unclear whether this reflects intercellular or intracellular heterogeneity of AR. Resolving this is important for developing new therapies and predictive biomarkers. Here, we analyzed 41 CRPC tumors and 6 patient-derived xenografts (PDXs) using linked-read DNA-sequencing, and identified 7 tumors that developed complex, multiply-rearranged AR gene structures in conjunction with very high AR copy number. Analysis of PDX models by optical genome mapping and fluorescence in situ hybridization showed that AR residing on extrachromosomal DNA (ecDNA) was an underlying mechanism, and was associated with elevated levels and diversity of AR expression. This study identifies co-evolution of AR gene copy number and structural complexity via ecDNA as a mechanism associated with endocrine therapy resistance

Publikationer från Umeå universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line