118 research outputs found

    Sorting by Block Moves

    Get PDF
    The research in this thesis is focused on the problem of Block Sorting, which has applications in Computational Biology and in Optical Character Recognition (OCR). A block in a permutation is a maximal sequence of consecutive elements that are also consecutive in the identity permutation. BLOCK SORTING is the process of transforming an arbitrary permutation to the identity permutation through a sequence of block moves. Given an arbitrary permutation π and an integer m, the Block Sorting Problem, or the problem of deciding whether the transformation can be accomplished in at most m block moves has been shown to be NP-hard. After being known to be 3-approximable for over a decade, block sorting has been researched extensively and now there are several 2-approximation algorithms for its solution. This work introduces new structures on a permutation, which are called runs and ordered pairs, and are used to develop two new approximation algorithms. Both the new algorithms are 2-approximation algorithms, yielding the approximation ratio equal to the current best. This work also includes an analysis of both the new algorithms showing they are 2-approximation algorithms

    Sorting permutations by cut-circularize-linearize-and-paste operations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genome rearrangements are studied on the basis of genome-wide analysis of gene orders and important in the evolution of species. In the last two decades, a variety of rearrangement operations, such as reversals, transpositions, block-interchanges, translocations, fusions and fissions, have been proposed to evaluate the differences between gene orders in two or more genomes. Usually, the computational studies of genome rearrangements are formulated as problems of sorting permutations by rearrangement operations.</p> <p>Result</p> <p>In this article, we study a sorting problem by cut-circularize-linearize-and-paste (CCLP) operations, which aims to find a minimum number of CCLP operations to sort a signed permutation representing a chromosome. The CCLP is a genome rearrangement operation that cuts a segment out of a chromosome, circularizes the segment into a temporary circle, linearizes the temporary circle as a linear segment, and possibly inverts the linearized segment and pastes it into the remaining chromosome. The CCLP operation can model many well-known rearrangements, such as reversals, transpositions and block-interchanges, and others not reported in the biological literature. In addition, it really occurs in the immune response of higher animals. To distinguish those CCLP operations from the reversal, we call them as non-reversal CCLP operations. In this study, we use permutation groups in algebra to design an <it>O</it>(<it>δn</it>) time algorithm for solving the weighted sorting problem by CCLP operations when the weight ratio between reversals and non-reversal CCLP operations is 1:2, where <it>n</it> is the number of genes in the given chromosome and <it>δ</it> is the number of needed CCLP operations.</p> <p>Conclusion</p> <p>The algorithm we propose in this study is very simple so that it can be easily implemented with 1-dimensional arrays and useful in the studies of phylogenetic tree reconstruction and human immune response to tumors.</p

    On the distribution of the number of cycles in the breakpoint graph of a random signed permutation

    Get PDF
    International audienceWe use the finite Markov chain embedding technique to obtain the distribution of the number of cycles in the breakpoint graph of a random uniform signed permutation. This further gives a very good approximation of the distribution of the reversal distance between two random genomes

    Incidencia de los estudios sobre reordenamiento genómico en la secuenciación del genoma humano

    Get PDF
    En este artículo de revisión se muestran los principales aportes de la literatura científica relacionados con el problema SBPR (Sorting Permutations By Prefix Reversals, en español, Ordenamiento de permutaciones con reversión de prefijos) realizados en los últimos 47 años que han servido como base en la secuenciación completa del genoma humano. De hecho, este estudio tiene como propósito describir los principales antecedentes del problema desde sus orígenes hasta su aplicación final en la secuenciación del genoma humano. La metodología utilizada está basada en la revisión documental, la cual permitió construir una matriz y un grafo, en donde se resumen todas las interconexiones bibliográficas posibles. Sin embargo, los principales hallazgos demuestran que los años 90 fueron claves para desarrollar una teoría sólida en cuanto a construcción y verificación en lo que refiere a algoritmos. Finalmente, se brindan las conclusiones y perspectivas a futuro de los principales resultados obtenidos.
    corecore