13,540 research outputs found
The Parallelism Motifs of Genomic Data Analysis
Genomic data sets are growing dramatically as the cost of sequencing
continues to decline and small sequencing devices become available. Enormous
community databases store and share this data with the research community, but
some of these genomic data analysis problems require large scale computational
platforms to meet both the memory and computational requirements. These
applications differ from scientific simulations that dominate the workload on
high end parallel systems today and place different requirements on programming
support, software libraries, and parallel architectural design. For example,
they involve irregular communication patterns such as asynchronous updates to
shared data structures. We consider several problems in high performance
genomics analysis, including alignment, profiling, clustering, and assembly for
both single genomes and metagenomes. We identify some of the common
computational patterns or motifs that help inform parallelization strategies
and compare our motifs to some of the established lists, arguing that at least
two key patterns, sorting and hashing, are missing
Self-Assembly of 4-sided Fractals in the Two-handed Tile Assembly Model
We consider the self-assembly of fractals in one of the most well-studied
models of tile based self-assembling systems known as the Two-handed Tile
Assembly Model (2HAM). In particular, we focus our attention on a class of
fractals called discrete self-similar fractals (a class of fractals that
includes the discrete Sierpi\'nski carpet). We present a 2HAM system that
finitely self-assembles the discrete Sierpi\'nski carpet with scale factor 1.
Moreover, the 2HAM system that we give lends itself to being generalized and we
describe how this system can be modified to obtain a 2HAM system that finitely
self-assembles one of any fractal from an infinite set of fractals which we
call 4-sided fractals. The 2HAM systems we give in this paper are the first
examples of systems that finitely self-assemble discrete self-similar fractals
at scale factor 1 in a purely growth model of self-assembly. Finally, we show
that there exists a 3-sided fractal (which is not a tree fractal) that cannot
be finitely self-assembled by any 2HAM system
Active Self-Assembly of Algorithmic Shapes and Patterns in Polylogarithmic Time
We describe a computational model for studying the complexity of
self-assembled structures with active molecular components. Our model captures
notions of growth and movement ubiquitous in biological systems. The model is
inspired by biology's fantastic ability to assemble biomolecules that form
systems with complicated structure and dynamics, from molecular motors that
walk on rigid tracks and proteins that dynamically alter the structure of the
cell during mitosis, to embryonic development where large-scale complicated
organisms efficiently grow from a single cell. Using this active self-assembly
model, we show how to efficiently self-assemble shapes and patterns from simple
monomers. For example, we show how to grow a line of monomers in time and
number of monomer states that is merely logarithmic in the length of the line.
Our main results show how to grow arbitrary connected two-dimensional
geometric shapes and patterns in expected time that is polylogarithmic in the
size of the shape, plus roughly the time required to run a Turing machine
deciding whether or not a given pixel is in the shape. We do this while keeping
the number of monomer types logarithmic in shape size, plus those monomers
required by the Kolmogorov complexity of the shape or pattern. This work thus
highlights the efficiency advantages of active self-assembly over passive
self-assembly and motivates experimental effort to construct general-purpose
active molecular self-assembly systems
The Power of Duples (in Self-Assembly): It's Not So Hip To Be Square
In this paper we define the Dupled abstract Tile Assembly Model (DaTAM),
which is a slight extension to the abstract Tile Assembly Model (aTAM) that
allows for not only the standard square tiles, but also "duple" tiles which are
rectangles pre-formed by the joining of two square tiles. We show that the
addition of duples allows for powerful behaviors of self-assembling systems at
temperature 1, meaning systems which exclude the requirement of cooperative
binding by tiles (i.e., the requirement that a tile must be able to bind to at
least 2 tiles in an existing assembly if it is to attach). Cooperative binding
is conjectured to be required in the standard aTAM for Turing universal
computation and the efficient self-assembly of shapes, but we show that in the
DaTAM these behaviors can in fact be exhibited at temperature 1. We then show
that the DaTAM doesn't provide asymptotic improvements over the aTAM in its
ability to efficiently build thin rectangles. Finally, we present a series of
results which prove that the temperature-2 aTAM and temperature-1 DaTAM have
mutually exclusive powers. That is, each is able to self-assemble shapes that
the other can't, and each has systems which cannot be simulated by the other.
Beyond being of purely theoretical interest, these results have practical
motivation as duples have already proven to be useful in laboratory
implementations of DNA-based tiles
Size-Dependent Tile Self-Assembly: Constant-Height Rectangles and Stability
We introduce a new model of algorithmic tile self-assembly called
size-dependent assembly. In previous models, supertiles are stable when the
total strength of the bonds between any two halves exceeds some constant
temperature. In this model, this constant temperature requirement is replaced
by an nondecreasing temperature function that depends on the size of the smaller of the two halves. This
generalization allows supertiles to become unstable and break apart, and
captures the increased forces that large structures may place on the bonds
holding them together.
We demonstrate the power of this model in two ways. First, we give fixed tile
sets that assemble constant-height rectangles and squares of arbitrary input
size given an appropriate temperature function. Second, we prove that deciding
whether a supertile is stable is coNP-complete. Both results contrast with
known results for fixed temperature.Comment: In proceedings of ISAAC 201
Virtual Environment for Next Generation Sequencing Analysis
Next Generation Sequencing technology, on the one hand, allows a more accurate analysis, and, on the other hand, increases the amount of data to process. A new protocol for sequencing the messenger RNA in a cell, known as RNA- Seq, generates millions of short sequence fragments in a single run. These fragments, or reads, can be used to measure levels of gene expression and to identify novel splice variants of genes. The proposed solution is a distributed architecture consisting of a Grid Environment and a Virtual Grid Environment, in order to reduce processing time by making the system scalable and flexibl
- …