Search CORE

514 research outputs found

The MATCHIT automaton : exploiting compartmentalization for the synthesis of branched polymers

Author: Fellermann Harold
Füchslin Rudolf Marcel
Hadorn Maik
Lancet Doron
Rasmussen Steen
Sorek Daniel
Weyland Mathias S.
Publication venue: Hindawi
Publication date: 01/01/2013
Field of study

We propose an automaton, a theoretical framework that demonstrates how to improve the yield of the synthesis of branched chemical polymer reactions. This is achieved by separating substeps of the path of synthesis into compartments. We use chemical containers (chemtainers) to carry the substances through a sequence of fixed successive compartments. We describe the automaton in mathematical terms and show how it can be configured automatically in order to synthesize a given branched polymer target. The algorithm we present finds an optimal path of synthesis in linear time. We discuss how the automaton models compartmentalized structures found in cells, such as the endoplasmic reticulum and the Golgi apparatus, and we show how this compartmentalization can be exploited for the synthesis of branched polymers such as oligosaccharides. Lastly, we show examples of artificial branched polymers and discuss how the automaton can be configured to synthesize them with maximal yield

ZHAW digitalcollection

Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

Author: A Marchler-Bauer
A Mortazavi
A Nekrutenko
A Rambaut
A Resch
AA Patel
AB Carvalho
AE Vinogradov
AG Clark
AM McGuire
AN Ladd
AR Hatton
AS Chang
AV Alekseyenko
AV Philips
B Budagyan
B Modrek
B Modrek
B Prud'homme
BR Graveley
BR Graveley
BR Graveley
C Lee
C Notredame
CI Castillo-Davis
CL Fitzpatrick
CS Thummel
D Babushok
D Baek
D Gatfield
D Monroe
D Ortíz-Barrientos
DA Petrov
DA Petrov
DG Gilbert
DI Nurminsky
DJ Kenan
DL Black
DL Swofford
E Betran
E Kim
E Kim
E Wagner
EA Glazov
F-C Chen
FA Kondrashov
G Ast
G Lev-Maor
G Marais
GD Schuler
H Itoh
IA Swinburne
International Human Genome Sequencing Consortium
J Delaunay
J Felsenstein
J Rozas
JM Burnette
JM Comeron
JM Johnson
K Tamura
KL Fox-Walsh
KM Neugebauer
LF Lareau
M Clamp
M Guo
M Labrador
M Lynch
M Roy
M Talerico
MA Noor
MD Adams
MD Adams
MF Wilkinson
Mohamed AF Noor
MT Levine
Nikolai P Kandul
NJ Proudfoot
NM Kopelman
P Haddrill
PA Sharp
PJ Good
PM O'Grady
Q Pan
R Sorek
R Sorek
R Sorek
R Sorek
R Sorek
S Karlin
S Misra
S Richards
S-T Chen
SF Altschul
SM Berget
TE Royce
V Stolc
W Wang
WG Hill
Y Xing
Y Xing
Z Kan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Caltech Authors

Characteristics of transposable element exonization within human and mouse

Author: A Athanasiadis
A Corvelo
A Gerber
A Goren
A Levy
A Magen
A Nekrutenko
A Resch
AFA Smit
Agnes Hotz-Wagenblatt
B Giardine
B Mersch
BR Graveley
Britta Mersch
C Liu
D Karolchik
D Labuda
DD Kim
E Kim
ES Lander
EY Levanon
G Ast
G Lev-Maor
G Lev-Maor
Gil Ast
H Xie
Ilya Ruvinsky
J Hull
J Jurka
J Jurka
JO Kriegs
JO Yang
JP Nemes
K Nakabayashi
KP Kister
L Lin
L Lin
M Amit
M Blow
M Krull
M Moller-Krull
M Roy
M Sironi
M Sironi
MA Batzer
MD Koob
N Gal-Mark
N Gal-Mark
N Sela
NH Gehring
Noa Sela
O Ram
P Deininger
PL Deininger
R Cordaux
R Sorek
R Sorek
R Sorek
RA Gibbs
RE Mills
RH Waterston
RM Kuhn
RT Hillman
S He
S Schwartz
SK Ng
SS Singer
ST Sherry
T Kwan
T Kwan
VV Kapitonov
W Makalowski
WJ Kent
WL Chen
WS Lo
XH Zhang
Y Xing
YF Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2010
Field of study

Insertion of transposed elements within mammalian genes is thought to be an important contributor to mammalian evolution and speciation. Insertion of transposed elements into introns can lead to their activation as alternatively spliced cassette exons, an event called exonization. Elucidation of the evolutionary constraints that have shaped fixation of transposed elements within human and mouse protein coding genes and subsequent exonization is important for understanding of how the exonization process has affected transcriptome and proteome complexities. Here we show that exonization of transposed elements is biased towards the beginning of the coding sequence in both human and mouse genes. Analysis of single nucleotide polymorphisms (SNPs) revealed that exonization of transposed elements can be population-specific, implying that exonizations may enhance divergence and lead to speciation. SNP density analysis revealed differences between Alu and other transposed elements. Finally, we identified cases of primate-specific Alu elements that depend on RNA editing for their exonization. These results shed light on TE fixation and the exonization process within human and mouse genes.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Simultaneous identification of long similar substrings in large sets of sequences

Author: A Lefebvre
Burghardt Wittig
E Check
Friedrich Möller
J Kleffe
Jürgen Kleffe
M Abouelhoda
M Hiller
M Höhl
PE Warburton
R Sorek
S Burkhardt
S Kurtz
S Kurtz
S Mielordt
T Hamborg
W Kent
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Sequence comparison faces new challenges today, with many complete genomes and large libraries of transcripts known. Gene annotation pipelines match these sequences in order to identify genes and their alternative splice forms. However, the software currently available cannot simultaneously compare sets of sequences as large as necessary especially if errors must be considered. Results We therefore present a new algorithm for the identification of almost perfectly matching substrings in very large sets of sequences. Its implementation, called ClustDB, is considerably faster and can handle 16 times more data than VMATCH, the most memory efficient exact program known today. ClustDB simultaneously generates large sets of exactly matching substrings of a given minimum length as seeds for a novel method of match extension with errors. It generates alignments of maximum length with a considered maximum number of errors within each overlapping window of a given size. Such alignments are not optimal in the usual sense but faster to calculate and often more appropriate than traditional alignments for genomic sequence comparisons, EST and full-length cDNA matching, and genomic sequence assembly. The method is used to check the overlaps and to reveal possible assembly errors for 1377 <it>Medicago truncatula </it>BAC-size sequences published at <url>http://www.medicago.org/genome/assembly_table.php?chr=1</url>. Conclusion The program ClustDB proves that window alignment is an efficient way to find long sequence sections of homogenous alignment quality, as expected in case of random errors, and to detect systematic errors resulting from sequence contaminations. Such inserts are systematically overlooked in long alignments controlled by only tuning penalties for mismatches and gaps. ClustDB is freely available for academic use.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

How the other half lives: CRISPR-Cas's influence on bacteriophages

Author: AE Briner
AF Andersson
BR Levin
C Pourcel
CL Sun
D Bhaya
E Pennisi
E Scarpellini
EV Koonin
FS Berezovskaya
GW Tyson
H Deveau
H Deveau
I Yosef
IN Berezovsky
J He
J Iranzo
J Mahony
JF Heidelberg
JM Peters
JO Haerter
KD Seed
KL Maxwell
LA Marraffini
LM Childs
M Sebaihia
P Han
P Horvath
P Horvath
PD Hsu
R Barrangou
R Lillestøl
R Sorek
S Al-Attar
S Minot
W Jiang
Publication venue
Publication date: 24/11/2017
Field of study

CRISPR-Cas is a genetic adaptive immune system unique to prokaryotic cells used to combat phage and plasmid threats. The host cell adapts by incorporating DNA sequences from invading phages or plasmids into its CRISPR locus as spacers. These spacers are expressed as mobile surveillance RNAs that direct CRISPR-associated (Cas) proteins to protect against subsequent attack by the same phages or plasmids. The threat from mobile genetic elements inevitably shapes the CRISPR loci of archaea and bacteria, and simultaneously the CRISPR-Cas immune system drives evolution of these invaders. Here we highlight our recent work, as well as that of others, that seeks to understand phage mechanisms of CRISPR-Cas evasion and conditions for population coexistence of phages with CRISPR-protected prokaryotes.Comment: 24 pages, 8 figure

arXiv.org e-Print Archive

Crossref

Alu-Alu Recombination Underlying the First Large Genomic Deletion in GlcNAc-Phosphotransferase Alpha/Beta (GNPTAB) Gene in a MLII Alpha/Beta Patient

Author: A Raas-Rothschild
B Knebelmann
B Pérez
B Tappino
B Tappino
C Has
D Meili
G Franke
G Lev-Maor
G Yeo
GA Mitchell
J Tazi
M Encarnação
M Wood
M Zarghooni
PL Deininger
R Quental
R Sorek
R Vervoort
S Kornfeld
S Quental
S Tiede
T Otomo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/10/2011
Field of study

Mucolipidosis type II α/β is a severe, autosomal recessive lysosomal storage disorder, caused by a defect in the GNPTAB gene that codes for the α/β subunits of the GlcNAc-phosphotransferase. To date, over 100 different mutations have been identified in MLII α/β patients, but no large deletions have been reported. Here we present the first case of a large homozygous intragenic GNPTAB gene deletion (c.3435-386_3602 + 343del897) encompassing exon 19, identified in a ML II α/β patient. Long-range PCR and sequencing methodologies were used to refine the characterization of this rearrangement, leading to the identification of a 21 bp repetitive motif in introns 18 and 19. Further analysis revealed that both the 5' and 3' breakpoints were located within highly homologous Alu elements (Alu-Sz in intron 18 and Alu-Sq2, in intron 19), suggesting that this deletion has probably resulted from Alu-Alu unequal homologous recombination. RT-PCR methods were used to further evaluate the consequences of the alteration for the processing of the mutant pre mRNA GNPTAB, revealing the production of three abnormal transcripts: one without exon 19 (p.Lys1146_Trp1201del); another with an additional loss of exon 20 (p.Arg1145Serfs*2), and a third in which exon 19 was substituted by a pseudoexon inclusion consisting of a 62 bp fragment from intron 18 (p.Arg1145Serfs*16). Interestingly, this 62 bp fragment corresponds to the Alu-Sz element integrated in intron 18.This represents the first description of a large deletion identified in the GNPTAB gene and contributes to enrich the knowledge on the molecular mechanisms underlying causative mutations in ML II.This work was supported by FCT - project PIC/IC/83252/2007 (http://alfa.fct.mctes.pt/). Coutinho MF and Quental S received grants from the FCT (SFRH/BD/48103/2008; SFRH/BPD/64025/2009)

Crossref

PubMed Central

Copenhagen University Research Information System

Repositório Científico do Instituto Nacional de Saúde

Phytoscreening and phytoextraction of heavy metals at Danish polluted sites using willow and poplar trees

Author: A Sorek
Arno Rein
BA Smith
C Reimann
D Hammer
DA Vroblesky
EM Sheehan
F Pietrini
G Gopalakrishnan
H Marschner
JG Burken
L Sachs
M Algreen
M Larsen
M Mleczek
M Zacchini
Mette Algreen
MJ McLaughlin
MWH Evangelou
R Djingova
R Hohl
S Rock
S Trapp
S Trapp
S Trapp
Skov og Naturstyrelsen
Skov og Naturstyrelsen
Stefan Trapp
TS Tuovinen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

The main purpose of this study was to determine typical concentrations of heavy metals (HM) in wood from willows and poplars, in order to test the feasibility of phytoscreening and phytoextraction of HM. Samples were taken from one strongly, one moderately, and one slightly polluted site and from three reference sites. Wood from both tree species had similar background concentrations at 0.5 mg kg(−1) for cadmium (Cd), 1.6 mg kg(−1) for copper (Cu), 0.3 mg kg(−1) for nickel (Ni), and 25 mg kg(−1) for zinc (Zn). Concentrations of chromium (Cr) and lead (Pb) were below or close to detection limit. Concentrations in wood from the highly polluted site were significantly elevated, compared to references, in particular for willow. The conclusion from these results is that tree coring could be used successfully to identify strongly heavy metal-polluted soil for Cd, Cu, Ni, Zn, and that willow trees were superior to poplars, except when screening for Ni. Phytoextraction of HMs was quantified from measured concentration in wood at the most polluted site. Extraction efficiencies were best for willows and Cd, but below 0.5 % over 10 years, and below 1 ‰ in 10 years for all other HMs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11356-013-2085-z) contains supplementary material, which is available to authorized users

Crossref

Springer - Publisher Connector

PubMed Central

Online Research Database In Technology

Enrichment analysis of Alu elements with different spatial chromatin proximity in the human genome

Author: A Antonaki
A Huda
A Nekrutenko
A Smallwood
AF Smit
AM Deaton
C Esnault
C Feschotte
CB Lowe
CT Ong
D Grover
D Grover
D Schmidt
D Xie
E Berezikov
E Lieberman-Aiden
E Wit de
E Yaffe
EP Nora
ES Lander
ES Lander
F Cui
G Bourque
G Kunarso
G Li
G Li
GA Maston
GJ Faulkner
GN Gallus
H Santos-Rosa
H Santos-Rosa
H Xie
HH Kazazian Jr
IK Jordan
J Banerji
J Dekker
J Dostie
J Jurka
J Jurka
J Ule
JA Yoder
JE Hambor
JF Brookfield
JF Brookfield
JM Chen
JR Dixon
JR Korenberg
K Ahn
K Kaer
KC Wang
L Lin
L Teng
M Hackenberg
M Simonis
M Weber
MA Batzer
MG Kidwell
MH Kagey
MJ Fullwood
MM Suzuki
ND Heintzman
NR Smalheiser
P Jin
P Medstrand
P Polak
R Cordaux
R Eskeland
R Lister
R Schneider
R Sorek
RD Hawkins
S Shen
S Winkler
SD Gillies
SL Oei
T Pastor
T Wicker
V Kapitonov
VJ Lynch
WD Gifford
Y Lu
Y Quentin
Y Quentin
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Transposable elements (TEs) have no longer been totally considered as “junk DNA” for quite a time since the continual discoveries of their multifunctional roles in eukaryote genomes. As one of the most important and abundant TEs that still active in human genome, Alu, a SINE family, has demonstrated its indispensable regulatory functions at sequence level, but its spatial roles are still unclear. Technologies based on 3C(chromosomeconformation capture) have revealed the mysterious three-dimensional structure of chromatin, and make it possible to study the distal chromatin interaction in the genome. To find the role TE playing in distal regulation in human genome, we compiled the new released Hi-C data, TE annotation, histone marker annotations, and the genome-wide methylation data to operate correlation analysis, and found that the density of Alu elements showed a strong positive correlation with the level of chromatin interactions (hESC: r=0.9, P<2.2×1016; IMR90 fibroblasts: r = 0.94, P < 2.2 × 1016) and also have a significant positive correlation withsomeremote functional DNA elements like enhancers and promoters (Enhancer: hESC: r=0.997, P=2.3×10−4; IMR90: r=0.934, P=2×10−2; Promoter: hESC: r = 0.995, P = 3.8 × 10−4; IMR90: r = 0.996, P = 3.2 × 10−4). Further investigation involving GC content and methylation status showed the GC content of Alu covered sequences shared a similar pattern with that of the overall sequence, suggesting that Alu elements also function as the GC nucleotide and CpG site provider. In all, our results suggest that the Alu elements may act as an alternative parameter to evaluate the Hi-C data, which is confirmed by the correlation analysis of Alu elements and histone markers. Moreover, the GC-rich Alu sequence can bring high GC content and methylation flexibility to the regions with more distal chromatin contact, regulating the transcription of tissue-specific genes

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

University of Bedfordshire Repository

The effects of multiple features of alternatively spliced exons on the K(A)/K(S )ratio test

Author: A Nekrutenko
A Nekrutenko
B Modrek
C Lee
DL Philipps
E Quevillon
EV Kriventseva
F Wen
FC Chen
FC Chen
Feng-Chi Chen
G Yeo
GW Yeo
J Wang
K Iida
L Cartegni
L Cartegni
LC Filip
LD Hurst
M Karnaugh
MS Cline
NJ Mulder
R Sorek
R Sorek
S Stamm
SM Berget
TA Thanaraj
Trees-Juen Chuang
U Ohler
WG Fairbrother
WG Fairbrother
WG Fairbrother
XH Zhang
XH Zhang
Y Xing
Y Xing
Y Xing
Y Xing
Z Wang
Z Yang
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: The evolution of alternatively spliced exons (ASEs) is of primary interest because these exons are suggested to be a major source of functional diversity of proteins. Many exon features have been suggested to affect the evolution of ASEs. However, previous studies have relied on the K(A)/K(S )ratio test without taking into consideration information sufficiency (i.e., exon length > 75 bp, cross-species divergence > 5%) of the studied exons, leading to potentially biased interpretations. Furthermore, which exon feature dominates the results of the K(A)/K(S )ratio test and whether multiple exon features have additive effects have remained unexplored. RESULTS: In this study, we collect two different datasets for analysis – the ASE dataset (which includes lineage-specific ASEs and conserved ASEs) and the ACE dataset (which includes only conserved ASEs). We first show that information sufficiency can significantly affect the interpretation of relationship between exons features and the K(A)/K(S )ratio test results. After discarding exons with insufficient information, we use a Boolean method to analyze the relationship between test results and four exon features (namely length, protein domain overlapping, inclusion level, and exonic splicing enhancer (ESE) frequency) for the ASE dataset. We demonstrate that length and protein domain overlapping are dominant factors, and they have similar impacts on test results of ASEs. In addition, despite the weak impacts of inclusion level and ESE motif frequency when considered individually, combination of these two factors still have minor additive effects on test results. However, the ACE dataset shows a slightly different result in that inclusion level has a marginally significant effect on test results. Lineage-specific ASEs may have contributed to the difference. Overall, in both ASEs and ACEs, protein domain overlapping is the most dominant exon feature while ESE frequency is the weakest one in affecting test results. CONCLUSION: The proposed method can easily find additive effects of individual or multiple factors on the K(A)/K(S )ratio test results of exons. Therefore, the system can analyze complex conditions in evolution where multiple features are involved. More factors can also be added into the system to extend the scope of evolutionary analysis of exons. In addition, our method may be useful when orthologous exons can not be found for the K(A)/K(S )ratio test

Crossref

Springer - Publisher Connector

National Health Research Institues

Directory of Open Access Journals

PubMed Central

Diverse Splicing Patterns of Exonized Alu Elements in Human Tissues

Author: A Gerber
A Goren
AG Clark
AL Price
Anne Tye
AR Muotri
AV Alekseyenko
AV Alekseyenko
B Mersch
B Modrek
B Moghadaszadeh
Beverly L. Davidson
BF Voight
BJ Blencowe
BR Graveley
CG Goodyer
CS Carlson
D Karolchik
ES Lander
ET Wang
F Tajima
G Lev-Maor
J Hasler
J Piriyapongsa
JA Calarco
JA Lee
James J. Cai
JC Fay
JL Kelley
JM Akey
K Kapur
L Eng
Lan Lin
M Kreitman
M Krull
M Przeworski
MW Pfaffl
N Gal-Mark
N Kim
N Petit
N Sela
P Andolfatto
P Yi
PC Sabeti
Peng Jiang
PJ Gardina
R Nielsen
R Sorek
R Sorek
R Sorek
RA Irizarry
RC Lewontin
RM Kuhn
S Biswas
S Olson
S Rozen
SB Hedges
Shihao Shen
SS Singer
T Kwan
TA Clark
V Gotea
W Makalowski
WE Johnson
WG Fairbrother
WG Fairbrother
XH Zhang
Y Xing
Y Xing
Y Xing
Yi Xing
Yoshihide Hayashizaki
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Exonization of Alu elements is a major mechanism for birth of new exons in primate genomes. Prior analyses of expressed sequence tags show that almost all Alu-derived exons are alternatively spliced, and the vast majority of these exons have low transcript inclusion levels. In this work, we provide genomic and experimental evidence for diverse splicing patterns of exonized Alu elements in human tissues. Using Exon array data of 330 Alu-derived exons in 11 human tissues and detailed RT-PCR analyses of 38 exons, we show that some Alu-derived exons are constitutively spliced in a broad range of human tissues, and some display strong tissue-specific switch in their transcript inclusion levels. Most of such exons are derived from ancient Alu elements in the genome. In SEPN1, mutations of which are linked to a form of congenital muscular dystrophy, the muscle-specific inclusion of an Alu-derived exon may be important for regulating SEPN1 activity in muscle. Realtime qPCR analysis of this SEPN1 exon in macaque and chimpanzee tissues indicates human-specific increase in its transcript inclusion level and muscle specificity after the divergence of humans and chimpanzees. Our results imply that some Alu exonization events may have acquired adaptive benefits during the evolution of primate transcriptomes

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central