Search CORE

557 research outputs found

Classification of Dust Days by Satellite Remotely Sensed Aerosol Products

Author: Broday D. M.
Cohen A.
Levy Robert C.
Sorek-Hammer M.
Ziv B.
Publication venue
Publication date
Field of study

Considerable progress in satellite remote sensing (SRS) of dust particles has been seen in the last decade. From an environmental health perspective, such an event detection, after linking it to ground particulate matter (PM) concentrations, can proxy acute exposure to respirable particles of certain properties (i.e. size, composition, and toxicity). Being affected considerably by atmospheric dust, previous studies in the Eastern Mediterranean, and in Israel in particular, have focused on mechanistic and synoptic prediction, classification, and characterization of dust events. In particular, a scheme for identifying dust days (DD) in Israel based on ground PM10 (particulate matter of size smaller than 10 nm) measurements has been suggested, which has been validated by compositional analysis. This scheme requires information regarding ground PM10 levels, which is naturally limited in places with sparse ground-monitoring coverage. In such cases, SRS may be an efficient and cost-effective alternative to ground measurements. This work demonstrates a new model for identifying DD and non-DD (NDD) over Israel based on an integration of aerosol products from different satellite platforms (Moderate Resolution Imaging Spectroradiometer (MODIS) and Ozone Monitoring Instrument (OMI)). Analysis of ground-monitoring data from 2007 to 2008 in southern Israel revealed 67 DD, with more than 88 percent occurring during winter and spring. A Classification and Regression Tree (CART) model that was applied to a database containing ground monitoring (the dependent variable) and SRS aerosol product (the independent variables) records revealed an optimal set of binary variables for the identification of DD. These variables are combinations of the following primary variables: the calendar month, ground-level relative humidity (RH), the aerosol optical depth (AOD) from MODIS, and the aerosol absorbing index (AAI) from OMI. A logistic regression that uses these variables, coded as binary variables, demonstrated 93.2 percent correct classifications of DD and NDD. Evaluation of the combined CART-logistic regression scheme in an adjacent geographical region (Gush Dan) demonstrated good results. Using SRS aerosol products for DD and NDD, identification may enable us to distinguish between health, ecological, and environmental effects that result from exposure to these distinct particle populations

NASA Technical Reports Server

Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

Author: A Marchler-Bauer
A Mortazavi
A Nekrutenko
A Rambaut
A Resch
AA Patel
AB Carvalho
AE Vinogradov
AG Clark
AM McGuire
AN Ladd
AR Hatton
AS Chang
AV Alekseyenko
AV Philips
B Budagyan
B Modrek
B Modrek
B Prud'homme
BR Graveley
BR Graveley
BR Graveley
C Lee
C Notredame
CI Castillo-Davis
CL Fitzpatrick
CS Thummel
D Babushok
D Baek
D Gatfield
D Monroe
D Ortíz-Barrientos
DA Petrov
DA Petrov
DG Gilbert
DI Nurminsky
DJ Kenan
DL Black
DL Swofford
E Betran
E Kim
E Kim
E Wagner
EA Glazov
F-C Chen
FA Kondrashov
G Ast
G Lev-Maor
G Marais
GD Schuler
H Itoh
IA Swinburne
International Human Genome Sequencing Consortium
J Delaunay
J Felsenstein
J Rozas
JM Burnette
JM Comeron
JM Johnson
K Tamura
KL Fox-Walsh
KM Neugebauer
LF Lareau
M Clamp
M Guo
M Labrador
M Lynch
M Roy
M Talerico
MA Noor
MD Adams
MD Adams
MF Wilkinson
Mohamed AF Noor
MT Levine
Nikolai P Kandul
NJ Proudfoot
NM Kopelman
P Haddrill
PA Sharp
PJ Good
PM O'Grady
Q Pan
R Sorek
R Sorek
R Sorek
R Sorek
R Sorek
S Karlin
S Misra
S Richards
S-T Chen
SF Altschul
SM Berget
TE Royce
V Stolc
W Wang
WG Hill
Y Xing
Y Xing
Z Kan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Caltech Authors

Characteristics of transposable element exonization within human and mouse

Author: A Athanasiadis
A Corvelo
A Gerber
A Goren
A Levy
A Magen
A Nekrutenko
A Resch
AFA Smit
Agnes Hotz-Wagenblatt
B Giardine
B Mersch
BR Graveley
Britta Mersch
C Liu
D Karolchik
D Labuda
DD Kim
E Kim
ES Lander
EY Levanon
G Ast
G Lev-Maor
G Lev-Maor
Gil Ast
H Xie
Ilya Ruvinsky
J Hull
J Jurka
J Jurka
JO Kriegs
JO Yang
JP Nemes
K Nakabayashi
KP Kister
L Lin
L Lin
M Amit
M Blow
M Krull
M Moller-Krull
M Roy
M Sironi
M Sironi
MA Batzer
MD Koob
N Gal-Mark
N Gal-Mark
N Sela
NH Gehring
Noa Sela
O Ram
P Deininger
PL Deininger
R Cordaux
R Sorek
R Sorek
R Sorek
RA Gibbs
RE Mills
RH Waterston
RM Kuhn
RT Hillman
S He
S Schwartz
SK Ng
SS Singer
ST Sherry
T Kwan
T Kwan
VV Kapitonov
W Makalowski
WJ Kent
WL Chen
WS Lo
XH Zhang
Y Xing
YF Chang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2010
Field of study

Insertion of transposed elements within mammalian genes is thought to be an important contributor to mammalian evolution and speciation. Insertion of transposed elements into introns can lead to their activation as alternatively spliced cassette exons, an event called exonization. Elucidation of the evolutionary constraints that have shaped fixation of transposed elements within human and mouse protein coding genes and subsequent exonization is important for understanding of how the exonization process has affected transcriptome and proteome complexities. Here we show that exonization of transposed elements is biased towards the beginning of the coding sequence in both human and mouse genes. Analysis of single nucleotide polymorphisms (SNPs) revealed that exonization of transposed elements can be population-specific, implying that exonizations may enhance divergence and lead to speciation. SNP density analysis revealed differences between Alu and other transposed elements. Finally, we identified cases of primate-specific Alu elements that depend on RNA editing for their exonization. These results shed light on TE fixation and the exonization process within human and mouse genes.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Origins, Evolution, and Functional Potential of Alternative Splicing in Vertebrates

Author: A. Frankish
A. Reymond
Altschul
Amit
Boguski
C. Howald
Cazalla
Cheah
Chen
Graveley
Hansen
J. Fernandez-Banet
J. Harrow
J. M. Mudge
Jekosch
Jurka
Kan
Kim
Kim
Koren
Langmead
Lareau
Lu
McGuire
Mendell
Modrek
Mortazavi
Nilsen
Ohoka
Pan
Pickrell
R. Guigo
Schwartz
Sela
Sela
Simpson
Slater
Sorek
Sorek
Sorek
Sprague
Stolc
Sureau
T. Alioto
T. Derrien
T. Hubbard
Wang
Wang
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

Alternative splicing (AS) has the potential to greatly expand the functional repertoire of mammalian transcriptomes. However, few variant transcripts have been characterized functionally, making it difficult to assess the contribution of AS to the generation of phenotypic complexity and to study the evolution of splicing patterns. We have compared the AS of 309 protein-coding genes in the human ENCODE pilot regions against their mouse orthologs in unprecedented detail, utilizing traditional transcriptomic and RNAseq data. The conservation status of every transcript has been investigated, and each functionally categorized as coding (separated into coding sequence [CDS] or nonsense-mediated decay [NMD] linked) or noncoding. In total, 36.7% of human and 19.3% of mouse coding transcripts are species specific, and we observe a 3.6 times excess of human NMD transcripts compared with mouse; in contrast to previous studies, the majority of species-specific AS is unlinked to transposable elements. We observe one conserved CDS variant and one conserved NMD variant per 2.3 and 11.4 genes, respectively. Subsequently, we identify and characterize equivalent AS patterns for 22.9% of these CDS or NMD-linked events in nonmammalian vertebrate genomes, and our data indicate that functional NMD-linked AS is more widespread and ancient than previously thought. Furthermore, although we observe an association between conserved AS and elevated sequence conservation, as previously reported, we emphasize that 30% of conserved AS exons display sequence conservation below the average score for constitutive exons. In conclusion, we demonstrate the value of detailed comparative annotation in generating a comprehensive set of AS transcripts, increasing our understanding of AS evolution in vertebrates. Our data supports a model whereby the acquisition of functional AS has occurred throughout vertebrate evolution and is considered alongside amino acid change as a key mechanism in gene evolution

CiteSeerX

Crossref

Serveur académique lausannois

PubMed Central

UPF Digital Repository

King's Research Portal

Deep Transfer Learning on Satellite Imagery Improves Air Quality Estimates in Developing Nations

Author: Arku R
Asanjan AA
Brauer M
Ezzati M
Ganguly AR
Lingenfelter V
Oza N
Pohle MV
Sahasrabhojanee A
Sorek-Hamer M
Suel E
Yadav N
Publication venue: ArXiv
Publication date: 17/02/2022
Field of study

Urban air pollution is a public health challenge in low- and middle-income countries (LMICs). However, LMICs lack adequate air quality (AQ) monitoring infrastructure. A persistent challenge has been our inability to estimate AQ accurately in LMIC cities, which hinders emergency preparedness and risk mitigation. Deep learning-based models that map satellite imagery to AQ can be built for high-income countries (HICs) with adequate ground data. Here we demonstrate that a scalable approach that adapts deep transfer learning on satellite imagery for AQ can extract meaningful estimates and insights in LMIC cities based on spatiotemporal patterns learned in HIC cities. The approach is demonstrated for Accra in Ghana, Africa, with AQ patterns learned from two US cities, specifically Los Angeles and New York

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

Simultaneous identification of long similar substrings in large sets of sequences

Author: A Lefebvre
Burghardt Wittig
E Check
Friedrich Möller
J Kleffe
Jürgen Kleffe
M Abouelhoda
M Hiller
M Höhl
PE Warburton
R Sorek
S Burkhardt
S Kurtz
S Kurtz
S Mielordt
T Hamborg
W Kent
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Sequence comparison faces new challenges today, with many complete genomes and large libraries of transcripts known. Gene annotation pipelines match these sequences in order to identify genes and their alternative splice forms. However, the software currently available cannot simultaneously compare sets of sequences as large as necessary especially if errors must be considered. Results We therefore present a new algorithm for the identification of almost perfectly matching substrings in very large sets of sequences. Its implementation, called ClustDB, is considerably faster and can handle 16 times more data than VMATCH, the most memory efficient exact program known today. ClustDB simultaneously generates large sets of exactly matching substrings of a given minimum length as seeds for a novel method of match extension with errors. It generates alignments of maximum length with a considered maximum number of errors within each overlapping window of a given size. Such alignments are not optimal in the usual sense but faster to calculate and often more appropriate than traditional alignments for genomic sequence comparisons, EST and full-length cDNA matching, and genomic sequence assembly. The method is used to check the overlaps and to reveal possible assembly errors for 1377 <it>Medicago truncatula </it>BAC-size sequences published at <url>http://www.medicago.org/genome/assembly_table.php?chr=1</url>. Conclusion The program ClustDB proves that window alignment is an efficient way to find long sequence sections of homogenous alignment quality, as expected in case of random errors, and to detect systematic errors resulting from sequence contaminations. Such inserts are systematically overlooked in long alignments controlled by only tuning penalties for mismatches and gaps. ClustDB is freely available for academic use.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

How the other half lives: CRISPR-Cas's influence on bacteriophages

Author: AE Briner
AF Andersson
BR Levin
C Pourcel
CL Sun
D Bhaya
E Pennisi
E Scarpellini
EV Koonin
FS Berezovskaya
GW Tyson
H Deveau
H Deveau
I Yosef
IN Berezovsky
J He
J Iranzo
J Mahony
JF Heidelberg
JM Peters
JO Haerter
KD Seed
KL Maxwell
LA Marraffini
LM Childs
M Sebaihia
P Han
P Horvath
P Horvath
PD Hsu
R Barrangou
R Lillestøl
R Sorek
S Al-Attar
S Minot
W Jiang
Publication venue
Publication date: 24/11/2017
Field of study

CRISPR-Cas is a genetic adaptive immune system unique to prokaryotic cells used to combat phage and plasmid threats. The host cell adapts by incorporating DNA sequences from invading phages or plasmids into its CRISPR locus as spacers. These spacers are expressed as mobile surveillance RNAs that direct CRISPR-associated (Cas) proteins to protect against subsequent attack by the same phages or plasmids. The threat from mobile genetic elements inevitably shapes the CRISPR loci of archaea and bacteria, and simultaneously the CRISPR-Cas immune system drives evolution of these invaders. Here we highlight our recent work, as well as that of others, that seeks to understand phage mechanisms of CRISPR-Cas evasion and conditions for population coexistence of phages with CRISPR-protected prokaryotes.Comment: 24 pages, 8 figure

arXiv.org e-Print Archive

Crossref

A phylogenetic generalized hidden Markov model for predicting alternatively spliced exons

Author: A Siepel
AA Mironov
B Modrek
B Modrek
BJ Haas
CW Sugnet
D Boffelli
D Brett
DL Black
DL Philipps
G Dror
G Rätsch
GW Yeo
H Nagasqaki
I Korf
J Felsenstein
JD McAuliffe
JE Allen
JM Johnson
Jonathan E Allen
JS Pedersen
L Cartegni
L Croft
M Alexandersson
M Hasegawa
M Hiller
M Hiller
P Carninci
Q Xu
R Sorek
R Sorek
RA Drysdale
RC Edgar
SL Cawley
SS Gross
Steven L Salzberg
T Maniatis
U Ohler
Z Kan
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: An important challenge in eukaryotic gene prediction is accurate identification of alternatively spliced exons. Functional transcripts can go undetected in gene expression studies when alternative splicing only occurs under specific biological conditions. Non-expression based computational methods support identification of rarely expressed transcripts. RESULTS: A non-expression based statistical method is presented to annotate alternatively spliced exons using a single genome sequence and evidence from cross-species sequence conservation. The computational method is implemented in the program ExAlt and an analysis of prediction accuracy is given for Drosophila melanogaster. CONCLUSION: ExAlt identifies the structure of most alternatively spliced exons in the test set and cross-species sequence conservation is shown to improve the precision of predictions. The software package is available to run on Drosophila genomes to search for new cases of alternative splicing

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital Repository at the University of Maryland

Alu-Alu Recombination Underlying the First Large Genomic Deletion in GlcNAc-Phosphotransferase Alpha/Beta (GNPTAB) Gene in a MLII Alpha/Beta Patient

Author: A Raas-Rothschild
B Knebelmann
B Pérez
B Tappino
B Tappino
C Has
D Meili
G Franke
G Lev-Maor
G Yeo
GA Mitchell
J Tazi
M Encarnação
M Wood
M Zarghooni
PL Deininger
R Quental
R Sorek
R Vervoort
S Kornfeld
S Quental
S Tiede
T Otomo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/10/2011
Field of study

Mucolipidosis type II α/β is a severe, autosomal recessive lysosomal storage disorder, caused by a defect in the GNPTAB gene that codes for the α/β subunits of the GlcNAc-phosphotransferase. To date, over 100 different mutations have been identified in MLII α/β patients, but no large deletions have been reported. Here we present the first case of a large homozygous intragenic GNPTAB gene deletion (c.3435-386_3602 + 343del897) encompassing exon 19, identified in a ML II α/β patient. Long-range PCR and sequencing methodologies were used to refine the characterization of this rearrangement, leading to the identification of a 21 bp repetitive motif in introns 18 and 19. Further analysis revealed that both the 5' and 3' breakpoints were located within highly homologous Alu elements (Alu-Sz in intron 18 and Alu-Sq2, in intron 19), suggesting that this deletion has probably resulted from Alu-Alu unequal homologous recombination. RT-PCR methods were used to further evaluate the consequences of the alteration for the processing of the mutant pre mRNA GNPTAB, revealing the production of three abnormal transcripts: one without exon 19 (p.Lys1146_Trp1201del); another with an additional loss of exon 20 (p.Arg1145Serfs*2), and a third in which exon 19 was substituted by a pseudoexon inclusion consisting of a 62 bp fragment from intron 18 (p.Arg1145Serfs*16). Interestingly, this 62 bp fragment corresponds to the Alu-Sz element integrated in intron 18.This represents the first description of a large deletion identified in the GNPTAB gene and contributes to enrich the knowledge on the molecular mechanisms underlying causative mutations in ML II.This work was supported by FCT - project PIC/IC/83252/2007 (http://alfa.fct.mctes.pt/). Coutinho MF and Quental S received grants from the FCT (SFRH/BD/48103/2008; SFRH/BPD/64025/2009)

Crossref

PubMed Central

Copenhagen University Research Information System

Repositório Científico do Instituto Nacional de Saúde

Alu Exonization Events Reveal Features Required for Precise Recognition of Exons by the Splicing Machinery

Author: A Corvelo
A Goren
A Grover
A Levy
AF Muro
AJ Mighell
B Clouet d'Orval
C-C Chang
D Libri
D Solnick
DL Black
E Buratti
E Buratti
E Dimitriadou
E Kim
Eddo Kim
FU Nasim
G Ast
G Dreyfuss
G Dror
G Lev-Maor
Gil Ast
IL Hofacker
IL Hofacker
IM Meyer
J Jurka
J Kralovicova
J Wang
JB Kruskal
JO Kriegs
KL Fox-Walsh
L Cartegni
L Katz
LP Eperon
M Blanchette
M Hiller
M Hiller
M Krull
M Roy
MB Shapiro
ML Hastings
N Gal-Mark
N Sela
Nir Kfir
NN Singh
Nurit Gal-Mark
O Ram
PJ Shepard
R Sorek
R Sorek
R Sorek
Ram Oren
RF Roscigno
Roderic Guigó
S Jacquenet
S Washietl
Schraga Schwartz
SH Nagaraj
SH Schwartz
SM Berget
T Sing
WG Fairbrother
X Roca
XH Zhang
XH Zhang
XH Zhang
Y Xing
Z Wang
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Despite decades of research, the question of how the mRNA splicing machinery precisely identifies short exonic islands within the vast intronic oceans remains to a large extent obscure. In this study, we analyzed Alu exonization events, aiming to understand the requirements for correct selection of exons. Comparison of exonizing Alus to their non-exonizing counterparts is informative because Alus in these two groups have retained high sequence similarity but are perceived differently by the splicing machinery. We identified and characterized numerous features used by the splicing machinery to discriminate between Alu exons and their non-exonizing counterparts. Of these, the most novel is secondary structure: Alu exons in general and their 5′ splice sites (5′ss) in particular are characterized by decreased stability of local secondary structures with respect to their non-exonizing counterparts. We detected numerous further differences between Alu exons and their non-exonizing counterparts, among others in terms of exon–intron architecture and strength of splicing signals, enhancers, and silencers. Support vector machine analysis revealed that these features allow a high level of discrimination (AUC = 0.91) between exonizing and non-exonizing Alus. Moreover, the computationally derived probabilities of exonization significantly correlated with the biological inclusion level of the Alu exons, and the model could also be extended to general datasets of constitutive and alternative exons. This indicates that the features detected and explored in this study provide the basis not only for precise exon selection but also for the fine-tuned regulation thereof, manifested in cases of alternative splicing

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central