Search CORE

35 research outputs found

Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

Author: A Marchler-Bauer
A Mortazavi
A Nekrutenko
A Rambaut
A Resch
AA Patel
AB Carvalho
AE Vinogradov
AG Clark
AM McGuire
AN Ladd
AR Hatton
AS Chang
AV Alekseyenko
AV Philips
B Budagyan
B Modrek
B Modrek
B Prud'homme
BR Graveley
BR Graveley
BR Graveley
C Lee
C Notredame
CI Castillo-Davis
CL Fitzpatrick
CS Thummel
D Babushok
D Baek
D Gatfield
D Monroe
D Ortíz-Barrientos
DA Petrov
DA Petrov
DG Gilbert
DI Nurminsky
DJ Kenan
DL Black
DL Swofford
E Betran
E Kim
E Kim
E Wagner
EA Glazov
F-C Chen
FA Kondrashov
G Ast
G Lev-Maor
G Marais
GD Schuler
H Itoh
IA Swinburne
International Human Genome Sequencing Consortium
J Delaunay
J Felsenstein
J Rozas
JM Burnette
JM Comeron
JM Johnson
K Tamura
KL Fox-Walsh
KM Neugebauer
LF Lareau
M Clamp
M Guo
M Labrador
M Lynch
M Roy
M Talerico
MA Noor
MD Adams
MD Adams
MF Wilkinson
Mohamed AF Noor
MT Levine
Nikolai P Kandul
NJ Proudfoot
NM Kopelman
P Haddrill
PA Sharp
PJ Good
PM O'Grady
Q Pan
R Sorek
R Sorek
R Sorek
R Sorek
R Sorek
S Karlin
S Misra
S Richards
S-T Chen
SF Altschul
SM Berget
TE Royce
V Stolc
W Wang
WG Hill
Y Xing
Y Xing
Z Kan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Caltech Authors

The clinical potential of antiangiogenic fragments of extracellular matrix proteins

Author: A Abdollahi
A Abdollahi
A R Clamp
A Sudhakar
A-K Olsson
AG Marneros
AL Feldman
BG Hudson
DW Davis
DWJ Van der Schaft
EI Cline
F Haviv
F Pezzella
G Bix
G C Jayson
G Klement
GA Homandberg
GL Rosner
H Jin
J Hanai
JP Eder
JP Thomas
M Kulke
M Sund
MS O'Reilly
N Glenjen
NJ MacDonald
O Kisker
P Carmeliet
P Nyberg
Q Yang
R Kalluri
RM Tjin Tham Sjin
RS Herbst
RS Herbst
SA Karumanchi
SA Wickström
SJ Lee
SM Plum
T Iizasa
TS Zorick
W Shi
Y Hamano
Y Maeshima
Y Sun
Y Yokoyama
YM Kim
Publication venue: Nature Publishing Group
Publication date: 31/10/2005
Field of study

Neovasculature development is a crucial step in the natural history of a cancer. While much emphasis has been placed on proangiogenic growth factors such as VEGF, it is clear that endogenous angiogenesis inhibitors also have critical roles in the regulation of this process. Recent research has identified several cryptic fragments of extracellular matrix/vascular basement membrane proteins that have potent antiangiogenic properties in vivo. It has become apparent that many of these fragments signal via interactions with endothelial integrins, although multiple downstream effector pathways have been implicated and endostatin, the first non-collagenous domain of collagen XVIII, influences an intricate signalling network. The activity of these molecules in animal models suggests that they may have significant clinical activity; however, results of phase I/II trials with endostatin were disappointing. Many possible reasons can be found for the failure of these studies. Weaknesses in trial design, endostatin administration regimen and patient selection are identifiable, and importantly the lack of a clearly defined antiangiogenic mechanism for endostatin hindered assessment of biologically effective dose. Additionally, in vivo immunological and proteolytic function-neutralising mechanisms may have negated endostatin's actions. Lessons learned from these studies will aid the future clinical development of other antiangiogenic extracellular matrix protein fragments

Crossref

PubMed Central

The University of Manchester - Institutional Repository

MSDmotif: exploring protein sites and motifs

Author: A Golovin
A Golovin
A Prilc
A Prlic
Adel Golovin
AG Murzin
AJ Shepherd
AV Efimov
AV Efimov
BL Sibanda
C Bystroff
CA Orengo
CG Hunter
CH Wu
CT Porter
D Schomburg
DCP Kuhn
DI Stuart
DJ Craik
EJ Milner-White
EJ Milner-White
EJ Milner-White
ELL Sonnhammer
ELL Sonnhammer
H Boutselakis
H Kaur
H Kawasaki
HM Berman
ID Kuntz
J Lee
JD Watson
JD Watson
JYL Questel
KB Li
Kim Henrick
M Clamp
MJ Hartshorn
MR Nelson
N Hulo
ND Rawlings
RD Dowell
RD Finn
S Hayward
S Zhirong
SF Altschul
SF Altschul
T Hubbard
TJ Oldfield
TL Bailey
WJ Duddy
WR Pearson
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB) is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS) protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Fast Statistical Alignment

We describe a new program for the alignment of multiple biological sequences that is both statistically motivated and fast enough for problem sizes that arise in practice. Our Fast Statistical Alignment program is based on pair hidden Markov models which approximate an insertion/deletion process on a tree and uses a sequence annealing algorithm to combine the posterior probabilities estimated from these models into a multiple alignment. FSA uses its explicit statistical model to produce multiple alignments which are accompanied by estimates of the alignment accuracy and uncertainty for every column and character of the alignment—previously available only with alignment programs which use computationally-expensive Markov Chain Monte Carlo approaches—yet can align thousands of long sequences. Moreover, FSA utilizes an unsupervised query-specific learning procedure for parameter estimation which leads to improved accuracy on benchmark reference alignments in comparison to existing programs. The centroid alignment approach taken by FSA, in combination with its learning procedure, drastically reduces the amount of false-positive alignment on biological data in comparison to that given by other methods. The FSA program and a companion visualization tool for exploring uncertainty in alignments can be used via a web interface at http://orangutan.math.berkeley.edu/fsa/, and the source code is available at http://fsa.sourceforge.net/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

GoLoco motif proteins binding to Gαi1: insights from molecular simulations

Molecular dynamics simulations, computational alanine scanning and sequence analysis were used to investigate the structural properties of the Gαi1/GoLoco peptide complex. Using these methodologies, binding of the GoLoco motif peptide to the Gαi1 subunit was found to restrict the relative movement of the helical and catalytic domains in the Gαi1 subunit, which is in agreement with a proposed mechanism of GDP dissociation inhibition by GoLoco motif proteins. In addition, the results provide further insights into the role of the “Switch IV” region located within the helical domain of Gα, the conformation of which might be important for interactions with various Gα partners

Crossref

Springer - Publisher Connector

PubMed Central

Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities

The assumption that RNA can be readily classified into either protein-coding or non-protein–coding categories has pervaded biology for close to 50 years. Until recently, discrimination between these two categories was relatively straightforward: most transcripts were clearly identifiable as protein-coding messenger RNAs (mRNAs), and readily distinguished from the small number of well-characterized non-protein–coding RNAs (ncRNAs), such as transfer, ribosomal, and spliceosomal RNAs. Recent genome-wide studies have revealed the existence of thousands of noncoding transcripts, whose function and significance are unclear. The discovery of this hidden transcriptome and the implicit challenge it presents to our understanding of the expression and regulation of genetic information has made the need to distinguish between mRNAs and ncRNAs both more pressing and more complicated. In this Review, we consider the diverse strategies employed to discriminate between protein-coding and noncoding transcripts and the fundamental difficulties that are inherent in what may superficially appear to be a simple problem. Misannotations can also run in both directions: some ncRNAs may actually encode peptides, and some of those currently thought to do so may not. Moreover, recent studies have shown that some RNAs can function both as mRNAs and intrinsically as functional ncRNAs, which may be a relatively widespread phenomenon. We conclude that it is difficult to annotate an RNA unequivocally as protein-coding or noncoding, with overlapping protein-coding and noncoding transcripts further confounding this distinction. In addition, the finding that some transcripts can function both intrinsically at the RNA level and to encode proteins suggests a false dichotomy between mRNAs and ncRNAs. Therefore, the functionality of any transcript at the RNA level should not be discounted

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Queensland eSpace

Phylogeography of Ostreopsis along West Pacific Coast, with Special Reference to a Novel Clade from Japan

Author: A Amato
A De Queiroz
A Laza-Martinez
A Penna
A Penna
A Penna
A Stamatakis
AG Sáez
AM Waterhouse
AW Coleman
B Beszteri
Brett Neilan
C Brescianini
C De Vargas
C-P Leaw
CL Pin
D Morrison
D Sarno
DG Mann
DH Mathews
DJ Thornhill
DR Norris
EG Besada
F Rodriguez
F Ronquist
FH Chang
G Rocap
G Rocap
GM Hallegraeff
Haruo Yamaguchi
Hiroshi Sakanari
I Alvarez
I Pearce
J Šlapeta
JAA Nylander
JD Thompson
JP Huelsenbeck
JP Quod
JS Wiles
K Aligizaki
K Aligizaki
K Katoh
K Tamura
Keita Uehara
KF Darling
KF Darling
Kirsty Smith
L Fritz
L Provasoli
L Rhodes
Lesley Rhodes
LK Medlin
M Adachi
M Adachi
M Chinain
M Clamp
M Gottschling
M Montresor
MA Faust
MA Faust
MA Faust
MA Faust
MA Selina
Masao Adachi
MS McGlone
Naohito Hariganeya
NT Shears
P Ciminiello
PG Williamson
RA Fisher
RC Edgar
RD Barrett
RRL Guillard
RW Litaker
RW Litaker
S Taniyama
S Taniyama
S Taniyama
S Uwai
Shinya Sato
Shoichiro Suda
T Yasumoto
Takeshi Yasumoto
TJ Smayda
Tomohiro Nishimura
Wittaya Tawong
Y Fukuyo
Y Onuma
Yosuke Taira
Publication venue: Public Library of Science
Publication date: 02/12/2011
Field of study

BACKGROUND: A dinoflagellate genus Ostreopsis is known as a potential producer of Palytoxin derivatives. Palytoxin is the most potent non-proteinaceous compound reported so far. There has been a growing number of reports on palytoxin-like poisonings in southern areas of Japan; however, the distribution of Ostreopsis has not been investigated so far. Morphological plasticity of Ostreopsis makes reliable microscopic identification difficult so the employment of molecular tools was desirable. METHODS/PRINCIPAL FINDING: In total 223 clones were examined from samples mainly collected from southern areas of Japan. The D8-D10 region of the nuclear large subunit rDNA (D8-D10) was selected as a genetic marker and phylogenetic analyses were conducted. Although most of the clones were unable to be identified, there potentially 8 putative species established during this study. Among them, Ostreopsis sp. 1-5 did not belong to any known clade, and each of them formed its own clade. The dominant species was Ostreopsis sp. 1, which accounted for more than half of the clones and which was highly toxic and only distributed along the Japanese coast. Comparisons between the D8-D10 and the Internal Transcribed Spacer (ITS) region of the nuclear rDNA, which has widely been used for phylogenetic/phylogeographic studies in Ostreopsis, revealed that the D8-D10 was less variable than the ITS, making consistent and reliable phylogenetic reconstruction possible. CONCLUSIONS/SIGNIFICANCE: This study unveiled a surprisingly diverse and widespread distribution of Japanese Ostreopsis. Further study will be required to better understand the phylogeography of the genus. Our results posed the urgent need for the development of the early detection/warning systems for Ostreopsis, particularly for the widely distributed and strongly toxic Ostreopsis sp. 1. The D8-D10 marker will be suitable for these purposes

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Identification of novel conserved peptide uORF homology groups in Arabidopsis and rice reveals ancient eukaryotic origin of select groups and preferential association with transcription factor-encoding genes

Author: A Churbanov
A Cruz-Ramirez
A Gaba
A Imai
A Marchler-Bauer
A Raney
A Wiese
AG Hinnebusch
AGI
AJ Lincoln
AL Parola
AP Geballe
BG Luukkonen
BR Graveley
BW Verhagen
C Hanfrey
C Hanfrey
CA Hayden
Celine A Hayden
CI Castillo-Davis
E Henriksson
EC Lai
EJ Douzery
EM Zdobnov
F Ronquist
F Rook
G Blanc
G Blanc
G Toledo-Ortiz
GL Law
J Felsenstein
J Felsenstein
J Futterer
J Lee
JB Hollick
JD Thompson
JE Galagan
JF Martinez-Garcia
JL Riechmann
JQ Wen
JR Hill
JR Sanford
K Bharti
K Wolfe
KH Wolfe
L Nover
LC Pendleton
M Anisimova
M Ashburner
M Clamp
M Kozak
M Kozak
M Kozak
M Seki
M Werner
M Yoine
MA Heim
ME Schranz
MJ Sanderson
ML Crowe
ML Marton
MM Lee
N Gutterson
N Satoh-Nagasawa
P Akiva
P Fang
PC Bailey
PJ Eastmond
PT Evans
R Kawaguchi
R Walden
RC Edgar
Richard A Jorgensen
S Kikuchi
SF Altschul
SM Chaw
T Aoyama
T Nakano
T Tabuchi
T Yang
TA Gray
TS Bayer
U Gopfert
V Castelli
W Gish
X Jin
X Wang
XD Fu
Y Sakuma
Z Mou
Z Yang
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Upstream open reading frames (uORFs) can mediate translational control over the largest, or major ORF (mORF) in response to starvation, polyamine concentrations, and sucrose concentrations. One plant uORF with conserved peptide sequences has been shown to exert this control in an amino acid sequence-dependent manner but generally it is not clear what kinds of genes are regulated, or how extensively this mechanism is invoked in a given genome. Results By comparing full-length cDNA sequences from Arabidopsis and rice we identified 26 distinct homology groups of conserved peptide uORFs, only three of which have been reported previously. Pairwise Ka/Ks analysis showed that purifying selection had acted on nearly all conserved peptide uORFs and their associated mORFs. Functions of predicted mORF proteins could be inferred for 16 homology groups and many of these proteins appear to have a regulatory function, including 6 transcription factors, 5 signal transduction factors, 3 developmental signal molecules, a homolog of translation initiation factor eIF5, and a RING finger protein. Transcription factors are clearly overrepresented in this data set when compared to the frequency calculated for the entire genome (p = 1.2 × 10-7). Duplicate gene pairs arising from a whole genome duplication (ohnologs) with a conserved uORF are much more likely to have been retained in Arabidopsis (Arabidopsis thaliana) than are ohnologs of other genes (39% vs 14% of ancestral genes, p = 5 × 10-3). Two uORF groups were found in animals, indicating an ancient origin of these putative regulatory elements. Conclusion Conservation of uORF amino acid sequence, association with homologous mORFs over long evolutionary time periods, preferential retention after whole genome duplications, and preferential association with mORFs coding for transcription factors suggest that the conserved peptide uORFs identified in this study are strong candidates for translational controllers of regulatory genes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central