17 research outputs found

Plant protein-coding gene families: emerging bioinformatics approaches

Author: Altschul
Andreeva
Attwood
Beers
Benson
Bru
Cambra
Carretero-Paulet
Chain
Chen
Cochrane
Cuff
de Lima Morais
Del Bem
Enright
Faro
Feng
Finn
Fraser
Frech
Garcia-Lorenzo
Guilfoyle
Guindon
Haft
Hunter
Kaminuma
Kersey
Klimke
Kolodziejczyk
Kotsyfakis
Lees
Leinonen
Letunic
Li
Li
Lijavetzky
Lima
Liolios
Lu
Manuel Martinez
Marchler-Bauer
Martinez
Martinez
Martinez
Mi
Moreno-Risueno
Mugford
Nikolskaya
Nissen
Paterson
Pearson
Perez-Rodriguez
Philippe
Plett
Proost
Pruitt
Rautengarten
Rawlings
Remington
Roberts
Rouard
Sigrist
Singh
Swaminathan
Takahashi
Tatusov
Tian
Tyler
UniProt_Consortium
Van de Peer
Vercammen
Wang
Yu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011

Protein-coding gene families are sets of similar genes with a shared evolutionary origin and, generally, with similar biological functions. In plants, the size and role of gene families have been only partially addressed. However, suitable bioinformatics tools are being developed to cluster the enormous number of sequences currently available in databases. Specifically, comparative genomic databases promise to become powerful tools for gene family annotation in plant clades. In this review, I evaluate the data retrieved from various gene family databases, the ease with which they can be extracted and how useful the extracted information is.

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

A Human-Specific De Novo Protein-Coding Gene Associated with Human Brain Functions

Author: A Siepel
A Siepel
A Varki
AC Marques
B Ewing
Chuan-Yun Li
Chunmei Cao
D Gordon
D Karolchik
D Leister
D Wang
DA Nickerson
DG Knowles
DG Knowles
DJ Begun
DL Hartl
DL Wheeler
EJ Vallender
ES Lander
F Duan
FG Wulczyn
George R. Uhl
GM Cooper
GR Uhl
GR Uhl
J Cai
J Rozas
JP Gong
K Chen
Liping Wei
M Long
M Toll-Riera
M Wu
MT Levine
Philip E. Bourne
Ping-Wu Zhang
Qing-Rong Liu
QR Liu
QR Liu
Quan Du
Quan Yu
RC Gentleman
RR Hudson
S Ohno
SF Saccone
Shu-Juan Lu
ST Chen
T Barrett
UniProt_Consortium
W Peng
W Wang
Xiao-Mo Li
Xiaofeng Zheng
Yan Zhang
Yong Zhang
Z Wu
Zhanbo Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010

To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerability factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203). Cross-species analysis revealed how this gene originated from noncoding DNA sequences: insertion of repeat elements, especially Alu, contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. An immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expression of FLJ33706 was detected in Alzheimer's brain samples, suggesting a role for this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The maternal and early embryonic transcriptome of the milkweed bug Oncopeltus fasciatus

Author: A Abzhanov
A Conesa
A Dorn
A Grimson
A Papanicolaou
A Rosenblueth
A Sarkar
AP Orth
B Ewen-Campen
B Ewing
B Ewing
Ben Ewen-Campen
C Camacho
C Nüsslein-Volhard
Cassandra G Extavour
CE Bruder
CJ Lowe
CL Hughes
D Bellin
D Erezyilmaz
D Gordon
D Lawson
DA Hahn
DF Erezyilmaz
DR Angelini
E Huebner
E Kristiansson
E Meyer
E Novaes
E Toulza
EA Bogdanova
EM Zdobnov
F Cheung
F Roeding
F Zhang
FH Butt
IAG Consortium
J Schmid
JA Bolker
JC Vera
JMW Slack
K Mita
KA Panfilio
KD Pruitt
Kristen A Panfilio
M Ashburner
M Kumé
MD Piulachs
MD Robinson
N Garcia-Reyero
Nathan Shaner
P Beldade
P Liu
P Liu
P Liu
PA Lawrence
PA Lawrence
PD Danley
R Nunes da Fonseca
RA Jenner
RE Timme
RJ Sommer
S Kumar
S Tweedie
SA Shabalina
SB Hedges
Siegfried Roth
ST O'Neil
TL Parchman
UniProt_Consortium
W Brockman
WN Beklemishev
WN Beklemishev
X Huang
Y Pauchet
Y Surget-Groba
Yuichiro Suzuki
YY Zhu
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Most evolutionary developmental biology ("evo-devo") studies of emerging model organisms focus on small numbers of candidate genes cloned individually using degenerate PCR. However, newly available sequencing technologies such as 454 pyrosequencing have recently begun to allow for massive gene discovery in animals without sequenced genomes. Within insects, although large volumes of sequence data are available for holometabolous insects, developmental studies of basally branching hemimetabolous insects typically suffer from low rates of gene discovery. Results: We used 454 pyrosequencing to sequence over 500 million bases of cDNA from the ovaries and embryos of the milkweed bug Oncopeltus fasciatus, which lacks a sequenced genome. This indirectly developing insect occupies an important phylogenetic position, branching basal to Diptera (including fruit flies) and Hymenoptera (including honeybees), and is an experimentally tractable model for short-germ development. 2,087,410 reads from both normalized and non-normalized cDNA assembled into 21,097 sequences (isotigs) and 112,531 singletons. The assembled sequences fell into 16,617 unique gene models, and included predictions of splicing isoforms, which we examined experimentally. Discovery of new genes plateaued after assembly of ~1.5 million reads, suggesting that we have sequenced nearly all transcripts present in the cDNA sampled. Many transcripts have been assembled at close to full length, and there is a net gain of sequence data for over half of the pre-existing O. fasciatus accessions for developmental genes in GenBank. We identified 10,775 unique genes, including members of all major conserved metazoan signaling pathways and genes involved in several major categories of early developmental processes. We also specifically address the effects of cDNA normalization on gene discovery in de novo transcriptome analyses.
Conclusions: Our sequencing, assembly and annotation framework provides a simple and effective way to achieve high-throughput gene discovery for organisms lacking a sequenced genome. These data will have applications to the study of the evolution of arthropod genes and genetic pathways, and to the wider evolution, development and genomics communities working with emerging model organisms. [The sequence data from this study have been submitted to GenBank under study accession number SRP002610 (http://www.ncbi.nlm.nih.gov/sra?term=SRP002610). Custom scripts generated are available at http://www.extavourlab.com/protocols/index.html. Seven Additional files are available.]

Crossref

Kölner UniversitätsPublikationsServer

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

High throughput estimation of functional cell activities reveals disease mechanisms and predicts relevant clinical outcomes

This work is supported by grants BIO2014-57291-R from the Spanish Ministry of Economy and Competitiveness and “Plataforma de Recursos Biomoleculares y Bioinformáticos” PT13/0001/0007 from the ISCIII, both co-funded with European Regional Development Funds (ERDF); PROMETEOII/2014/025 from the Generalitat Valenciana (GVA-FEDER); Fundació la Marató TV3 (ref. 20133134); and EU H2020-INFRADEV-1-2015-1 ELIXIR-EXCELERATE (ref. 676559) and EU FP7-People ITN Marie Curie Project (ref. 316861).

Crossref

Queen Mary Research Online

Transcriptomic characterization of the enzymatic antioxidants FeSOD, MnSOD, APX and KatG in the dinoflagellate genus Symbiodinium

Author: A Bodył
A Gutteridge
A Pierleoni
A Untergasser
AA Salamov
AB Mayfield
AB Mayfield
C Arif
C Shinzato
CH Foyer
CL Andersen
CR Voolstra
DA Benson
DA Schoenberg
DA Schoenberg
DDK Logan
DJ Barshis
DJ Miller
DJ Thornhill
DM Hillis
EM Sampayo
ES McGinty
F Abascal
F Dufernez
G Smulevich
GM Cooper
H Jespersen
H Putnam
H Zhang
H Zhang
HD Freudenthal
IM Yakovleva
J Felsenstein
JA Schwarz
JD Thompson
JH Wisecaver
JJA McLaughlin
JL Matta
JT Ladner
K Takishita
K Takishita
K Wada
KG Welinder
L Käll
L Muscatine
M Rodriguez-Lanetty
M Stat
M Stat
M Takabayashi
M Zámocký
M Zámocký
M Zámocký
MA Coffroth
MA Nowak
MK DeSalvo
MP Lesser
MP Lesser
MP Lesser
MW Pfaffl
N Császár
N Fankhauser
N Fawal
NJ Patron
NJ Patron
NN Rosic
NN Rosic
NR Polato
NT Pitsch
O Emanuelsson
O Hoegh-Guldberg
OK Okamoto
Ove Hoegh-Guldberg
P Milos
Paul L Fisher
PL Fisher
PW Glynn
R Berkelmans
R Guo
R Mittler
R Rowan
R Rowan
R Rowan
R Rowan
R Wintjens
RD Baker
RD Kortschak
RF Stern
RG Alscher
RJ Noel
S Baumgarten
S Forêt
S Guindon
S Kawaguti
S Richier
SE Edge
SF Altschul
Simon K Davy
Sophie Dove
SR Santos
Stefanie Pontasch
Susanne Becker
T Bayer
T Krueger
TC LaJeunesse
TC LaJeunesse
Thomas Krueger
TP Hughes
UniProt_Consortium
VM Weis
W Leggat
W Leggat
William Leggat
X Pochon
X Pochon
X-P Zhang
Y Li
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'

Crossref

CRK: an evolutionary approach for distinguishing biologically relevant interfaces from crystal contacts

Author: Altschul
Bahadur
Berman
Bernauer
Coulibaly
Davis
DeLano
Dey
Doron-Faigenboim
Elcock
Eliot
Gottschalk
Guharoy
Hamiaux
Janin
Kobe
Krissinel
Krissinel
Lubkowski
Nishiyama
Notredame
Percudani
Ponstingl
Reyes
Stern
Uniprot_Consortium
Ward
Yang
Zhu
Publication venue: 'Wiley'
Publication date: 01/01/2010

Protein crystals contain two different types of interfaces: biologically relevant ones, observed in protein-protein complexes and oligomeric proteins, and nonspecific ones, corresponding to crystal lattice contacts. Because of the increasing complexity of the objects being tackled in structural biology, distinguishing biological contacts from crystal contacts is not always a trivial task, and errors can lead to misinterpretation of macromolecular structures. We devised an approach (CRK, core-rim K(a)/K(s) ratio) for distinguishing biologically relevant interfaces from nonspecific ones. Given a protein-protein interface, CRK finds a set of homologs to the sequences of the proteins involved in the interface, then retrieves and aligns the corresponding coding sequences, on which it carries out a residue-by-residue K(a)/K(s) ratio (omega) calculation. It divides interface residues into a "rim" and a "core" set and analyzes the selection pressure on the residues belonging to the two sets. We developed and tested CRK on different datasets and test cases, consisting of biologically relevant contacts, nonspecific ones or both types. The method proves very effective in distinguishing the two categories of interfaces, with an overall accuracy rate of 84%. As it relies on different principles from those of existing tools, CRK is well suited to be used in combination with them. In addition, CRK has potential applications in the validation of structures of oligomeric proteins and protein complexes.
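
The core-versus-rim comparison described in this abstract can be illustrated with a minimal Python sketch. It is not the CRK implementation: the function names, the use of a simple mean over per-residue omega values, and the decision threshold of 1.0 are all assumptions made for illustration, and the per-residue K(a)/K(s) values are taken as already computed.

```python
from statistics import mean


def core_rim_omega_ratio(omega, core, rim):
    """Return mean omega over core residues divided by mean omega over rim residues.

    omega: dict mapping residue id -> per-residue Ka/Ks value (assumed precomputed).
    core, rim: iterables of residue ids (hypothetical partition of the interface).
    """
    core_mean = mean(omega[r] for r in core)
    rim_mean = mean(omega[r] for r in rim)
    if rim_mean == 0:
        raise ValueError("mean rim omega is zero; ratio is undefined")
    return core_mean / rim_mean


def looks_biological(omega, core, rim, threshold=1.0):
    """Illustrative rule (an assumption, not the published criterion):
    a core under stronger purifying selection than the rim (ratio below
    the threshold) is read as a hint of a biologically relevant interface."""
    return core_rim_omega_ratio(omega, core, rim) < threshold


# Toy data: core residues 1-2 are more conserved than rim residues 3-4.
toy_omega = {1: 0.1, 2: 0.2, 3: 0.6, 4: 0.4}
toy_ratio = core_rim_omega_ratio(toy_omega, core=[1, 2], rim=[3, 4])
toy_call = looks_biological(toy_omega, core=[1, 2], rim=[3, 4])
```

In this toy case the core has a lower mean omega than the rim, so the illustrative rule flags the interface as likely biological; a real analysis would need the homolog search, codon alignment and omega estimation that the abstract describes.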

Crossref

ZORA