28 research outputs found

A left-handed crossover involved in amidohydrolase catalysis Crystal structure of Erwinia chrysanthemi l-asparaginase with bound l-aspartate

Author: Gribskov Michael R.
Miller Maria
Rao J.K.Mohana
Wlodawer Alexander
Publication venue: Published by Elsevier B.V.

Abstract: The crystal structure of l-asparaginase from Erwinia chrysanthemi in the presence and absence of l-aspartate was determined at 1.8 Å resolution. Conserved residues in a left-handed crossover (a rare occurrence in protein structures) link pairs of dimers into the catalytically active tetrameric form of the enzyme. The structure of ErA containing bound aspartic acid shows that this unusual strand connectivity is an essential part of the active site architecture, responsible for releasing the product of the enzymatic hydrolysis. The orientation of the bound aspartate indicates for the first time a threonine residue as a catalytic nucleophile.

Elsevier - Publisher Connector

Differential gene expression in Varroa jacobsoni mites following a host shift to European honey bees (Apis mellifera)

Author: Anderson Denis L
Andino Gladys K
Evans Jay D
Gribskov Michael R.
Hunt Greg
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/2016

Background: Varroa mites are widely considered the biggest honey bee health problem worldwide. Until recently, Varroa jacobsoni has been found to live and reproduce only in Asian honey bee (Apis cerana) colonies, while V. destructor successfully reproduces in both A. cerana and A. mellifera colonies. However, we have identified an island population of V. jacobsoni that is highly destructive to A. mellifera, the primary species used for pollination and honey production. The ability of these populations of mites to cross the host species boundary potentially represents an enormous threat to apiculture, and is presumably due to genetic variation that exists among populations of V. jacobsoni that influences gene expression and reproductive status. In this work, we investigate differences in gene expression between populations of V. jacobsoni reproducing on A. cerana and those either reproducing or not capable of reproducing on A. mellifera, in order to gain insight into differences that allow V. jacobsoni to overcome its normal species tropism. Results: We sequenced and assembled a de novo transcriptome of V. jacobsoni. We also performed a differential gene expression analysis contrasting biological replicates of V. jacobsoni populations that differ in their ability to reproduce on A. mellifera. Using the edgeR, EBSeq and DESeq R packages for differential gene expression analysis, we found 287 differentially expressed genes (FDR ≤ 0.05), of which 91% were up regulated in mites reproducing on A. mellifera. In addition, mites found reproducing on A. mellifera showed substantially more variation in expression among replicates. We searched for orthologous genes in public databases and were able to associate 100 of these 287 differentially expressed genes with a functional description. Conclusions: There is differential gene expression between the two mite groups, with more variation in gene expression among mites that were able to reproduce on A. mellifera. 
A small set of genes showed reduced expression in mites on the A. mellifera host, including putative transcription factors and digestive tract developmental genes. The vast majority of differentially expressed genes were up-regulated in this host. This gene set showed enrichment for genes associated with mitochondrial respiratory function and apoptosis, suggesting that mites on this host may be experiencing higher stress, and may be less optimally adapted to parasitize it. Some genes involved in reproduction and oogenesis were also overexpressed, which should be studied further with regard to this host shift. © 2016 The Author(s)

Crossref

PubMed Central

Purdue E-Pubs

A Search for Parent-of-Origin Effects on Honey Bee Gene Expression

Author: Arechavaleta-Velasco Miguel E
Emore Christine M
Gibson Joshua D
Gribskov Michael R
Grozinger Christina M
Hunt Greg J
Kocher Sarah D
Queller David C
San Miguel Phillip
Strassmann Joan E
Tsuruda Jennifer M
Westerman Rick
Publication venue: Washington University Open Scholarship
Publication date: 01/08/2015

Parent-specific gene expression (PSGE) is little known outside of mammals and plants. PSGE occurs when the expression level of a gene depends on whether an allele was inherited from the mother or the father. Kin selection theory predicts that there should be extensive PSGE in social insects because social insect parents can gain inclusive fitness benefits by silencing parental alleles in female offspring. We searched for evidence of PSGE in honey bees using transcriptomes from reciprocal crosses between European and Africanized strains. We found 46 transcripts with significant parent-of-origin effects on gene expression, many of which overexpressed the maternal allele. Interestingly, we also found a large proportion of genes showing a bias toward maternal alleles in only one of the reciprocal crosses. These results indicate that PSGE may occur in social insects. The nonreciprocal effects could be largely driven by hybrid incompatibility between these strains. Future work will help to determine if these are indeed parent-of-origin effects that can modulate inclusive fitness benefits

Washington University St. Louis: Open Scholarship

Establishing bioinformatics research in the Asia Pacific

Author: A Christoffels
A Konagaya
A Suresh
AKMA Baten
AM Khan
AR Sikder
CY Lin
D Gilbert
H Sugawara
HH Lin
J Sprenger
JC Tong
LJK Wee
M Brahmachary
Martti Tammi
Michael Gribskov
R Thadani
RTH Tsai
S Bhattacharya
S Foret
S Mathivanan
S Miyano
S Ranganathan
S Ranjan
S Takasaki
Shoba Ranganathan
Tin Wee Tan
U Kulkarni-Kale
X Wu
YP Lim
Publication venue: BioMed Central
Publication date: 01/01/2006

In 1998, the Asia Pacific Bioinformatics Network (APBioNet), Asia's oldest bioinformatics organisation, was set up to champion the advancement of bioinformatics in the Asia Pacific. By 2002, APBioNet was able to gain sufficient critical mass to initiate the first International Conference on Bioinformatics (InCoB) bringing together scientists working in the field of bioinformatics in the region. This year, the InCoB2006 Conference was organized as the 5th annual conference of the Asia-Pacific Bioinformatics Network, on Dec. 18–20, 2006 in New Delhi, India, following a series of successful events in Bangkok (Thailand), Penang (Malaysia), Auckland (New Zealand) and Busan (South Korea). This Introduction provides a brief overview of the peer-reviewed manuscripts accepted for publication in this Supplement. It exemplifies a typical snapshot of the growing research excellence in bioinformatics of the region as we embark on a trajectory of establishing a solid bioinformatics research culture in the Asia Pacific that is able to contribute fully to the global bioinformatics community.

Crossref

Springer - Publisher Connector

PubMed Central

Purdue E-Pubs

Macquarie University ResearchOnline

ScholarBank@NUS

Fast index based algorithms and software for matching position specific scoring matrices

Author: A Kel
A Sandelin
B Dorohonceanu
D Weeks
G Castillo
H Gonnet
J Henikoff
J Henikoff
J Kärkkäinen
K Quandt
L Goldstein
LR Murphy
M Abouelhoda
M Beckstette
M Beckstette
M Gribskov
Michael Beckstette
N de Bruijn
N Hulo
P Embrechts
P Haverty
P Scordis
R Giegerich
R Staden
R Tatusov
Robert Giegerich
Robert Homann
S Kurtz
S Kurtz
S Rahmann
S Rajasekaran
Stefan Kurtz
T Kasai
T Li
T Wu
T Wu
TK Attwood
V Freschi
V Matys
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: In biological sequence analysis, position specific scoring matrices (PSSMs) are widely used to represent sequence motifs in nucleotide as well as amino acid sequences. Searching with PSSMs in complete genomes or large sequence databases is a common, but computationally expensive task. RESULTS: We present a new non-heuristic algorithm, called ESAsearch, to efficiently find matches of PSSMs in large databases. Our approach preprocesses the search space, e.g., a complete genome or a set of protein sequences, and builds an enhanced suffix array that is stored on file. This allows the searching of a database with a PSSM in sublinear expected time. Since ESAsearch benefits from small alphabets, we present a variant operating on sequences recoded according to a reduced alphabet. We also address the problem of non-comparable PSSM-scores by developing a method which allows the efficient computation of a matrix similarity threshold for a PSSM, given an E-value or a p-value. Our method is based on dynamic programming and, in contrast to other methods, it employs lazy evaluation of the dynamic programming matrix. We evaluated algorithm ESAsearch with nucleotide PSSMs and with amino acid PSSMs. Compared to the best previous methods, ESAsearch shows speedups of a factor between 17 and 275 for nucleotide PSSMs, and speedups up to factor 1.8 for amino acid PSSMs. Comparisons with the most widely used programs even show speedups by a factor of at least 3.8. Alphabet reduction yields an additional speedup factor of 2 on amino acid sequences compared to results achieved with the 20 symbol standard alphabet. The lazy evaluation method is also much faster than previous methods, with speedups of a factor between 3 and 330. 
CONCLUSION: Our analysis of ESAsearch reveals sublinear runtime in the expected case, and linear runtime in the worst case for sequences not shorter than |[Formula: see text]|^m + m - 1, where m is the length of the PSSM and [Formula: see text] a finite alphabet. In practice, ESAsearch shows superior performance over the most widely used programs, especially for DNA sequences. The new algorithm for accurate on-the-fly calculations of thresholds has the potential to replace formerly used approximation approaches. Beyond the algorithmic contributions, we provide a robust, well documented, and easy to use software package, implementing the ideas and algorithms presented in this manuscript.
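
For readers unfamiliar with PSSM matching, the following is a minimal sketch of the naive sliding-window scan that index-based methods such as ESAsearch aim to outperform. It is not the ESAsearch algorithm and uses no suffix array. The function name scan_pssm, the toy matrix, and the threshold are all hypothetical, chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: one dict of per-symbol scores per motif position.
PSSM = List[Dict[str, float]]


def scan_pssm(sequence: str, pssm: PSSM, threshold: float) -> List[Tuple[int, float]]:
    """Naive scan: score every window of length m and keep those >= threshold.

    Runs in O(n * m) time for a sequence of length n and a PSSM of length m.
    """
    m = len(pssm)
    hits = []
    for start in range(len(sequence) - m + 1):
        # The window score is the sum of the per-position scores.
        score = sum(pssm[i][sequence[start + i]] for i in range(m))
        if score >= threshold:
            hits.append((start, score))
    return hits


# Toy 3-column nucleotide PSSM that favours the motif "ACG" (made-up scores).
toy_pssm = [
    {"A": 2, "C": -1, "G": -1, "T": -1},
    {"A": -1, "C": 2, "G": -1, "T": -1},
    {"A": -1, "C": -1, "G": 2, "T": -1},
]

hits = scan_pssm("TTACGACGT", toy_pssm, threshold=6)
```

Every window is rescored from scratch here, which is what makes the naive scan linear in the database size times the motif length; the abstract's claim of sublinear expected time rests on preprocessing the database into an index instead.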

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Publications at Bielefeld University

Bioinformatics research in the Asia Pacific: a 2007 update

Author: A Madhumalar
BC Kim
C Wang
CJO Baker
D Gilbert
DT Singh
GL Zhang
H Sugawara
H Zhao
KH Choo
L Kong
M Ganapathiraju
Michael Gribskov
N Yanamala
O Miotto
O Miotto
PD Yoo
Q Xu
R Ördög
RTH Tsai
S Dastmalchi
S Miyano
S Ranganathan
S Ranganathan
S Ranganathan
SH Chen
SH Nagaraj
Shoba Ranganathan
Tin Wee Tan
U Sangket
V Chelliah
WY Kim
YP Lim
Publication venue: BioMed Central
Publication date: 01/01/2008

We provide a 2007 update on the bioinformatics research in the Asia-Pacific from the Asia Pacific Bioinformatics Network (APBioNet), Asia's oldest bioinformatics organisation set up in 1998. From 2002, APBioNet has organized the first International Conference on Bioinformatics (InCoB) bringing together scientists working in the field of bioinformatics in the region. This year, the InCoB2007 Conference was organized as the 6th annual conference of the Asia-Pacific Bioinformatics Network, on Aug. 27–30, 2007 at Hong Kong, following a series of successful events in Bangkok (Thailand), Penang (Malaysia), Auckland (New Zealand), Busan (South Korea) and New Delhi (India). Besides a scientific meeting at Hong Kong, satellite events organized are a pre-conference training workshop at Hanoi, Vietnam and a post-conference workshop at Nansha, China. This Introduction provides a brief overview of the peer-reviewed manuscripts accepted for publication in this Supplement. We have organized the papers into thematic areas, highlighting the growing contribution of research excellence from this region, to global bioinformatics endeavours

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

Macquarie University ResearchOnline

ScholarBank@NUS

The Genome of Nectria haematococca: Contribution of Supernumerary Chromosomes to Gene Expansion

The ascomycetous fungus Nectria haematococca, (asexual name Fusarium solani), is a member of a group of >50 species known as the “Fusarium solani species complex”. Members of this complex have diverse biological properties including the ability to cause disease on >100 genera of plants and opportunistic infections in humans. The current research analyzed the most extensively studied member of this complex, N. haematococca mating population VI (MPVI). Several genes controlling the ability of individual isolates of this species to colonize specific habitats are located on supernumerary chromosomes. Optical mapping revealed that the sequenced isolate has 17 chromosomes ranging from 530 kb to 6.52 Mb and that the physical size of the genome, 54.43 Mb, and the number of predicted genes, 15,707, are among the largest reported for ascomycetes. Two classes of genes have contributed to gene expansion: specific genes that are not found in other fungi including its closest sequenced relative, Fusarium graminearum; and genes that commonly occur as single copies in other fungi but are present as multiple copies in N. haematococca MPVI. Some of these additional genes appear to have resulted from gene duplication events, while others may have been acquired through horizontal gene transfer. The supernumerary nature of three chromosomes, 14, 15, and 17, was confirmed by their absence in pulsed field gel electrophoresis experiments of some isolates and by demonstrating that these isolates lacked chromosome-specific sequences found on the ends of these chromosomes. These supernumerary chromosomes contain more repeat sequences, are enriched in unique and duplicated genes, and have a lower G+C content in comparison to the other chromosomes. Although the origin(s) of the extra genes and the supernumerary chromosomes is not known, the gene expansion and its large genome size are consistent with this species' diverse range of habitats. 
Furthermore, the presence of unique genes on supernumerary chromosomes might account for individual isolates having different environmental niches

Public Library of Science (PLOS)

HAL AMU

Directory of Open Access Journals

PubMed Central

University of Kentucky

Purdue E-Pubs

VTT Research System

ProdInra

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Accurate classification of RNA structures using topological fingerprints

While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity, an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Because it includes pseudoknots, the RNA fingerprint approach covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint

Crossref

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

FigShare

Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST

Author: AA Schäffer
AL Delcher
Alejandro A Schäffer
B Brejová
B Hao
BG Barrell
DJ States
E Birney
E Birney
E Boy-Marcotte
E Boy-Marcotte
E Halperin
E Michael Gertz
EM Gertz
F Damak
F Zinoni
G Macino
H Peltola
IG Young
J Hein
J Hein
JC Wootton
L Knecht
M Gribskov
MS Boguski
MS Boguski
MS Gelfand
O Gotoh
P Steneberg
P Steneberg
R Durbin
Richa Agarwala
S Henikoff
S Kurtz
SA Chervitz
SC Low
SF Altschul
SF Altschul
SF Altschul
SF Altschul
Stephen F Altschul
TF Smith
W Gish
WJ Kent
WR Pearson
WR Pearson
WR Pearson
X Guan
X Huang
Yi-Kuo Yu
YK Yu
YK Yu
Z Zhang
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: TBLASTN is a mode of operation for BLAST that aligns protein sequences to a nucleotide database translated in all six frames. We present the first description of the modern implementation of TBLASTN, focusing on new techniques that were used to implement composition-based statistics for translated nucleotide searches. Composition-based statistics use the composition of the sequences being aligned to generate more accurate E-values, which allows for a more accurate distinction between true and false matches. Until recently, composition-based statistics were available only for protein-protein searches. They are now available as a command line option for recent versions of TBLASTN and as an option for TBLASTN on the NCBI BLAST web server. RESULTS: We evaluate the statistical and retrieval accuracy of the E-values reported by a baseline version of TBLASTN and by two variants that use different types of composition-based statistics. To test the statistical accuracy of TBLASTN, we ran 1000 searches using scrambled proteins from the mouse genome and a database of human chromosomes. To test retrieval accuracy, we modernize and adapt to translated searches a test set previously used to evaluate the retrieval accuracy of protein-protein searches. We show that composition-based statistics greatly improve the statistical accuracy of TBLASTN, at a small cost to the retrieval accuracy. CONCLUSION: TBLASTN is widely used, as it is common to wish to compare proteins to chromosomes or to libraries of mRNAs. Composition-based statistics improve the statistical accuracy, and therefore the reliability, of TBLASTN results. The algorithms used by TBLASTN are not widely known, and some of the most important are reported here. The data used to test TBLASTN are available for download and may be useful in other studies of translated search algorithms

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The Selaginella Genome Identifies Genetic Changes Associated with the Evolution of Vascular Plants

Author: Albert V. A.
Ambrose B. A.
Aono N.
Aoyama T.
Ashton N. W.
Axtell M. J.
Banks J. A.
Barker E.
Barker M. S.
Bennetzen J. L.
Bonawitz N. D.
Bowman J. L.
Chapple C.
Cheng C.
Correa L. G. G.
Dacre M.
Debarry J.
Depamphilis C.
Dreyer I.
Elias M.
Engstrom E. M.
Estelle M.
Feng L.
Finet C.
Floyd S. K.
Frommer W. B.
Fujita T.
Gramzow L.
Gribskov M.
Gutensohn M.
Harholt J.
Hasebe M.
Hattori M.
Hellsten U.
Heyl A.
Hirai T.
Hiwatashi Y.
Ishikawa M.
Iwata M.
Karol K. G.
Koehler B.
Kolukisaoglu U.
Kubo M.
Kurata T.
Lalonde S.
Li K.
Li Y.
Lindquist E.
Litt A.
Loqué D.
Lyons E.
Manning G.
Maruyama T.
Michael T. P.
Mikami K.
Mitros T.
Miyazaki S.
Morinaga S.
Mueller-Roeber B.
Murata T.
Nelson D. R.
Nishiyama T.
Obara M.
Oguri Y.
Olmstead R. G.
Onodera N.
Otillar R.
Petersen B. L.
Pils B.
Prigge M.
Rensing S. A.
Riaño-Pachón D. M.
Roberts A. W.
Salamov A.
Sato Y.
Scheller H. V.
Schmutz J.
Schulz B.
Schulz C.
Shakirov E. V.
Shapiro H.
Shibagaki N.
Shinohara N.
Shippen D. E.
Sotooka R.
Sugimoto N.
Sugita M.
Sumikawa N.
Sørensen I.
Tanurdzic M.
Theißen G.
Ulvskov P.
Wakazuki S.
Weng J.
Willats W. W. G. T.
Wipf D.
Wolf P. G.
Yang L.
Zhu Q.
Zimmer A. D.
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 01/01/2011

Vascular plants appeared ~410 million years ago, then diverged into several lineages of which only two survive: the euphyllophytes (ferns and seed plants) and the lycophytes (1). We report here the genome sequence of the lycophyte Selaginella moellendorffii (Selaginella), the first non-seed vascular plant genome reported. By comparing gene content in evolutionarily diverse taxa, we found that the transition from a gametophyte- to sporophyte-dominated life cycle required far fewer new genes than the transition from a non-seed vascular to a flowering plant, while secondary metabolic genes expanded extensively and in parallel in the lycophyte and angiosperm lineages. Selaginella differs in post-transcriptional gene regulation, including small RNA regulation of repetitive elements, an absence of the tasiRNA pathway and extensive RNA editing of organellar genes.

Cold Spring Harbor Laboratory Institutional Repository

MPG.PuRe