Search CORE

27 research outputs found

ACLAME: A CLAssification of Mobile genetic Elements, update 2010

Author: Altschul
Andreeva
Ariane Toussaint
Ashburner
Benson
Büchen-Osmond
Enright
Frost
Gipsi Lima-Mendez
Killcoyne
Lawrence
Leplae
Lima-Mendez
Lima-Mendez
Lima-Mendez
Molbak
Moura
Norman
Pearson
Pellegrini
Raphaël Leplae
Rohwer
Rohwer
Siguier
The Universal Protein Resource (UniProt)
Toussaint
Toussaint
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The ACLAME database is dedicated to the collection, analysis and classification of sequenced mobile genetic elements (MGEs, in particular phages and plasmids). In addition to providing information on the MGEs content, classifications are available at various levels of organization. At the gene/protein level, families group similar sequences that are expected to share the same function. Families of four or more proteins are manually assigned with a functional annotation using the GeneOntology and the locally developed ontology MeGO dedicated to MGEs. At the genome level, evolutionary cohesive modules group sets of protein families shared among MGEs. At the population level, networks display the reticulate evolutionary relationships among MGEs. To increase the coverage of the phage sequence space, ACLAME version 0.4 incorporates 760 high-quality predicted prophages selected from the Prophinder database. Most of the data can be downloaded from the freely accessible ACLAME web site (http://aclame.ulb.ac.be). The BLAST interface for querying the database has been extended and numerous tools for in-depth analysis of the results have been added

Tachyon search speeds up retrieval of similar sequences by several orders of magnitude

Author: Altschul
Benson
Chia Yee Kwoh
Durga Kuchibhatla
Fernanda L. Sirota
Frank Eisenhaber
Georg Schneider
Joshua Tan
Katoh
Kent
Ooi
Pearson
Sayers
Sebastian Maurer-Stroh
The Universal Protein Resource (UniProt) in 2010.
Tobias Gattermayer
Waterhouse
Westley A. Sherman
Wootton
Zhao
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Summary: The usage of current sequence search tools becomes increasingly slower as databases of protein sequences continue to grow exponentially. Tachyon, a new algorithm that identifies closely related protein sequences ~200 times faster than standard BLAST, circumvents this limitation with a reduced database and oligopeptide matching heuristic

Crossref

PubMed Central

DR-NTU (Digital Repository of NTU)

MINT, the molecular interaction database: 2009 update

Author: Andrew Chatr Aryamontri
Aragues
Arnaud Ceol
Benson
Ceol
Ceol
Cerami
Chatr-aryamontri
Chatr-aryamontri
Chatr-Aryamontri
Chautard
Cusick
Daniele Peluso
Gianni Cesareni
Hubbard
Jayapandian
Kerrien
Kerrien
Killcoyne
Kulikova
Leonardo Briganti
Livia Perfetto
Luana Licata
Luisa Castagnoli
Matthews
Orchard
Orchard
Persico
Pruitt
Razick
Salwinski
Sugawara
The Universal Protein Resource (UniProt)
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

MINT (http://mint.bio.uniroma2.it/mint) is a public repository for molecular interactions reported in peer-reviewed journals. Since its last report, MINT has grown considerably in size and evolved in scope to meet the requirements of its users. The main changes include a more precise definition of the curation policy and the development of an enhanced and user-friendly interface to facilitate the analysis of the ever-growing interaction dataset. MINT has adopted the PSI-MI standards for the annotation and for the representation of molecular interactions and is a member of the IMEx consortium

Crossref

PubMed Central

ART

From experimental setup to bioinformatics: An RNAi screening platform to identify host factors involved in HIV-1 replication

Author: Alexa
Boutros
Brass
Brideau
Bushman
Carter
Cleveland
Erfle
Erfle
Eyre
Goff
Hubbard
Jensen
Kanehisa
Konig
Maglott
Malim
Malo
Martin
Mathivanan
Mishra
Novina
Otsu
Prudencio
Pruitt
Ptak
Rieber
Shannon
The universal protein resource (UniProt)
Thomas
Welker
Zhou
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Verification of alternative splicing variants based on domain integrity, truncation length and intrinsic protein disorder

Author: (2009) The Universal Protein Resource (UniProt) 2009
Altschul
Andreeva
Birzele
Boutet
Chothia
Consortium UniProt
Dosztanyi
Dyson
Fiegen
Finn
Fleming
Flicek
Hashimoto
Hedi Hegyi
Hegyi
Hubbard
Jin
Johnson
Kachel
Katzenberger
Kincaid
Koscielny
Kriventseva
Lajos Kalmar
Lewis
Liang
Liu
Melamud
Melamud
Nagy
Pan
Pan
Pan
Peter Tompa
Power
Pruitt
Romero
Saltzman
Shionyu
Stetefeld
Tamas Horvath
Tanner
Thanaraj
Tress
Tress
Trinh
Vashist
Vashist
Wang
Wang
Yura
Publication venue: Oxford University Press
Publication date
Field of study

According to current estimations ∼95% of multi-exonic human protein-coding genes undergo alternative splicing (AS). However, for 4000 human proteins in PDB, only 14 human proteins have structures of at least two alternative isoforms. Surveying these structural isoforms revealed that the maximum insertion accommodated by an isoform of a fully ordered protein domain was 5 amino acids, other instances of domain changes involved intrinsic structural disorder. After collecting 505 minor isoforms of human proteins with evidence for their existence we analyzed their length, protein disorder and exposed hydrophobic surface. We found that strict rules govern the selection of alternative splice variants aimed to preserve the integrity of globular domains: alternative splice sites (i) tend to avoid globular domains or (ii) affect them only marginally or (iii) tend to coincide with a location where the exposed hydrophobic surface is minimal or (iv) the protein is disordered. We also observed an inverse correlation between the domain fraction lost and the full length of the minor isoform containing the domain, possibly indicating a buffering effect for the isoform protein counteracting the domain truncation effect. These observations provide the basis for a prediction method (currently under development) to predict the viability of splice variants

Crossref

PubMed Central

Co-regulation of alternative splicing by diverse splicing factors in Caenorhabditis elegans

Author: Alan M. Zahler
Anderson
Anyanful
Barash
Barberan-Soler
Barberan-Soler
Barberan-Soler
Ben-Dov
Blanchette
Chalfie
Clower
David
Davies
Fisette
Francis
James Williams
Jeffrey Estella
Kanopka
Kawano
Kuroyanagi
Kuroyanagi
Lin
Longman
Loria
Lundquist
Lundquist
Martinez-Contreras
Martinez-Contreras
Motta-Mena
Nilsen
Ohno
Pedro Medina
Pfaffl
Rooke
Sergio Barberan-Soler
Skipper
Spartz
Spike
The Universal Protein Resource
Tian
Underwood
Venables
Wang
Yochem
Zahler
Publication venue: Oxford University Press
Publication date
Field of study

Regulation of alternative splicing is controlled by pre-mRNA sequences (cis-elements) and trans-acting protein factors that bind them. The combinatorial interactions of multiple protein factors with the cis-elements surrounding a given alternative splicing event lead to an integrated splicing decision. The mechanism of multifactorial splicing regulation is poorly understood. Using a splicing-sensitive DNA microarray, we assayed 352 Caenorhabditis elegans alternative cassette exons for changes in embryonic splicing patterns between wild-type and 12 different strains carrying mutations in a splicing factor. We identified many alternative splicing events that are regulated by multiple splicing factors. Many splicing factors have the ability to behave as splicing repressors for some alternative cassette exons and as splicing activators for others. Unexpectedly, we found that the ability of a given alternative splicing factor to behave as an enhancer or repressor of a specific splicing event can change during development. Our observations that splicing factors can change their effects on a substrate during development support a model in which combinatorial effects of multiple factors, both constitutive and developmentally regulated ones, contribute to the overall splicing decision

Crossref

PubMed Central

Reconstruction and analysis of genome-scale metabolic model of a photosynthetic bacterium

Abstract Background <it>Synechocystis </it>sp. PCC6803 is a cyanobacterium considered as a candidate photo-biological production platform - an attractive cell factory capable of using CO2 and light as carbon and energy source, respectively. In order to enable efficient use of metabolic potential of <it>Synechocystis </it>sp. PCC6803, it is of importance to develop tools for uncovering stoichiometric and regulatory principles in the <it>Synechocystis </it>metabolic network. Results We report the most comprehensive metabolic model of <it>Synechocystis </it>sp. PCC6803 available, <it>i</it>Syn669, which includes 882 reactions, associated with 669 genes, and 790 metabolites. The model includes a detailed biomass equation which encompasses elementary building blocks that are needed for cell growth, as well as a detailed stoichiometric representation of photosynthesis. We demonstrate applicability of <it>i</it>Syn669 for stoichiometric analysis by simulating three physiologically relevant growth conditions of <it>Synechocystis </it>sp. PCC6803, and through <it>in silico </it>metabolic engineering simulations that allowed identification of a set of gene knock-out candidates towards enhanced succinate production. Gene essentiality and hydrogen production potential have also been assessed. Furthermore, <it>i</it>Syn669 was used as a transcriptomic data integration scaffold and thereby we found metabolic hot-spots around which gene regulation is dominant during light-shifting growth regimes. Conclusions <it>i</it>Syn669 provides a platform for facilitating the development of cyanobacteria as microbial cell factories.</p

Crossref

Directory of Open Access Journals

PubMed Central

Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome

Author: A Diederichsen
ACJ Frijters
AH Paterson
AP Chan
Arabidopsis Genome Initiative
B Ewing
BC Meyers
C Soderlund
C Vitte
C Wu
CA Cullis
CA Cullis
CA Mathewson
CMC Bassett
CP Hong
CP Hong
CT Kelleher
CWJ Lai
D Zohary
DE Soltis
E Coe
E Datema
E Hribova
E Kvavadze
E Lerat
E Paux
E Paux
F Cheung
FM McCarthy
French-Italian Consortium for Grapevine Genome Characterization
GA Tuskan
GM Evans
HB Zhang
HBM Ali
International Brachypodium Initiative
International Rice Genome Sequencing Project
J Jurka
J Messing
J Schmutz
J Terol
J-H Mun
JA Schlueter
JA Shapiro
JC Venter
JE Frelichowski Jr
JE Stajih
JL Bennetzen
JL Shultz
L Mao
M Chen
M Delseny
M Febrer
M Marra
N Huo
P Smýkal
PB Goldsbrough
PB Goldsbrough
PF Cavagnaro
PS Schnable
Q Yu
R Ming
R Velasco
Raja Ragupathy
Rajkumar Rathinavelu
RE Pruitt
RG Schneeberger
RK Varshney
RL Warren
S Cloutier
S Cloutier
S Fenart
S Huang
S Ide
S McGinnis
S Ouyang
S Scalabrin
S Tucker
SY Rhee
Sylvie Cloutier
T Mozo
T Thiel
T Wicker
The Gene Ontology Consortium
The International Human Genome Mapping Consortium
The Universal Protein Resource Consortium
VM Gonzalez
WM Nelson
X Cheng
X Huang
Y Han
Y Han
YQ Gu
ZX Xu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Flax (<it>Linum usitatissimum </it>L.) is an important source of oil rich in omega-3 fatty acids, which have proven health benefits and utility as an industrial raw material. Flax seeds also contain lignans which are associated with reducing the risk of certain types of cancer. Its bast fibres have broad industrial applications. However, genomic tools needed for molecular breeding were non existent. Hence a project, Total Utilization Flax GENomics (TUFGEN) was initiated. We report here the first genome-wide physical map of flax and the generation and analysis of BAC-end sequences (BES) from 43,776 clones, providing initial insights into the genome. Results The physical map consists of 416 contigs spanning ~368 Mb, assembled from 32,025 fingerprints, representing roughly 54.5% to 99.4% of the estimated haploid genome (370-675 Mb). The N50 size of the contigs was estimated to be ~1,494 kb. The longest contig was ~5,562 kb comprising 437 clones. There were 96 contigs containing more than 100 clones. Approximately 54.6 Mb representing 8-14.8% of the genome was obtained from 80,337 BES. Annotation revealed that a large part of the genome consists of ribosomal DNA (~13.8%), followed by known transposable elements at 6.1%. Furthermore, ~7.4% of sequence was identified to harbour novel repeat elements. Homology searches against flax-ESTs and NCBI-ESTs suggested that ~5.6% of the transcriptome is unique to flax. A total of 4064 putative genomic SSRs were identified and are being developed as novel markers for their use in molecular breeding. Conclusion The first genome-wide physical map of flax constructed with BAC clones provides a framework for accessing target loci with economic importance for marker development and positional cloning. Analysis of the BES has provided insights into the uniqueness of the flax genome. Compared to other plant genomes, the proportion of rDNA was found to be very high whereas the proportion of known transposable elements was low. The SSRs identified from BES will be valuable in saturating existing linkage maps and for anchoring physical and genetic maps. The physical map and paired-end reads from BAC clones will also serve as scaffolds to build and validate the whole genome shotgun assembly.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Catalytic residues in hydrolases: analysis of methods designed for ligand-binding site prediction

Author: A Armon
A Bhinge
A Eichinger
A Gutteridge
A Pingoud
A Shulman-Peleg
A Stark
A Stark
A Stark
AA Bliznyuk
AC Stuart
AC Wallace
AH Elcock
AJ Chalk
ATR Laurie
ATR Laurie
B Huang
B Lee
B Zhang
C Taroni
CA Orengo
CM Seibert
CT Porter
D Pantoja-Uceda
DG Levitt
DJ Vocadlo
DT-H Chang
E Kellenberger
E Youn
FX Gomis-Rüth
G Nimrod
G Pugalenthi
GG Hammes
GJ Bartlett
GJ Kleywegt
GL Holliday
GL Holliday
GP Brady
H Yao
HM Berman
I Botos
Irena Roterman
J An
J An
J Dundas
J Liang
J Teyra
J Weigelt
J-M Chandonia
JA Barker
JM Yon
K Henrick
K Katayanagi
K Kinoshita
K Kinoshita
K Stummeyer
K Zhang
KA Snyder
Katarzyna Prymula
KP Peters
M Bryliński
M Grabowski
M Hendlich
M Jambon
M Jambon
M Kanehisa
M Landau
M Levitt
M Stahl
MA Kurowski
MJ Ondrechen
MP Liang
MR Landon
N Kallenbach
O Gileadi
O Goldenberg
O Lichtarge
O Lichtarge
P Aloy
P Baldi
P Reis
PJ Hajduk
PJ Hajduk
PP Wangikar
R Landgraf
RA Laskowski
RA Laskowski
RA Laskowski
RV Spriggs
S Madabushi
S Vajda
SE Brenner
T Fawcett
T Kortvelyesi
T Pupko
T Tadokoro
T Zhang
TA Binkowski
Tomasz Jadczyk
UniProt Consortium The Universal Protein Resource (UniProt)
V Siksnys
W Kabsch
Y Dou
Y Oda
Y Tsunaka
Y-R Tang
Publication venue: Springer Netherlands
Publication date: 01/01/2010
Field of study

The comparison of eight tools applicable to ligand-binding site prediction is presented. The methods examined cover three types of approaches: the geometrical (CASTp, PASS, Pocket-Finder), the physicochemical (Q-SiteFinder, FOD) and the knowledge-based (ConSurf, SuMo, WebFEATURE). The accuracy of predictions was measured in reference to the catalytic residues documented in the Catalytic Site Atlas. The test was performed on a set comprising selected chains of hydrolases. The results were analysed with regard to size, polarity, secondary structure, accessible solvent area of predicted sites as well as parameters commonly used in machine learning (F-measure, MCC). The relative accuracies of predictions are presented in the ROC space, allowing determination of the optimal methods by means of the ROC convex hull. Additionally the minimum expected cost analysis was performed. Both advantages and disadvantages of the eight methods are presented. Characterization of protein chains in respect to the level of difficulty in the active site prediction is introduced. The main reasons for failures are discussed. Overall, the best performance offers SuMo followed by FOD, while Pocket-Finder is the best method among the geometrical approaches

Crossref

Springer - Publisher Connector

PubMed Central

Jagiellonian Univeristy Repository

Sorting protein lists with nwCompare: A simple and fast algorithm for n

Author: Alizadeh
Bleasby
Bussey
Coˇté
Faca
Han
Huang
Huang
Kulasingam
Kumar
Li
The Universal Protein Resource (UniProt) 2009
Publication venue: 'Wiley'
Publication date
Field of study

Crossref