Search CORE

10,319 research outputs found

TRAPID : an efficient online tool for the functional and comparative analysis of de novo RNA-Seq transcriptomes

Author: Deforce Dieter
Proost Sebastian
Van Bel Michiel
Van de Peer Yves
Van Neste Christophe
Vandepoele Klaas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Transcriptome analysis through next-generation sequencing technologies allows the generation of detailed gene catalogs for non-model species, at the cost of new challenges with regards to computational requirements and bioinformatics expertise. Here, we present TRAPID, an online tool for the fast and efficient processing of assembled RNA-Seq transcriptome data, developed to mitigate these challenges. TRAPID offers high-throughput open reading frame detection, frameshift correction and includes a functional, comparative and phylogenetic toolbox, making use of 175 reference proteomes. Benchmarking and comparison against state-of-the-art transcript analysis tools reveals the efficiency and unique features of the TRAPID system

Springer - Publisher Connector

UPSpace at the University of Pretoria

MorphDB : prioritizing genes for specialized metabolism pathways and gene ontology categories in plants

Author: Amar David
Diels Tim
Shamir Ron
Tzfadia Oren
Van de Peer Yves
Van Parys Thomas
Zwaenepoel Arthur
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest

Frontiers - Publisher Connector

UPSpace at the University of Pretoria

SIFTER search: a web server for accurate phylogeny-based protein function prediction.

Author: Brenner Steven E
Luo Kevin R
Sahraeian Sayed M
Publication venue: eScholarship, University of California
Publication date: 01/01/2015
Field of study

We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Here, we introduce a user-friendly web interface for accurate protein function prediction using the SIFTER algorithm. SIFTER is a state-of-the-art sequence-based gene molecular function prediction algorithm that uses a statistical model of function evolution to incorporate annotations throughout the phylogenetic tree. Due to the resources needed by the SIFTER algorithm, running SIFTER locally is not trivial for most users, especially for large-scale problems. The SIFTER web server thus provides access to precomputed predictions on 16 863 537 proteins from 232 403 species. Users can explore SIFTER predictions with queries for proteins, species, functions, and homologs of sequences not in the precomputed prediction set. The SIFTER web server is accessible at http://sifter.berkeley.edu/ and the source code can be downloaded

CiteSeerX

eScholarship - University of California

Species-level functional profiling of metagenomes and metatranscriptomes.

Author: A Sczyrba
A Shafquat
AE Duran-Pinedo
AK Sharma
B Buchfink
B Langmead
BE Suzek
BK Swan
C Burke
C Luo
Curtis Huttenhower
D Medini
DH Huson
DT Truong
DT Truong
E Pasolli
EA Franzosa
EA Franzosa
Eric A. Franzosa
George Weingart
GG Silva
Gholamali Rahnavard
H Hauswedell
J Kim
J Lloyd-Price
J Lloyd-Price
J Ravel
J. Gregory Caporaso
JA Fuhrman
K Huang
Karen Schwarzberg Lipson
Lauren J. McIver
LR Thompson
LR Thompson
Luke R. Thompson
M Hamady
M Kanehisa
M Scholz
Melanie Schirmer
MY Galperin
N Segata
N Segata
Nicola Segata
OU Mason
P Petrenko
PJ Turnbaugh
R Caspi
RC Edgar
RD Finn
Rob Knight
S Abubucker
S Nayfach
S Sunagawa
S Sunagawa
T Bose
UniProt Consortium.
W Huang
Y Ye
Y Zhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2018
Field of study

Functional profiles of microbial communities are typically generated using comprehensive metagenomic or metatranscriptomic sequence read searches, which are time-consuming, prone to spurious mapping, and often limited to community-level quantification. We developed HUMAnN2, a tiered search strategy that enables fast, accurate, and species-resolved functional profiling of host-associated and environmental communities. HUMAnN2 identifies a community's known species, aligns reads to their pangenomes, performs translated search on unclassified reads, and finally quantifies gene families and pathways. Relative to pure translated search, HUMAnN2 is faster and produces more accurate gene family profiles. We applied HUMAnN2 to study clinal variation in marine metabolism, ecological contribution patterns among human microbiome pathways, variation in species' genomic versus transcriptional contributions, and strain profiling. Further, we introduce 'contributional diversity' to explain patterns of ecological assembly across different microbial community types

eScholarship - University of California

FSim: A Novel Functional Similarity Search Algorithm and Tool for Discovering Functionally Related Gene Products

Author
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2014
Field of study

An Introductory Guide to Aligning Networks Using SANA, the Simulated Annealing Network Aligner.

Author: A Chatr-Aryamontri
A Hasan
A Lesk
BH Junker
C Clark
C Mering Von
C Yang
CG El Van
D Davis
DM Prescott
EH Davidson
F Alkan
FE Faisal
FE Faisal
HT Phan
J Crawford
K Chen
K Mehlhorn
KI Smith
M Ashburner
M El-Kebir
M Kotlyar
M Malek
M Milano
M Vidal
MP Williamson
MR Garey
N Malod-Dognin
N Pržulj
N Yaveroğlu
O Fiehn
O Kuchaiev
O Kuchaiev
O Sporns
R Jaenicke
S Hashemifar
SA Cook
SF Altschul
SJ Larsen
T Hočevar
T Milenković
T Milenković
T Tokar
TA Farazi
V Saraph
V Vijayan
Publication venue: eScholarship, University of California
Publication date: 22/11/2019
Field of study

Sequence alignment has had an enormous impact on our understanding of biology, evolution, and disease. The alignment of biological networks holds similar promise. Biological networks generally model interactions between biomolecules such as proteins, genes, metabolites, or mRNAs. There is strong evidence that the network topology-the "structure" of the network-is correlated with the functions performed, so that network topology can be used to help predict or understand function. However, unlike sequence comparison and alignment-which is an essentially solved problem-network comparison and alignment is an NP-complete problem for which heuristic algorithms must be used.Here we introduce SANA, the Simulated Annealing Network Aligner. SANA is one of many algorithms proposed for the arena of biological network alignment. In the context of global network alignment, SANA stands out for its speed, memory efficiency, ease-of-use, and flexibility in the arena of producing alignments between two or more networks. SANA produces better alignments in minutes on a laptop than most other algorithms can produce in hours or days of CPU time on large server-class machines. We walk the user through how to use SANA for several types of biomolecular networks

arXiv.org e-Print Archive

eScholarship - University of California

Using WormBase: A Genome Biology Resource for Caenorhabditis elegans and Related Nematodes

Author: A Kalderimis
A Mitchell
AG Alexander
AJ Bretscher
AJ Vilella
C Camacho
C Trapnell
C Trapnell
D Angeles-Albores
DB Rhee
E Culetto
G Schindelman
Gene Ontology Consortium
H Li
H Motenko
I Greenwald
I Lee
I Lee
J Giacomotto
J Li
J Zheng
J-F Rual
JS Amberger
K-W Park
KL Howe
LD Stein
LM Schriml
LP O’Reilly
M Artal-Sanz
MB Gerstein
ME Skinner
OE Blacque
P Gaudet
R Balakrishnan
R Lyne
R O’Hagan
RC Edgar
RD Finn
RN Smith
RP Huntley
RP Huntley
RS Kamath
RYN Lee
S Burge
S Contrino
S Powell
S-J Lee
SF Altschul
SF Altschul
The Gene Ontology Consortium
TW Harris
W Zhong
WA Kibbe
WJ Kent
Y Nakamura
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/05/2018
Field of study

WormBase (www.wormbase.org) provides the nematode research community with a centralized database for information pertaining to nematode genes and genomes. As more nematode genome sequences are becoming available and as richer data sets are published, WormBase strives to maintain updated information, displays, and services to facilitate efficient access to and understanding of the knowledge generated by the published nematode genetics literature. This chapter aims to provide an explanation of how to use basic features of WormBase, new features, and some commonly used tools and data queries. Explanations of the curated data and step-by-step instructions of how to access the data via the WormBase website and available data mining tools are provided

Caltech Authors