Aberystwyth Research Portal

Directory of Open Access Journals

Aston Publications Explorer

Birkbeck Institutional Research Online

Kent Academic Repository

SIMBA: a web tool for managing bacterial genome assembly generated by Ion PGM sequencing technology

Author: Aguiar Edgar L.
Azevedo Vasco A. C.
Barh Debmalya
Benevides Leandro
Figueiredo Henrique C. P.
Folador Edson L.
Ghosh Preetam
Guimarães Luís C.
Mariano Diego C. B.
Oliveira Letícia C.
Pereira Felipe L.
Ramos Rommel T. J.
Silva Artur
Sousa Thiago J.
Publication venue: VCU Scholars Compass
Publication date: 01/01/2016

Background: The evolution of Next-Generation Sequencing (NGS) has considerably reduced the cost per sequenced base, allowing a significant rise in sequencing projects, mainly in prokaryotes. However, the range of available NGS platforms requires different strategies and software to assemble genomes correctly. Completing an assembly project also requires installing or modifying various software packages. Users therefore need significant expertise in these tools and command-line scripting experience on Unix platforms, in addition to basic knowledge of genome assembly methodologies and techniques. These difficulties often delay the completion of genome assembly projects.
Results: To overcome this, we developed SIMBA (SImple Manager for Bacterial Assemblies), a freely available web tool that integrates several component tools for assembling and finishing bacterial genomes. SIMBA provides a friendly and intuitive user interface so that bioinformaticians, even those with little computational expertise, can work under a centralized administrative control system of assemblies managed by the assembly center head. SIMBA guides users through the assembly process with simple, interactive pages. The SIMBA workflow is divided into three modules: (i) projects: provides an overview of genome sequencing projects, in addition to data quality analysis and data format conversions; (ii) assemblies: allows de novo assemblies with the software Mira, Minia, Newbler and SPAdes, as well as assembly quality validation using QUAST; and (iii) curation: provides methods for finishing assemblies through tools for scaffolding contigs and closing gaps. We also present a case study that validates the efficacy of SIMBA for managing bacterial assembly projects sequenced using Ion Torrent PGM.
Conclusion: Besides being a web tool for genome assembly, SIMBA is a complete genome assembly project management system, which can be useful for managing several projects in laboratories. SIMBA source code is available to download and install on local web servers at http://ufmg-simba.sourceforge.net

VCU Scholars Compass

On the hierarchical classification of G Protein-Coupled Receptors

Author: A. A. Freitas
A. Secker
Attwood
Bhasin
Bissantz
Cardoso
Christopoulos
D. R. Flower
Das
Davies
Flower
Foord
Gether
Gloriam
Guo
Horn
Hébert
J. Timmis
Karchin
Keerthi
Klabunde
Kolakowski
Lapinsh
M. Mendao
M. N. Davies
Milligan
Papasaikas
Prabhu
Sandberg
Schiöth
Publication venue: Oxford University Press (OUP)
Publication date: 22/10/2007

Motivation: G protein-coupled receptors (GPCRs) play an important role in many physiological systems by transducing an extracellular signal into an intracellular response. Over 50% of all marketed drugs are targeted towards a GPCR. There is considerable interest in developing an algorithm that could effectively predict the function of a GPCR from its primary sequence. Such an algorithm is useful not only in identifying novel GPCR sequences but also in characterizing the interrelationships between known GPCRs.
Results: An alignment-free approach to GPCR classification has been developed using techniques drawn from data mining and proteochemometrics. A dataset of over 8000 sequences was constructed to train the algorithm. This represents one of the largest GPCR datasets currently available. A predictive algorithm was developed based upon the simplest reasonable numerical representation of the protein's physicochemical properties. A selective top-down approach was developed, which used a hierarchical classifier to assign sequences to subdivisions within the GPCR hierarchy. The predictive performance of the algorithm was assessed against several standard data mining classifiers and further validated against Support Vector Machine-based GPCR prediction servers. The selective top-down approach achieves significantly higher accuracy than standard data mining methods in almost all cases.
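
The top-down idea can be sketched as follows: starting at the root of the class hierarchy, a local classifier at each node picks one child, and the sequence descends until it reaches a leaf. This is a minimal sketch of a generic top-down classifier, not the paper's selective variant or its feature representation; the `Node` class, the toy hierarchy, and the rule-based stand-in classifiers are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Node:
    """One class in the hierarchy, with a local classifier over its children."""
    name: str
    children: Dict[str, "Node"] = field(default_factory=dict)
    # Hypothetical local classifier: maps a feature vector to a child name.
    choose_child: Optional[Callable[[List[float]], str]] = None


def classify_top_down(root: Node, features: List[float]) -> List[str]:
    """Return the path of class names from the root down to a leaf."""
    path = [root.name]
    node = root
    while node.children:
        if node.choose_child is None:
            raise ValueError(f"node {node.name!r} has children but no classifier")
        node = node.children[node.choose_child(features)]
        path.append(node.name)
    return path


# Toy two-level hierarchy with threshold rules standing in for trained models.
family_a = Node(
    "family_A",
    children={"sub_A1": Node("sub_A1"), "sub_A2": Node("sub_A2")},
    choose_child=lambda x: "sub_A1" if x[1] < 0.5 else "sub_A2",
)
root = Node(
    "root",
    children={"family_A": family_a, "family_B": Node("family_B")},
    choose_child=lambda x: "family_A" if x[0] < 0.5 else "family_B",
)

example_path = classify_top_down(root, [0.2, 0.9])
```

In a selective scheme, the model stored at each node could be chosen per node, for example by keeping whichever candidate validates best there; that selection step is not shown here.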

CiteSeerX

Aberystwyth Research Portal

Kent Academic Repository

Evolution of Genes Neighborhood Within Reconciled Phylogenies: An Ensemble Approach

Author: J.-P. Doyon
L. Pachter
M. Csűrös
M.S. Bansal
R. Libeskind-Hadas
S. Bérard
Y. Ponty
Publication date: 01/01/2014

Context: The reconstruction of evolutionary scenarios for whole genomes in terms of genome rearrangements is a fundamental problem in evolutionary and comparative genomics. The DeCo algorithm, recently introduced by Bérard et al., computes parsimonious evolutionary scenarios for gene adjacencies from pairs of reconciled gene trees. However, as for many combinatorial optimization algorithms, there can exist many co-optimal, or slightly sub-optimal, evolutionary scenarios that deserve to be considered.
Contribution: We extend the DeCo algorithm to sample evolutionary scenarios from the whole solution space under the Boltzmann distribution, and also to compute Boltzmann probabilities for specific ancestral adjacencies.
Results: We apply our algorithms to a dataset of mammalian gene trees and adjacencies, and observe a significant reduction in the number of syntenic conflicts in the resulting ancestral gene adjacencies.
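
The Boltzmann weighting mentioned in the abstract can be illustrated on a toy scale. The sketch below is not the DeCo extension itself (which, per the abstract, works over the whole solution space and would not enumerate scenarios one by one); it only shows how a list of scenario costs maps to Boltzmann probabilities and how a scenario could be sampled from them. The function names, the example costs, and the temperature parameter `kT` are assumptions made for illustration.

```python
import math
import random


def boltzmann_probabilities(costs, kT=1.0):
    """Map scenario costs to probabilities proportional to exp(-cost / kT)."""
    if kT <= 0:
        raise ValueError("kT must be positive")
    # Subtract the minimum cost before exponentiating for numerical stability;
    # this does not change the normalized probabilities.
    c_min = min(costs)
    weights = [math.exp(-(c - c_min) / kT) for c in costs]
    z = sum(weights)  # partition function, up to the constant factored out above
    return [w / z for w in weights]


def sample_scenario(costs, kT=1.0, rng=random):
    """Draw the index of one scenario according to its Boltzmann probability."""
    probs = boltzmann_probabilities(costs, kT)
    return rng.choices(range(len(costs)), weights=probs, k=1)[0]


# Hypothetical costs: two co-optimal scenarios and one slightly sub-optimal one.
example_costs = [3.0, 3.0, 4.0]
example_probs = boltzmann_probabilities(example_costs, kT=1.0)
```

Lowering `kT` concentrates the probability on the co-optimal scenarios, while raising it spreads more mass onto sub-optimal ones.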

INRIA a CCSD electronic archive server

Simon Fraser University Institutional Repository

arXiv.org e-Print Archive

HAL-Polytechnique

Hyperbolic Interaction Model For Hierarchical Multi-Label Classification

Author: Cai Zixin
Chen Boli
Huang Xin
Jing Liping
Xiao Lin
Publication date: 04/09/2019

Unlike traditional classification tasks, which assume mutual exclusion of labels, hierarchical multi-label classification (HMLC) aims to assign multiple labels to every instance, with the labels organized under hierarchical relations. Besides the labels, since linguistic ontologies are intrinsic hierarchies, the conceptual relations between words can also form hierarchical structures. Thus it can be a challenge to learn mappings from word hierarchies to label hierarchies. We propose to model the word and label hierarchies by embedding them jointly in hyperbolic space. The main reason is that the tree-likeness of hyperbolic space matches the complexity of symbolic data with hierarchical structures. A new Hyperbolic Interaction Model (HyperIM) is designed to learn label-aware document representations and make predictions for HMLC. Extensive experiments are conducted on three benchmark datasets. The results demonstrate that the new model can realistically capture the complex data structures and further improve performance for HMLC compared with state-of-the-art methods. To facilitate future research, our code is publicly available.
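
As a rough illustration of why hyperbolic space suits tree-like data, the sketch below computes distances in the Poincaré ball, one common model of hyperbolic space. The abstract does not say which model HyperIM uses, so the choice of model, the function name, and the sample points are assumptions; this is not the authors' code.

```python
import math


def poincare_distance(u, v):
    """Geodesic distance between two points inside the unit Poincaré ball."""
    sq_norm_u = sum(x * x for x in u)
    sq_norm_v = sum(x * x for x in v)
    if sq_norm_u >= 1.0 or sq_norm_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    # d(u, v) = arcosh(1 + 2 * |u - v|^2 / ((1 - |u|^2) * (1 - |v|^2)))
    arg = 1.0 + 2.0 * sq_diff / ((1.0 - sq_norm_u) * (1.0 - sq_norm_v))
    return math.acosh(arg)


# Hypothetical sample points at increasing Euclidean distance from the origin.
origin = (0.0, 0.0)
near_center = (0.5, 0.0)
near_boundary = (0.95, 0.0)

d_center = poincare_distance(origin, near_center)
d_boundary = poincare_distance(origin, near_boundary)
```

Equal Euclidean steps cost more hyperbolic distance near the boundary, which leaves room for the rapidly growing number of nodes at deeper levels of a hierarchy.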

Association for the Advancement of Artificial Intelligence: AAAI Publications

An integrated database of Eucalyptus spp. genome project

Author: A Bateman
BE Suzek
C Baudet
C Trapnell
Danieli Cristina Gonçalves
E Mizrachi
Eduardo Leal Oliveira Camargo
Gonçalo Amarante Guimarães Pereira
Jorge Lepikson Neto
L Wang
LB Koski
Leandro Costa Nascimento
M Ashburner
M Kanehisa
Marcela Mendes Salaza
Marcelo Falsarella Carazzolle
R Li
Ramon Oliveira Vidal
S Audic
SF Altschul
Wesley Leoricy Marques
X Huang
Publication venue: BioMed Central
Publication date: 01/01/2011

Structural studies on molecular mechanisms of Nelfinavir resistance caused by non-active site mutation V77I in HIV-1 protease

Author: Abhinav Grover
Ankita Gupta
Divya Wahi
Ritu Jain
Salma Jamal
Sukriti Goyal
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Algebraic Dynamic Programming over general data structures

Author: B Voß
C Höner zu Siederdissen
C McBride
Christian Höner zu Siederdissen
CM Reidys
FWD Huang
G Sauthoff
J Garcia-Fernàndez
JK Baker
JS McCaskill
LR Rabiner
M Held
M Riechert
O Elemento
O Gotoh
P Billie
Peter F Stadler
R Bellman
R Durbin
R Giegerich
R Lorenz
RA Cameron
RD Dowell
S Janssen
S Wuchty
SJ Prohaska
Sonja J Prohaska
WS Robinson
Publication venue: Springer Science and Business Media LLC