Search CORE

Tilburg University Repository

Proteinortho: Detection of (Co-)orthologs in large-scale analysis

Author: A Alexeyenko
A Force
A Nakabachi
A Schneider
AE Hirsh
AJ Enright
C Lanczos
D Cornaz
DM Kristensen
E Pruesse
EV Koonin
IK Jordan
J Hopcroft
JP McCutcheon
L Li
Lydia Steiner
M Fiedler
M Fiedler
M Remm
M Sikdar
Manja Marz
Marcus Lechner
MC Rivera
P Bork
Peter F Stadler
RL Tatusov
S Guattery
SM van Dongen
Sonja J Prohaska
Sven Findeiß
TJ Hubbard
WM Fitch
Z Fu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases. Results The program <monospace>Proteinortho</monospace> described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply <monospace>Proteinortho</monospace> to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes. Conclusions <monospace>Proteinortho</monospace> significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.</p

Fraunhofer-ePrints

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

clusterMaker: a multi-algorithm clustering plugin for Cytoscape

Abstract Background In the post-genomic era, the rapid increase in high-throughput data calls for computational tools capable of integrating data of diverse types and facilitating recognition of biologically meaningful patterns within them. For example, protein-protein interaction data sets have been clustered to identify stable complexes, but scientists lack easily accessible tools to facilitate combined analyses of multiple data sets from different types of experiments. Here we present <it>clusterMaker</it>, a Cytoscape plugin that implements several clustering algorithms and provides network, dendrogram, and heat map views of the results. The Cytoscape network is linked to all of the other views, so that a selection in one is immediately reflected in the others. <it>clusterMaker </it>is the first Cytoscape plugin to implement such a wide variety of clustering algorithms and visualizations, including the only implementations of hierarchical clustering, dendrogram plus heat map visualization (tree view), k-means, k-medoid, SCPS, AutoSOME, and native (Java) MCL. Results Results are presented in the form of three scenarios of use: analysis of protein expression data using a recently published mouse interactome and a mouse microarray data set of nearly one hundred diverse cell/tissue types; the identification of protein complexes in the yeast <it>Saccharomyces cerevisiae</it>; and the cluster analysis of the vicinal oxygen chelate (VOC) enzyme superfamily. For scenario one, we explore functionally enriched mouse interactomes specific to particular cellular phenotypes and apply fuzzy clustering. For scenario two, we explore the prefoldin complex in detail using both physical and genetic interaction clusters. For scenario three, we explore the possible annotation of a protein as a methylmalonyl-CoA epimerase within the VOC superfamily. Cytoscape session files for all three scenarios are provided in the Additional Files section. Conclusions The Cytoscape plugin <it>clusterMaker </it>provides a number of clustering algorithms and visualizations that can be used independently or in combination for analysis and visualization of biological data sets, and for confirming or generating hypotheses about biological function. Several of these visualizations and algorithms are only available to Cytoscape users through the <it>clusterMaker </it>plugin. <it>clusterMaker </it>is available via the Cytoscape plugin manager.</p

University of Toronto Research Repository

eScholarship - University of California

MPG.PuRe

Deep Blue Documents at the University of Michigan

Comparison of the protein-coding genomes of three deep-sea, sulfur-oxidising bacteria: “Candidatus Ruthia magnifica”, “Candidatus Vesicomyosocius okutanii” and Thiomicrospira crunogena

Author: CA Darby
D Barker
Daniel Barker
DM Emms
DY Wu
EV Koonin
F Cunningham
FA Kondrashov
FJ Stewart
G Coutts
H Kuwahara
ILG Newton
ILG Newton
JF Robson
K Endow
KM Wreggelsworth
KP Williams
KS Johnson
L Li
N Latysheva
R Jack
RG Murray
RW Nowell
S Dongen van
S Isabel
SC Cary
SF Altschul
SP Lapage
Susan E. McGill
The UniProt Consortium
WC Lathe
WD Swingley
WM Fitch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2017
Field of study

Abstract Objective “ Candidatus Ruthia magnifica”, “Candidatus Vesicomyosocius okutanii” and Thiomicrospira crunogena are all sulfur-oxidising bacteria found in deep-sea vent environments. Recent research suggests that the two symbiotic organisms, “Candidatus R. magnifica” and “Candidatus V. okutanii”, may share common ancestry with the autonomously living species T. crunogena. We used comparative genomics to examine the genome-wide protein-coding content of all three species to explore their similarities. In particular, we used the OrthoMCL algorithm to sort proteins into groups of putative orthologs on the basis of sequence similarity. Results The OrthoMCL inflation parameter was tuned using biological criteria. Using the tuned value, OrthoMCL delimited 1070 protein groups. 63.5% of these groups contained one protein from each species. Two groups contained duplicate protein copies from all three species. 123 groups were unique to T. crunogena and ten groups included multiple copies of T. crunogena proteins but only single copies from the other species. “Candidatus R. magnifica” had one unique group, and had multiple copies in one group where the other species had a single copy. There were no groups unique to “Candidatus V. okutanii”, and no groups in which there were multiple “Candidatus V. okutanii” proteins but only single proteins from the other species. Results align with previous suggestions that all three species share a common ancestor. However this is not definitive evidence to make taxonomic conclusions and the possibility of horizontal gene transfer was not investigated. Methodologically, the tuning of the OrthoMCL inflation parameter using biological criteria provides further methods to refine the OrthoMCL procedure

Edinburgh Research Explorer

CCR6+ Th cell populations distinguish ACPA positive from ACPA negative rheumatoid arthritis

BranchClust: a phylogenetic algorithm for selecting gene families

Author: AG Murzin
AJ Enright
AP Vogler
C Winstanley
CM Zmasek
DF Feng
DL Fulton
E Hilario
EL Sonnhammer
EV Koonin
F Chevenet
G Perriere
H Ochman
HA Schmidt
J Felsenstein
J Peter Gogarten
J Raymond
JA Lake
JD Thompson
JP Gogarten
JP Gogarten
JP Gogarten
K Oshima
KP O'Brien
L Olendzenski
LB Koski
M Remm
Maria S Poptsova
MG Montague
N Saitou
O Zhaxybayeva
O Zhaxybayeva
O Zhaxybayeva
P Lapierre
RC Edgar
RL Charlebois
RL Tatusov
S Guindon
S Tsutsumi
S van Dongen
SF Altschul
SF Altschul
SR Eddy
T Dagan
TJ Harlow
U Dobrindt
WM Fitch
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Automated methods for assembling families of orthologous genes include those based on sequence similarity scores and those based on phylogenetic approaches. The first are easy to automate but usually they do not distinguish between paralogs and orthologs or have restriction on the number of taxa. Phylogenetic methods often are based on reconciliation of a gene tree with a known rooted species tree; a limitation of this approach, especially in case of prokaryotes, is that the species tree is often unknown, and that from the analyses of single gene families the branching order between related organisms frequently is unresolved. RESULTS: Here we describe an algorithm for the automated selection of orthologous genes that recognizes orthologous genes from different species in a phylogenetic tree for any number of taxa. The algorithm is capable of distinguishing complete (containing all taxa) and incomplete (not containing all taxa) families and recognizes in- and outparalogs. The BranchClust algorithm is implemented in Perl with the use of the BioPerl module for parsing trees and is freely available at . CONCLUSION: BranchClust outperforms the Reciprocal Best Blast hit method in selecting more sets of putatively orthologous genes. In the test cases examined, the correctness of the selected families and of the identified in- and outparalogs was confirmed by inspection of the pertinent phylogenetic trees

What Twin Studies Tell Us About the Heritability of Brain Development, Morphology, and Function: A Review

MicroRNA-mediated gene regulation plays a minor role in the transcriptomic plasticity of cold-acclimated Zebrafish brain tissue

De novo single-nucleotide and copy number variation in discordant monozygotic twins reveals disease-related genes

Author: A Al-Chalabi
A Cecchinato
A Kong
A McKenna
Alan Pittman
B Bertelsen
BS Petersen
C Lavedan
CD Campbell
Charles Lee
Chengsheng Zhang
D Freed
D Mataix-Cols
D Nickles
D Vitucci
Deborah Hughes
DF Levinson
E Colvert
EA Ehli
EHM Wong
Eliza Cerveira
Elliott Rees
EV Davydov
F Antonacci
F Magne
G Kuhlenbäumer
George Kirov
GM Dal
H Higashida
IA Adzhubei
J Chen
J Dongen van
J Fallon
J Tang
Jamal Nasir
JB Potash
JM Schwarz
John Hardy
K Meltz Steinberg
K Ohi
K Wang
K Wang
Kerra Pearce
L Cai
L Vadlamudi
L Yuan
LC Francioli
M Florio
Mark Kristiansen
ME Ketelaar
Michael Simpson
MJ Lindhurst
MY Dennis
Niranjanan Nirmalananthan
Nirmal Vadgama
P Kumar
Peter De Rijk
Qihui Zhu
R Acuna-Hidalgo
R Hashimoto
R Hilker
R Pamphlett
Robin Murray
RP Ebstein
S Akbarian
S Beicht
S Petrovski
S Schuster
SE Baranzini
SP Robertson
Takeo Yoshikawa
Tomas Fitzgerald
V Labrie
YL Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Recent studies have demonstrated genetic differences between monozygotic (MZ) twins. To test the hypothesis that early post-twinning mutational events associate with phenotypic discordance, we investigated a cohort of 13 twin pairs (n = 26) discordant for various clinical phenotypes using whole-exome sequencing and screened for copy number variation (CNV). We identified a de novo variant in PLCB1, a gene involved in the hydrolysis of lipid phosphorus in milk from dairy cows, associated with lactase non-persistence, and a variant in the mitochondrial complex I gene MT-ND5 associated with amyotrophic lateral sclerosis (ALS). We also found somatic variants in multiple genes (TMEM225B, KBTBD3, TUBGCP4, TFIP11) in another MZ twin pair discordant for ALS. Based on the assumption that discordance between twins could be explained by a common variant with variable penetrance or expressivity, we screened the twin samples for known pathogenic variants that are shared and identified a rare deletion overlapping ARHGAP11B, in the twin pair manifesting with either schizotypal personality disorder or schizophrenia. Parent-offspring trio analysis was implemented for two twin pairs to assess potential association of variants of parental origin with susceptibility to disease. We identified a de novo variant in RASD2 shared by 8-year-old male twins with a suspected diagnosis of autism spectrum disorder (ASD) manifesting as different traits. A de novo CNV duplication was also identified in these twins overlapping CD38, a gene previously implicated in ASD. In twins discordant for Tourette's syndrome, a paternally inherited stop loss variant was detected in AADAC, a known candidate gene for the disorder

Online Research @ Cardiff

The Jackson Laboratory: The Mouseion at the JAXlibrary

University of Northampton's Research Explorer

UCL Discovery

Institutional Repository Universiteit Antwerpen

King's Research Portal

St George's Online Research Archive

NECTAR

Phylogenetic and functional marker genes to study ammonia-oxidizing microorganisms (AOM) in the environment

Author: A Aakra
A Galan
A Gieseke
A Gieseke
A Hooper
A Mulder
A Pommerening-Roeser
A Princic
A Teske
AA Graaf van de
AC Layton
AE McCaig
AF El Sheikh
AH Treusch
AJ Holmes
AO Sliekers
B Kartal
B Kartal
B Thamdrup
BB Ward
BB Ward
BK Mobarry
C Dorador
C Hellinga
C Wuchter
CA Francis
CD Sinigalliano
CJ Phillips
CJ Schubert
CR Penton
Cristina Dorador
CS Schmidt
D Woebken
D Woebken
DG Capone
DJ Arp
DJ Arp
DJ Arp
DJ Bergmann
E Broda
EF DeLong
EF DeLong
EV Lebedeva
G Webster
GA Kowalchuk
GA Kowalchuk
GD O’Mullan
GF Wells
GW Nicol
GW Nicol
H Bothe
H Hirayama
H Jiang
H McTavish
H Tamegai
H Urakawa
HD Park
HJ Op den Camp
HM Dionisi
HM Dionisi
HP Koops
I Schmidt
IM Head
J Rijn van
J Stephen
J-C Auguet
J-H Rotthauwe
JA Fuhrman
JB Utaker
JC Auguet
JC Venter
JI Prosser
JI Prosser
JM Beman
JM Norton
JM Norton
Johannes F. Imhoff
JR Stephen
JR Torre de la
JT Hollibaugh
JW Mulder
K Koop-Jakobsen
KA Third
KA Third
Karl-Paul Witzel
KL Casciotti
KL Casciotti
L Calvo
LJ Reigstad
LY Stein
M Könneke
M Schmid
M Shimamura
M Strous
M Strous
M Tourna
M Wagner
M Wagner
MA Moran
MA Voytek
MB Karner
MC Schmid
MC Schmid
MG Klotz
MG Klotz
MH Nicolaisen
MM Kuypers
MM Kuypers
MS Jetten
MS Jetten
N Bano
N Byrne
N Igarashi
N Ward
NN Perreault
O-S Kim
Ok-Sun Kim
Ora Hadas
OS Kim
P Chain
P Junier
P Junier
P Junier
P Lam
P Lam
PC Burrell
Pilar Junier
PW Wielen van der
R Conrad
R Hatzenpichler
RA Nugroho
RC Edgar
RC Hastings
RD Jones
RD Jones
S Avrahami
S Guindon
S Hallin
S Juretschko
S Leininger
S Siripong
S Wickramasinghe
S-J Park
SA Carini
SAQ Burton
SC Nold
SJ Hallam
SJ Hallam
T Dalsgaard
T Hoshino
T Khin
T Ochsenreiter
T Zhang
TE Freitag
TH Erguder
Thomas Junier
TJ Mincer
U Dongen van
U Purkhold
U Purkhold
V Molina
Verónica Molina
WD Hiorns
WR Star van der
WWJM Vet de
Y Okano
Y Tal
YH Ahn
Z Jia
Publication venue: Springer-Verlag
Publication date: 01/01/2009
Field of study

The oxidation of ammonia plays a significant role in the transformation of fixed nitrogen in the global nitrogen cycle. Autotrophic ammonia oxidation is known in three groups of microorganisms. Aerobic ammonia-oxidizing bacteria and archaea convert ammonia into nitrite during nitrification. Anaerobic ammonia-oxidizing bacteria (anammox) oxidize ammonia using nitrite as electron acceptor and producing atmospheric dinitrogen. The isolation and cultivation of all three groups in the laboratory are quite problematic due to their slow growth rates, poor growth yields, unpredictable lag phases, and sensitivity to certain organic compounds. Culture-independent approaches have contributed importantly to our understanding of the diversity and distribution of these microorganisms in the environment. In this review, we present an overview of approaches that have been used for the molecular study of ammonia oxidizers and discuss their application in different environments

OceanRep

Infoscience - École polytechnique fédérale de Lausanne