Search CORE

Scholarship@Western

Erasmus University Digital Repository

Two-Stage Clustering (TSC): A Pipeline for Selecting Operational Taxonomic Units for the High-Throughput Sequencing of PCR Amplicons

Author: C Quince
C Quince
C Zhang
EK Costello
Fei Zou
GB Gloor
Hai Zhang
Hong-Wei Zhou
Hua-Fang Sheng
HW Zhou
J Reeder
Jack Anthony Gilbert
JG Caporaso
MJ Claesson
ML Sogin
N Larsen
PD Schloss
PE Galand
PJ Turnbaugh
RC Edgar
RC Edgar
SM Huse
V Kunin
V Lazarevic
Xiao-Tao Jiang
Y Cai
Y Sun
Y Sun
Yan He
Yu Wang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Clustering 16S/18S rRNA amplicon sequences into operational taxonomic units (OTUs) is a critical step for the bioinformatic analysis of microbial diversity. Here, we report a pipeline for selecting OTUs with a relatively low computational demand and a high degree of accuracy. This pipeline is referred to as two-stage clustering (TSC) because it divides tags into two groups according to their abundance and clusters them sequentially. The more abundant group is clustered using a hierarchical algorithm similar to that in ESPRIT, which has a high degree of accuracy but is computationally costly for large datasets. The rarer group, which includes the majority of tags, is then heuristically clustered to improve efficiency. To further improve the computational efficiency and accuracy, two preclustering steps are implemented. To maintain clustering accuracy, all tags are grouped into an OTU depending on their pairwise Needleman-Wunsch distance. This method not only improved the computational efficiency but also mitigated the spurious OTU estimation from ‘noise’ sequences. In addition, OTUs clustered using TSC showed comparable or improved performance in beta-diversity comparisons compared to existing OTU selection methods. This study suggests that the distribution of sequencing datasets is a useful property for improving the computational efficiency and increasing the clustering accuracy of the high-throughput sequencing of PCR amplicons. The software and user guide are freely available at http://hwzhoulab.smu.edu.cn/paperdata/

CiteSeerX

Plymouth Electronic Archive and Research Library

FigShare

Photosynthetic quantum efficiency in south‐eastern Amazonian trees may be already affected by climate change

Author: Ashley D
Borges CS
Béu RG
Da Cruz WJA
Da Cunha M
da Silva ELS
de Oliveira CHL
de Souza IA
de Souza IA
de Souza LJ
dos Santos Prestes NCC
Fauset S
Ferreira LDS
Foyer CH
Galbraith D
Gloor E
Gonçalves MDA
Krause HG
Lopes TT
Marimon‐Junior BH
Marques EQ
Mendonça NG
Mendonça NG
Noleto PT
Oliveira MA
Pireda S
Reis SM
Santos DM
Santos EB
Schwantes Marimon B
Slot M
Tiwari R
Vitória AP
Winter K
Publication venue: 'Wiley'
Publication date: 27/04/2020
Field of study

Tropical forests are experiencing unprecedented high‐temperature conditions due to climate change that could limit their photosynthetic functions. We studied the high‐temperature sensitivity of photosynthesis in a rainforest site in southern Amazonia, where some of the highest temperatures and most rapid warming in the Tropics have been recorded. The quantum yield (F v /F m ) of photosystem II was measured in seven dominant tree species using leaf discs exposed to varying levels of heat stress. T 50 was calculated as the temperature at which F v /F m was half the maximum value. T 5 is defined as the breakpoint temperature, at which F v /F m decline was initiated. Leaf thermotolerance in the rapidly warming southern Amazonia was the highest recorded for forest tree species globally. T 50 and T 5 varied between species, with one mid‐storey species, Amaioua guianensis , exhibiting particularly high T 50 and T 5 values. While the T 50 values of the species sampled were several degrees above the maximum air temperatures experienced in southern Amazonia, the T 5 values of several species are now exceeded under present‐day maximum air temperatures

White Rose Research Online

Slip-Sliding Away: Serial Changes and Homoplasy in Repeat Number in the Drosophila yakuba Homolog of Human Cancer Susceptibility Gene BRCA2

Author: A Carmon
A Llopart
A Tutt
AG Clark
AR Venkitaraman
CV Barnwell
DL Swofford
DR Kelly
E Buschiazzo
F Tajima
F Xia
G Nagaraju
GB Gloor
J Rozas
JA Coyne
John M. Mercer
K Gundmundsdottir
K Thornton
KN Nathanson
L Pellegrini
L Pellegrini
M Klovstad
M Warren
ME Moynahan
MK Shivji
Mohamed A. F. Noor
N Galtier
P Bork
R Brough
RD Finn
Sarah M. Bennett
SM Bennett
T Lo
TA Hall
William J. Murphy
Y Miki
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Several recent studies have examined the function and evolution of a Drosophila homolog to the human breast cancer susceptibility gene BRCA2, named dmbrca2. We previously identified what appeared to be a recent expansion in the RAD51-binding BRC-repeat array in the ancestor of Drosophila yakuba. In this study, we examine patterns of variation and evolution of the dmbrca2 BRC-repeat array within D. yakuba and its close relatives. We develop a model of how unequal crossing over may have produced the expanded form, but we also observe short repeat forms, typical of other species in the D. melanogaster group, segregating within D. yakuba and D. santomea. These short forms do not appear to be identical-by-descent, suggesting that the history of dmbrca2 in the D. melanogaster subgroup has involved repeat unit contractions resulting in homoplasious forms. We conclude that the evolutionary history of dmbrca2 in D. yakuba and perhaps in other Drosophila species may be more complicated than can be inferred from examination of the published single genome sequences per species

DukeSpace

EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data

Author: A Engelbrektson
A Stamatakis
AP Dempster
B Baker
B Langmead
BE Dutilh
BJ Baker
Brett J Baker
Brian C Thomas
C Palmer
CA Lozupone
CB Do
Christopher S Miller
DR Zerbino
DS Goltsman
E Pruesse
EL Brodie
GB Gloor
GJ Dick
GW Tyson
H Li
H Li
H-W Zhou
I Kozarewa
I Letunic
I Lo
JG Caporaso
Jillian F Banfield
JL Morgan
JR Cole
L Dethlefsen
ML Sogin
MN Price
MT Suzuki
N Fierer
NR Pace
PJ Turnbaugh
PL Bond
Q Wang
R Li
RC Edgar
RC Edgar
S Kumar
SF Altschul
SG Tringe
SM Huse
SM Huse
Steven W Singer
TC Hazen
TD Otto
TZ DeSantis
V Farrelly
V Kunin
V Lazarevic
VJ Denef
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Recovery of ribosomal small subunit genes by assembly of short read community DNA sequence data generally fails, making taxonomic characterization difficult. Here, we solve this problem with a novel iterative method, based on the expectation maximization algorithm, that reconstructs full-length small subunit gene sequences and provides estimates of relative taxon abundances. We apply the method to natural and simulated microbial communities, and correctly recover community structure from known and previously unreported rRNA gene sequences. An implementation of the method is freely available at https://github.com/csmiller/EMIRGE

eScholarship - University of California

Deep Sequencing of the Vaginal Microbiota of Women with HIV

Author: A Chao
Andrew D. Fernandes
BE Sha
C Farquhar
CC Wang
CJ Krebs
D Wilkie
DN Fredricks
DR Smith
E Kretschmann
GE Noether
Gregor Reid
Gregory B. Gloor
GT Spear
H Jousimies-Somer
HA David
J Oksanen
J Ravel
Jean M. Macklaim
John Changalucha
K Baisley
L Myer
MA Antonio
R Amsel
R Tamrakar
RC Edgar
RC Martinez
RE Kass
RK Colwell
RP Nugent
Ruben Hummelen
Russell J. Dickson
S Cu-Uvin
S Kullback
S Srinivasan
SC Payne
SE Msuya
SK Lai
SL Hillier
SM Huse
Stefan Bereswill
TE Taha
TJ O'Connor
X Zhou
Publication venue: Public Library of Science
Publication date: 01/08/2010
Field of study

BACKGROUND: Women living with HIV and co-infected with bacterial vaginosis (BV) are at higher risk for transmitting HIV to a partner or newborn. It is poorly understood which bacterial communities constitute BV or the normal vaginal microbiota among this population and how the microbiota associated with BV responds to antibiotic treatment. METHODS AND FINDINGS: The vaginal microbiota of 132 HIV positive Tanzanian women, including 39 who received metronidazole treatment for BV, were profiled using Illumina to sequence the V6 region of the 16S rRNA gene. Of note, Gardnerella vaginalis and Lactobacillus iners were detected in each sample constituting core members of the vaginal microbiota. Eight major clusters were detected with relatively uniform microbiota compositions. Two clusters dominated by L. iners or L. crispatus were strongly associated with a normal microbiota. The L. crispatus dominated microbiota were associated with low pH, but when L. crispatus was not present, a large fraction of L. iners was required to predict a low pH. Four clusters were strongly associated with BV, and were dominated by Prevotella bivia, Lachnospiraceae, or a mixture of different species. Metronidazole treatment reduced the microbial diversity and perturbed the BV-associated microbiota, but rarely resulted in the establishment of a lactobacilli-dominated microbiota. CONCLUSIONS: Illumina based microbial profiling enabled high though-put analyses of microbial samples at a high phylogenetic resolution. The vaginal microbiota among women living with HIV in Sub-Saharan Africa constitutes several profiles associated with a normal microbiota or BV. Recurrence of BV frequently constitutes a different BV-associated profile than before antibiotic treatment

Scholarship@Western

LJMU Research Online (Liverpool John Moores University)

Coevolution of amino acid residues in the key photosynthetic enzyme Rubisco

Author: A Wagner
AR Portis
CH Yeang
DB Jordan
F Glaser
F Pazos
GB Gloor
Group TAP
I Andersson
IN Shindyalov
J Dutheil
J Dutheil
J Dutheil
J Galmés
JY Dutheil
KE Ridout
L McIntosh
L Van Valen
L Van Valen
M Anisimova
MA Fares
Maria Anisimova
Maxim V Kapralov
MC Saraf
Mingcong Wang
MV Kapralov
MV Kapralov
O Mueller-Cajar
PA Christin
RJ Ellis
RJ Spreitzer
S Guindon
S Guindon
S Guindon
S Whelan
SA Smith
SM Whitney
SQ Le
W Humphrey
Z Yang
Z Yang
Z Yang
Z Yang
ZO Wang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background One of the key forces shaping proteins is coevolution of amino acid residues. Knowing which residues coevolve in a particular protein may facilitate our understanding of protein evolution, structure and function, and help to identify substitutions that may lead to desired changes in enzyme kinetics. Rubisco, the most abundant enzyme in biosphere, plays an essential role in the process of carbon fixation through photosynthesis, thus facilitating life on Earth. This makes Rubisco an important model system for studying the dynamics of protein fitness optimization on the evolutionary landscape. In this study we investigated the selective and coevolutionary forces acting on large subunit of land plants Rubisco using Markov models of codon substitution and clustering approaches applied to amino acid substitution histories. Results We found that both selection and coevolution shape Rubisco, and that positively selected and coevolving residues have their specifically favored amino acid composition and pairing preference. The mapping of these residues on the known Rubisco tertiary structures showed that the coevolving residues tend to be in closer proximity with each other compared to the background, while positively selected residues tend to be further away from each other. This study also reveals that the residues under positive selection or coevolutionary force are located within functionally important regions and that some residues are targets of both positive selection and coevolution at the same time. Conclusion Our results demonstrate that coevolution of residues is common in Rubisco of land plants and that there is an overlap between coevolving and positively selected residues. Knowledge of which Rubisco residues are coevolving and positively selected could be used for further work on structural modeling and identification of substitutions that may be changed in order to improve efficiency of this important enzyme in crops.</p

Repository for Publications and Research Data

The Australian National University

Cellular injury and neuroinflammation in children with chronic intractable epilepsy

Author: A Andrioli
A Crespel
A Vezzani
A Vezzani
A Vezzani
AJ Bruce
Arthur DiPatri
C Dube
CA Gurnett
CK Petito
D Giulian
DC Henshall
DG Fujikawa
Douglas R Nordli
E Aronica
E Avignone
GF Tian
GL Holmes
J Choi
J Peltola
J Peltola
Jieun Choi
JL Jankowsky
JL Ridet
Joshua Rosenow
K Boer
K Borges
K Kanemoto
KC Somera-Molina
Kent Kelley
Linda Laux
M Maldonado
M Rizzi
M Thom
N Marchi
NP Turrin
P Gloor
R D'Ambrosio
R Kotloski
S Haspolat
S Koh
S Tilleux
S Wang
SM Allan
Sookyong Koh
Stephan U Schuele
T Ravizza
T Ravizza
TG Beach
Tord D Alden
TP Sutula
Veena Rajaram
WJ Streit
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Objective To elucidate the presence and potential involvement of brain inflammation and cell death in neurological morbidity and intractable seizures in childhood epilepsy, we quantified cell death, astrocyte proliferation, microglial activation and cytokine release in brain tissue from patients who underwent epilepsy surgery. Methods Cortical tissue was collected from thirteen patients with intractable epilepsy due to focal cortical dysplasia (6), encephalomalacia (5), Rasmussen's encephalitis (1) or mesial temporal lobe epilepsy (1). Sections were processed for immunohistochemistry using markers for neuron, astrocyte, microglia or cellular injury. Cytokine assay was performed on frozen cortices. Controls were autopsy brains from eight patients without history of neurological diseases. Results Marked activation of microglia and astrocytes and diffuse cell death were observed in epileptogenic tissue. Numerous fibrillary astrocytes and their processes covered the entire cortex and converged on to blood vessels, neurons and microglia. An overwhelming number of neurons and astrocytes showed DNA fragmentation and its magnitude significantly correlated with seizure frequency. Majority of our patients with abundant cell death in the cortex have mental retardation. IL-1beta, IL-8, IL-12p70 and MIP-1beta were significantly increased in the epileptogenic cortex; IL-6 and MCP-1 were significantly higher in patients with family history of epilepsy. Conclusions Our results suggest that active neuroinflammation and marked cellular injury occur in pediatric epilepsy and may play a common pathogenic role or consequences in childhood epilepsy of diverse etiologies. Our findings support the concept that immunomodulation targeting activated microglia and astrocytes may be a novel therapeutic strategy to reduce neurological morbidity and prevent intractable epilepsy.</p

University of Regensburg Publication Server

H2r: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Author: A del Sol Mesa
AL Barabási
B Rost
C Notredame
C Ouzounis
C Sander
C Steegborn
CC Hyde
CE Shannon
D Altschuh
DR Caffrey
E Eyal
E Neher
E Weber-Ban
E Zuckerkandl
ER Tillier
F Pearl
GB Gloor
GM Süel
HO Villar
I Kass
IM Wallace
J Tsai
JA Capra
JP Dekker
K Katoh
K Wang
LA Kelley
LC Martin
M Landau
Matthias Zwick
MC Saraf
ME Noble
O Noivirt
O Olmea
OV Kalinina
OV Kalinina
R Merkl
RA Estabrook
RA Laskowski
Rainer Merkl
RD Finn
RI Dima
S Henikoff
SJ Fleishman
SM Larson
SW Lockless
T Lassmann
T Sato
TD Schneider
U Göbel
V Kulik
V Kulik
WH Press
WR Atchley
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues. RESULTS: We introduce conn(k), a novel parameter for the characterisation of individual residues. For each residue k, conn(k) is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (MI) value U(k, l) computed for all pairs of residues k, l. We demonstrate that conn(k) is a more robust indicator than an individual MI-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes conn(k)-values is available at http://www-bioinf.uni-regensburg.de. CONCLUSION: The algorithms H2r, which analyses MSAs and computes conn(k)-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by conn(k)

Correlated Mutations: A Hallmark of Phenotypic Amino Acid Substitutions

Author: A Bairoch
A Fuchs
A Hamosh
A Lapedes
A Lupi
A Tanoue
A Tanoue
AA Fodor
Andreas Kowarsch
Angelika Fuchs
BC Lee
C von Mering
D Altschuh
D Altschuh
D Vitkup
DD Pollock
DD Pollock
Dmitrij Frishman
EE Winter
F Endo
F Pazos
GB Gloor
H Huang
HM Berman
I Feldman
I Kass
IN Shindyalov
JG Caporaso
LC Martin
M Krzywinski
M Socolich
MH Knaggs
MS Singer
N Lopez-Bigas
NGC Smith
O Noivirt
O Noivirt-Brik
O Olmea
O Olmea
P Fariselli
P Ledoux
P Tuffery
P Wong
PC Ng
PC Ng
PD Stenson
Philipp Pagel
PJ Kundrotas
RC Edgar
RE Steward
RR Gutell
S Henikoff
S Sunyaev
S Vicatos
SAA Travers
SD Dunn
SK Ng
SM Larson
T Hershkovitz
Thomas Lengauer
U Göbel
V Ramensky
W Kabsch
WP Russ
WR Taylor
ZO Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Point mutations resulting in the substitution of a single amino acid can cause severe functional consequences, but can also be completely harmless. Understanding what determines the phenotypical impact is important both for planning targeted mutation experiments in the laboratory and for analyzing naturally occurring mutations found in patients. Common wisdom suggests using the extent of evolutionary conservation of a residue or a sequence motif as an indicator of its functional importance and thus vulnerability in case of mutation. In this work, we put forward the hypothesis that in addition to conservation, co-evolution of residues in a protein influences the likelihood of a residue to be functionally important and thus associated with disease. While the basic idea of a relation between co-evolution and functional sites has been explored before, we have conducted the first systematic and comprehensive analysis of point mutations causing disease in humans with respect to correlated mutations. We included 14,211 distinct positions with known disease-causing point mutations in 1,153 human proteins in our analysis. Our data show that (1) correlated positions are significantly more likely to be disease-associated than expected by chance, and that (2) this signal cannot be explained by conservation patterns of individual sequence positions. Although correlated residues have primarily been used to predict contact sites, our data are in agreement with previous observations that (3) many such correlations do not relate to physical contacts between amino acid residues. Access to our analysis results are provided at http://webclu.bio.wzw.tum.de/~pagel/supplements/correlated-positions/