11 research outputs found

FlatNJ: A novel network-based approach to visualize evolutionary and biogeographical relationships

Author: Balvočiūtė Monika
Moulton Vincent
Spillner Andreas
Publication venue: 'Oxford University Press (OUP)'
Publication date: 15/01/2014

Split networks are a type of phylogenetic network that allow visualization of conflict in evolutionary data. We present a new method for constructing such networks called FlatNetJoining (FlatNJ). A key feature of FlatNJ is that it produces networks that can be drawn in the plane in which labels may appear inside of the network. For complex data sets that involve, for example, non-neutral molecular markers, this can allow additional detail to be visualized as compared to previous methods such as split decomposition and NeighborNet. We illustrate the application of FlatNJ by applying it to whole HIV genome sequences, where recombination has taken place, fluorescent proteins in corals, where ancestral sequences are present, and mitochondrial DNA sequences from gall wasps, where biogeographical relationships are of interest. We find that the networks generated by FlatNJ can facilitate the study of genetic variation in the underlying molecular sequence data and, in particular, may help to investigate processes such as intra-locus recombination. FlatNJ has been implemented in Java and is freely available at www.uea.ac.uk/computing/software/flatnj

Crossref

Dryad Digital Repository (Duke University)

University of East Anglia digital repository

Flat Embeddings of Genetic and Distance Data

Author: Balvočiūtė Monika
Publication venue: 'University of Otago Library'
Publication date: 17/03/2016

The idea of displaying data in the plane is attractive in many different fields of research. This thesis focuses on distance-based phylogenetics and multidimensional scaling (MDS). Both types of method can be viewed as reducing high-dimensional data to pairwise distances and then visualizing the data based on these distances. The difference is that phylogenetics aims to find a network or tree structure that fits the distances, whereas MDS does not fix any structure: objects are simply placed in a low-dimensional space so that distances in the solution fit distances in the input as well as possible. Chapter 1 provides an introduction to phylogenetics and multidimensional scaling. Chapter 2 focuses on the theoretical background of flat split systems (planar split networks). We prove equivalences between flat split systems, planar split networks and loop-free acyclic oriented matroids of rank three. The latter is a convenient mathematical structure that we used to design the algorithm for computing planar split networks described in Chapter 3. We base our approach on the well-established agglomerative algorithms Neighbor-Joining and Neighbor-Net. In Chapter 4 we introduce multidimensional scaling and propose a new method for computing MDS plots that is based on the agglomerative approach and spring embeddings. Chapter 5 presents several case studies that we use to compare both of our methods with some classical agglomerative approaches in distance-based phylogenetics.

Te Tumu Eprints Repository

SPECTRE: a Suite of PhylogEnetiC Tools for Reticulate Evolution

Author: Alfonso Valencia
Andreas Spillner
Balvočiūtė
Bandelt
Bastkowski
Bollyky
Bryant
Daniel Mapleson
Grünewald
Huson
Levy
Monika Balvočiūtė
Sarah Bastkowski
Spillner
Taoyang Wu
Vincent Moulton
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

Split networks are a generalization of phylogenetic trees that have proven to be a powerful tool in phylogenetics. Various ways have been developed for computing such networks, including split decomposition, NeighborNet, QNet and FlatNJ. Some of these approaches are implemented in the user-friendly SplitsTree software package. However, to give the user the option to adjust and extend these approaches and to facilitate their integration into analysis pipelines, there is a need for robust, open-source implementations of associated data structures and algorithms. Here we present SPECTRE, a readily available, open-source library of data structures written in Java that comes complete with new implementations of several previously published algorithms and a basic interactive graphical interface for visualizing planar split networks. SPECTRE also supports the use of longer-running algorithms by providing command line interfaces, which can be executed on servers or in High Performance Computing (HPC) environments.

Crossref

Publikationsserver der Universität Tübingen

University of East Anglia digital repository

Critical Assessment of Metagenome Interpretation: A benchmark of metagenomics software

Author: A Mikheenko
Aaron E Darling
Adrian Fritz
Alexander Sczyrba
Alexey Gurevich
Alice C McHardy
Andreas Bremges
B Liu
Bernhard Y Renard
Bertrand Denis
Burton K H Chia
C Lozupone
Charles Deltel
Chirag Jain
Christopher Quince
Claire Lemaitre
D Coil
D Koslicki
D Li
D Turaev
Daniel A Cuevas
David Koslicki
DD Kang
DE Wood
DH Huson
Dmitrij Turaev
Dominique Lavenier
Dongwan Don Kang
E Pruesse
Edward M Rubin
Eik Dahms
Fernando Meyer
Genivaldo Gueiros Z Silva
GG Silva
Guillaume Rizk
H Klingenberg
Hans-Peter Klenk
Heiner Klingenberg
HH Lin
Hsin-Hung Lin
I Gregor
Ivan Gregor
J Alneberg
J Dröge
JA Chapman
Jeff L Froula
Jeffrey J Cook
Jessika Fiedler
Johannes Dröge
Julia A Vorholt
K Mavromatis
KT Konstantinidis
Lars Hestbjerg Hansen
M Arumugam
M Balvočiūtė
M Strous
M Yassour
Marc Strous
Markus Göker
Matthew Z DeMaere
Michael Beckstette
Michael D Barton
Mihai Pop
ML Bendall
Monika Balvočiūtė
N Kashtan
N Sangwan
N Segata
Nicole Shapiro
Nikos C Kyrpides
Niranjan Nagarajan
NP Nguyen
O Koren
P Belmann
Paul Schulze-Lefert
Peter Belmann
Peter Hofmann
Peter Meinicke
Philip D Blood
Pierre Peterlongo
R Chikhi
R Ounit
Rayan Chikhi
Robert A Edwards
Robert Egan
RR Miller
Ruben Garrido-Oter
S Boisvert
S Chatterjee
S Gao
S Lindgreen
S Sunagawa
Stefan Janssen
Stephan Majda
Steven W Singer
Surya Saha
Søren J Sørensen
T Thomas
Tanja Woyke
Thomas Lingner
Thomas Rattei
Tue Sparholt Jørgensen
V Marx
VC Piro
Vitor C Piro
Y Bai
Yang Bai
Yu-Chieh Liao
Yu-Wei Wu
YW Wu
Zhong Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017

In metagenome analysis, computational methods for assembly, taxonomic profiling and binning are key components facilitating downstream biological data interpretation. However, a lack of consensus about benchmarking datasets and evaluation metrics complicates proper performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on datasets of unprecedented complexity and realism. Benchmark metagenomes were generated from ~700 newly sequenced microorganisms and ~600 novel viruses and plasmids, including genomes with varying degrees of relatedness to each other and to publicly available ones and representing common experimental setups. Across all datasets, assembly and genome binning programs performed well for species represented by individual genomes, while performance was substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below the family level. Parameter settings substantially impacted performances, underscoring the importance of program reproducibility. While highlighting current challenges in computational metagenomics, the CAMI results provide a roadmap for software selection to answer specific research questions.

Roskilde Universitet

HAL Descartes

Warwick Research Archives Portal Repository

MPG.PuRe

Hal-Diderot

Repository for Publications and Research Data

Crossref

National Health Research Institutes

OPUS - University of Technology Sydney

INRIA a CCSD electronic archive server

Copenhagen University Research Information System

eScholarship - University of California

Publications at Bielefeld University

University of East Anglia digital repository

ScholarBank@NUS

HAL-Rennes 1

Flat Embeddings of Genetic and Distance Data

Author: Balvočiūtė Monika
Publication venue: 'University of Otago Library'
Publication date: 17/03/2016

Otago University Research Archive

Additional file 1 of SILVA, RDP, Greengenes, NCBI and OTT — how do these taxonomies compare?

Author: Daniel Huson (3840025)
Monika Balvočiūtė (3840028)

Supplementary material. A PDF file containing supporting data for the figures and detailed visualizations of pairwise mappings. (PDF, 197 kb)

FigShare

Expression of KIR2DS1 does not significantly contribute to NK cell cytotoxicity in HLA-C1/C2 heterozygous haplotype B donors

Author: Andre Maya Caroline
Baltner Karla
Balvočiūtė Monika
Handgretinger Rupert
Kübler Ayline
Mezger Markus
Pal Marina
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2017

Publikationsserver der Universität Tübingen

Comprehensive analysis of DNA polymerase III α subunits and their homologs in bacterial genomes

Author: Abella
Albertas Timinskas
Altschul
Aravind
Aravind
Ashkenazy
Bailey
Baker
Barros
Bentley
Boshoff
Briggs
Bruck
Caspi
Christopherson
Ciccarelli
Costes
Crooks
Dalrymple
Darriba
Davies
Dervyn
Dohrmann
Dolinsky
Dulermo
Eddy
Engelen
Erill
Eswar
Evans
Filee
Frickey
Galhardo
Georgescu
Gonzalez
Guo
Hershberg
Hildebrand
Huson
Inoue
Ito
Jones
Jones
Katoh
Kazlauskas
Kim
Koch
Koonin
Koonin
Koorits
Kornberg
Kurth
Kurz
Kęstutis Timinskas
Lamers
Le
Le Chatelier
Lehtinen
Leipe
Letunic
Li
Lind
Liu
Ludwig
McCutcheon
McHenry
McHenry
Mitchell
Monika Balvočiūtė
Musto
Ozawa
Perera
Pettersen
Pritchard
Punta
Remmert
Reyes-Lamothe
Robinson
Sanders
Sanders
Sawaya
Sippl
Stamatakis
Stano
Söding
Söding
Taft-Benz
Timinskas
Tsai
Venclovas
Warner
Whelan
Wieczorek
Wijffels
Wilkins
Wing
Wu
Zeng
Zhao
Zhao
Česlovas Venclovas
Publication venue: 'Oxford University Press (OUP)'

Crossref

Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.

Author: Bai Yang
Balvočiūtė Monika
Barton Michael D
Beckstette Michael
Belmann Peter
Blood Philip D
Bremges Andreas
Chia Burton KH
Chikhi Rayan
Cook Jeffrey J
Cuevas Daniel A
Dahms Eik
Darling Aaron E
Deltel Charles
DeMaere Matthew Z
Denis Bertrand
Don Kang Dongwan
Dröge Johannes
Edwards Robert A
Egan Robert
Fiedler Jessika
Fritz Adrian
Froula Jeff L
Garrido-Oter Ruben
Gregor Ivan
Gurevich Alexey
Göker Markus
Hansen Lars Hestbjerg
Hofmann Peter
Jain Chirag
Janssen Stefan
Jørgensen Tue Sparholt
Klenk Hans-Peter
Klingenberg Heiner
Koslicki David
Kyrpides Nikos C
Lavenier Dominique
Lemaitre Claire
Liao Yu-Chieh
Lin Hsin-Hung
Lingner Thomas
Majda Stephan
McHardy Alice C
Meinicke Peter
Meyer Fernando
Nagarajan Niranjan
Peterlongo Pierre
Piro Vitor C
Pop Mihai
Quince Christopher
Rattei Thomas
Renard Bernhard Y
Rizk Guillaume
Rubin Edward M
Saha Surya
Schulze-Lefert Paul
Sczyrba Alexander
Shapiro Nicole
Silva Genivaldo Gueiros Z
Singer Steven W
Strous Marc
Sørensen Søren J
Turaev Dmitrij
Vorholt Julia A
Wang Zhong
Woyke Tanja
Wu Yu-Wei
Publication venue: eScholarship, University of California
Publication date: 01/11/2017

Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on highly complex and realistic data sets, generated from ∼700 newly sequenced microorganisms and ∼600 novel viruses and plasmids and representing common experimental setups. Assembly and genome binning programs performed well for species represented by individual genomes but were substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below family level. Parameter settings markedly affected performance, underscoring their importance for program reproducibility. The CAMI results highlight current challenges but also provide a roadmap for software selection to answer specific research questions

eScholarship - University of California