Search CORE

2 research outputs found

Fast estimation of genetic relatedness between members of heterogeneous populations of closely related genomic variants

Author: Alex Zelikovsky
David S. Campo
Pavel Skums
Seth Sims
Viachaslau Tsyvina
Yury Khudyakov
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2018
Field of study

Abstract Background Many biological analysis tasks require extraction of families of genetically similar sequences from large datasets produced by Next-generation Sequencing (NGS). Such tasks include detection of viral transmissions by analysis of all genetically close pairs of sequences from viral datasets sampled from infected individuals or studying of evolution of viruses or immune repertoires by analysis of network of intra-host viral variants or antibody clonotypes formed by genetically close sequences. The most obvious naïeve algorithms to extract such sequence families are impractical in light of the massive size of modern NGS datasets. Results In this paper, we present fast and scalable k-mer-based framework to perform such sequence similarity queries efficiently, which specifically targets data produced by deep sequencing of heterogeneous populations such as viruses. It shows better filtering quality and time performance when comparing to other tools. The tool is freely available for download at https://github.com/vyacheslav-tsivina/signature-sj Conclusion The proposed tool allows for efficient detection of genetic relatedness between genomic samples produced by deep sequencing of heterogeneous populations. It should be especially useful for analysis of relatedness of genomes of viruses with unevenly distributed variable genomic regions, such as HIV and HCV. For the future we envision, that besides applications in molecular epidemiology the tool can also be adapted to immunosequencing and metagenomics data

Directory of Open Access Journals

Fast estimation of genetic relatedness between members of heterogeneous populations of closely related genomic variants

Author: A Gionis
A Longmire
Alex Zelikovsky
AZ Broder
B Ma
C Li
C Prevention
D Bankwitz
D Gusfield
David S. Campo
DS Campo
I Rytsareva
J Qin
J Zobel
J-M Pawlotsky
JW Ward
L Cuypers
O Glebova
P Medvedev
P Peterlongo
P Skums
Pavel Skums
RA Wagner
Seth Sims
SF Altschul
SI Nikolenko
Viachaslau Tsyvina
Yury Khudyakov
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref