Search CORE

Kölner UniversitätsPublikationsServer

Computational problems of analysis of short next generation sequencing reads

Author: A. M. Spitsina
E. R. Galieva
F. M. Naumenko
I. I. Abnizova
I. V. Chadaeva
N. G. Orlova
R. te Boekhorst
Y. L. Orlov
Publication venue: 'Institute of Cytology and Genetics, SB RAS'
Publication date: 01/02/2017
Field of study

Short read next generation sequencing (NGS) has significant impacts on modern genomics, genetics, cell biology and medicine, especially on meta-genomics, comparative genomics, polymorphism detection, mutation screening, transcriptome profiling, methylation profiling, chromatin remodelling and many more applications. However, NGS are prone for errors which complicate scientific conclusions. NGS technologies consist of shearing DNA molecules into collection of numerous small fragments, called a ‘library’, and their further extensive parallel sequencing. These sequenced overlapping fragments are called ‘reads’, they are assembled into contiguous strings. The contiguous sequences are in turn assembled into genomes for further analysis. Computational sequencing problems are those arising from numerical processing of sequenced samples. The numerical processing involves procedures such as: quality-scoring, mapping/assembling, and surprisingly, error-correction of a data. This paper is reviewing post-processing errors and computational methods to discern them. It also includes sequencing dictionary. We present here quality control of raw data, errors arising at the steps of alignment of sequencing reads to a reference genome and assembly. Finally this work presents identification of mutations (“Variant calling”) in sequencing data and its quality control

Public Library of Science (PLOS)

Identifying Cis-Regulatory Sequences by Word Profile Similarity

Author: A Ivan
A Nasiadka
A Sosinsky
AG Nazina
AP Lifanov
BP Berman
BP Berman
BY Chan
C Zhang
D Bachtrog
DL Halligan
DS Johnson
E Emberly
EA Glazov
EE Hare
EH Davidson
F Poulin
Garmay Leung
H Janssens
I Abnizova
L Li
M Klingler
Michael B. Eisen
MR Kantorovitz
MS Halfon
N Pierstorff
N Rajewsky
Nicholas James Provart
S Prabhakar
S Sinha
XY Li
YH Grad
Publication venue: Public Library of Science
Publication date: 01/09/2009
Field of study

Recognizing regulatory sequences in genomes is a continuing challenge, despite a wealth of available genomic data and a growing number of experimentally validated examples.We discuss here a simple approach to search for regulatory sequences based on the compositional similarity of genomic regions and known cis-regulatory sequences. This method, which is not limited to searching for predefined motifs, recovers sequences known to be under similar regulatory control. The words shared by the recovered sequences often correspond to known binding sites. Furthermore, we show that although local word profile clustering is predictive for the regulatory sequences involved in blastoderm segmentation, local dissimilarity is a more universal feature of known regulatory sequences in Drosophila.Our method leverages sequence motifs within a known regulatory sequence to identify co-regulated sequences without explicitly defining binding sites. We also show that regulatory sequences can be distinguished from surrounding sequences by local sequence dissimilarity, a novel feature in identifying regulatory sequences across a genome. Source code for WPH-finder is available for download at http://rana.lbl.gov/downloads/wph.tar.gz

Public Library of Science (PLOS)

A Machine Learning Approach for Identifying Novel Cell Type–Specific Transcriptional Regulators of Myogenesis

Author: A Carmena
A Carmena
A Carmena
A Dastjerdi
A Erives
A Ivan
A Nose
A Paululat
A Siepel
A Subramanian
A Visel
A Visel
A Woolfe
AA Philippakis
AC Groth
AG Nazina
AG Nazina
AK Holloway
Alan M. Michelson
AM Michelson
AM Michelson
B Estrada
B Hanczar
BL Black
BP Berman
Brian W. Busser
BW Busser
C Bourgouin
C Chang
C Jiang
C Klämbt
CA Berkes
CI Swanson
CT Ong
DN Arnosti
DT Odom
E Davidson
EE Hare
EN Olson
FC Wardle
G Hon
G Junion
G Leung
G Ranganayakulu
GE Crawford
GG Loots
H Brohmann
H Rouault
HP Shih
I Abnizova
I Costello
I Guyon
I Ovcharenko
I Reim
I Reim
Ivan Ovcharenko
J Bischof
J Crocker
J Crocker
J Enriquez
J Ernst
J Shawe-Taylor
J Zeitlinger
JA Pederson
James W. Posakony
JD Pederson
JM Claycomb
JS Jakobsen
JW Mahaffey
K Jagla
K Robasky
K Senger
L Dubois
L Li
L Narlikar
L Narlikar
L Narlikar
Leila Taher
M Capovilla
M Frasch
M Ludwig
M Markstein
M Markstein
M Porsch
M Ruiz-Gomez
M Schwaiger
MA Beer
MB Noyes
MD Biggin
MF Berger
MI Arnone
MJ Blow
MK Baylies
MK Baylies
MK Baylies
MK Gross
Molly J. Bloom
MR Kantorovitz
MS Halfon
MS Halfon
MV Taylor
N Negre
N Reeves
OL Griffith
P Tomancak
PJ Clyne
R Bodmer
R Galant
RG Ramsay
RJ Bryson-Richardson
RP Zinzen
S Barolo
S Knirr
S Knirr
S MacArthur
S Mahony
SA Ness
SB Carroll
SD Weatherbee
SJ Raudys
SM Gallo
SY Kim
T Jagla
T Sandmann
T Sandmann
Terese Tansey
TL Bailey
U Grossniklaus
V Matys
V Tixier
Y Benjamini
YH Liu
Yongsok Kim
Z Han
Publication venue: Public Library of Science
Publication date: 08/03/2012
Field of study

Transcriptional enhancers integrate the contributions of multiple classes of transcription factors (TFs) to orchestrate the myriad spatio-temporal gene expression programs that occur during development. A molecular understanding of enhancers with similar activities requires the identification of both their unique and their shared sequence features. To address this problem, we combined phylogenetic profiling with a DNA–based enhancer sequence classifier that analyzes the TF binding sites (TFBSs) governing the transcription of a co-expressed gene set. We first assembled a small number of enhancers that are active in Drosophila melanogaster muscle founder cells (FCs) and other mesodermal cell types. Using phylogenetic profiling, we increased the number of enhancers by incorporating orthologous but divergent sequences from other Drosophila species. Functional assays revealed that the diverged enhancer orthologs were active in largely similar patterns as their D. melanogaster counterparts, although there was extensive evolutionary shuffling of known TFBSs. We then built and trained a classifier using this enhancer set and identified additional related enhancers based on the presence or absence of known and putative TFBSs. Predicted FC enhancers were over-represented in proximity to known FC genes; and many of the TFBSs learned by the classifier were found to be critical for enhancer activity, including POU homeodomain, Myb, Ets, Forkhead, and T-box motifs. Empirical testing also revealed that the T-box TF encoded by org-1 is a previously uncharacterized regulator of muscle cell identity. Finally, we found extensive diversity in the composition of TFBSs within known FC enhancers, suggesting that motif combinatorics plays an essential role in the cellular specificity exhibited by such enhancers. In summary, machine learning combined with evolutionary sequence analysis is useful for recognizing novel TFBSs and for facilitating the identification of cognate TFs that coordinate cell type–specific developmental gene expression patterns

CiteSeerX

FigShare

Effective transcription factor binding site prediction using a combination of optimization, a genetic algorithm and discriminant analysis to capture distant interactions

Author: A Hoglund
AE Kel
AE Kel
AE Vinogradov
B Efron
B Jaruga
BJ Deroo
C Burge
CD Schmid
CR Calladine
D Cai
D GuhaThakurta
DM Graunke
E Fayard
Elena A Ananko
Elena V Ignatieva
FA Wright
GD Stormo
HP Ko
I Abnizova
I Ben-Gal
IA Udalova
Igor I Turnaev
J Duarte
J Hu
JV Ponomarenko
K Ellrott
K Morohashi
K Quandt
KJ Campbell
L Quintana-Murci
LC Platanias
LG Cowell
M Beato
M Blanchette
M Costantini
M Ganapathi
M Lohoff
M Stepanova
M-LT Lee
ML Bulyk
MP Ponomarenko
MQ Zhang
MQ Zhang
NA Kolchanov
NI Gershenzon
Nikolay A Kolchanov
NV Klimova
O Kel-Margoulis
OA Podkolodnaia
OD King
OG Berg
P Val
PV Benos
Q Zhou
R Castelo
R Kiyama
R Osada
R Pudimat
RV Davuluri
S Kamalakaran
Tatyana I Merkulova
TC Hodgman
TK Man
TM Chen
TV Busygina
VG Levitskii
VG Levitsky
VG Levitsky
VG Levitsky
VG Levitsky
Victor G Levitsky
VV Solovyev
W Huang
WH Shen
WW Wasserman
X Xie
Y Barash
Publication venue: BioMed Central
Publication date: 01/12/2007
Field of study

Abstract Background Reliable transcription factor binding site (TFBS) prediction methods are essential for computer annotation of large amount of genome sequence data. However, current methods to predict TFBSs are hampered by the high false-positive rates that occur when only sequence conservation at the core binding-sites is considered. Results To improve this situation, we have quantified the performance of several Position Weight Matrix (PWM) algorithms, using exhaustive approaches to find their optimal length and position. We applied these approaches to bio-medically important TFBSs involved in the regulation of cell growth and proliferation as well as in inflammatory, immune, and antiviral responses (NF-κB, ISGF3, IRF1, STAT1), obesity and lipid metabolism (PPAR, SREBP, HNF4), regulation of the steroidogenic (SF-1) and cell cycle (E2F) genes expression. We have also gained extra specificity using a method, entitled SiteGA, which takes into account structural interactions within TFBS core and flanking regions, using a genetic algorithm (GA) with a discriminant function of locally positioned dinucleotide (LPD) frequencies. To ensure a higher confidence in our approach, we applied resampling-jackknife and bootstrap tests for the comparison, it appears that, optimized PWM and SiteGA have shown similar recognition performances. Then we applied SiteGA and optimized PWMs (both separately and together) to sequences in the Eukaryotic Promoter Database (EPD). The resulting SiteGA recognition models can now be used to search sequences for BSs using the web tool, SiteGA. Analysis of dependencies between close and distant LPDs revealed by SiteGA models has shown that the most significant correlations are between close LPDs, and are generally located in the core (footprint) region. A greater number of less significant correlations are mainly between distant LPDs, which spanned both core and flanking regions. When SiteGA and optimized PWM models were applied together, this substantially reduced false positives at least at higher stringencies. Conclusion Based on this analysis, SiteGA adds substantial specificity even to optimized PWMs and may be considered for large-scale genome analysis. It adds to the range of techniques available for TFBS prediction, and EPD analysis has led to a list of genes which appear to be regulated by the above TFs.</p

Springer - Publisher Connector

VarBin, a novel method for classifying true and false positive variants in NGS data

Author: A Luedtke
A McKenna
AE Minoche
C Ledergerber
DF Simola
DR Adams
EM Coonrod
Emily M Coonrod
F Meacham
H Lee
H Li
H Li
H Thorvaldsdottir
I Abnizova
Jacob Durtschi
Kalyan C Mallempati
Karl V Voelkerding
KV Fuentes Fajardo
MA DePristo
MR Ho
O Muralidharan
P Flaherty
Rebecca L Margraf
RL Margraf
RL Margraf
TJ Treangen
Y Shen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Detecting non-allelic homologous recombination from high-throughput sequencing data

Author: A Abyzov
A Datta
A Ritz
AB Smith
AD Irvine
AP Levy
AR Quinlan
B Efron
Benjamin J Raphael
C Alkan
Charles E Lawrence
CM Carvalho
D Pulido
DF Conrad
DJ Turner
E Sidransky
EE Eichler
EV Linardopoulou
F Hormozdiari
F Hormozdiari
H Li
HV Huang
I Kasvosve
II Abnizova
II Abnizova
J Clarke
J Eid
J Lee
J-M Chen
JA Bailey
JA Beck
JM Kidd
JM Kidd
JO Korbel
JR Lupski
K Chen
K Ye
LE Carvalho
M Papp
M Sasaki
M Torrent
M Yoshimoto
Matthew M Parks
MJP Chaisson
P Huertas
P Liu
P Medvedev
P Stankiewicz
P Stankiewicz
PA Pevzner
PH Sudmant
PJ Hastings
RE Mills
S Sindi
S Yoon
SA Forbes
T Jin
The 1000 Genomes Project Consortium
UM Zanger
X Hu
X She
X Sun
Y Costa
Y Savir
Z Chen
Z Gan-Or
Z Ou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Northumbria Research Link

Genomic reconstruction of the SARS-CoV-2 epidemic in England.

Author: Aanensen DM
Abnizova I
Abudahab K
Adams A
Adams H
Afifi S
Aggarwal D
Ahmad SSY
Aigrain L
Alcolea A
Alderton A
Ali M
Alikhan N-F
Allara E
Allen L
Amato R
Anderson R
Angyal A
Annett T
Aplin S
Ariani C
Ariani CV
Asad H
Ash A
Ashfield P
Ashford F
Atkinson L
Attwood SW
Auckland C
Austin-Guest S
Aydin A
Baker DJ
Baker P
Bala S
Balcazar CE
Ball J
Barrett J
Barrett JC
Barrow M
Barton E
Bashton M
Bassett A
Bassett AR
Batra R
Battleday K
Baxter C
Bayzid N
Beal J
Beale M
Beaver C
Beckett AH
Beckwith SM
Bedford L
Beer R
Beggs A
Bellany S
Bellerby T
Bellis K
Bellis KL
Berger D
Berriman M
Berry L
Bertolusso B
Best A
Betteridge E
Bevan P
Bibby D
Bicknell K
Binley S
Binns D
Birchley A
Bird PW
Birney E
Bishop C
Bishop J
Blackburn K
Blacow R
Blakey V
Blane B
Bolt F
Bonfield J
Bonner S
Bonsall D
Boswell T
Bosworth A
Boughton N
Bourgeois Y
Bowker S
Boyd O
Bradley DT
Breen C
Brendler-Spaeth T
Bresner C
Breuer J
Bridgett S
Bronner I
Bronner IF
Brooklyn T
Brooks E
Broos A
Brown JR
Bucca G
Buchan SL
Buck D
Buddenborg SK
Bull M
Burns PJ
Burton-Fanning S
Bush R
Byaruhanga T
Byott M
Caetano C
Cagan A
Campbell S
Carabelli AM
Cargill JS
Carlile M
Carter N
Cartwright J
Carvalho SF
Casey A
Castigador A
Catalan J
Chalker V
Chaloner NJ
Chand M
Chand M
Chapman L
Chappell JG
Charalampous T
Chatterton W
Chaudhry Y
Chillingworth T-J
Churcher CM
Clapham P
Clark G
Clark R
Clarke A
Clarke C
Clarke P
Cogger BJ
Cole D
Cole K
Collins J
Colquhoun R
Connor TR
Cook E
Cook KF
Coombes J
Coppola M
Corden S
Cormie C
Cornell L
Cornwell C
Cortes N
Corton C
Cotic M
Cotton S
Cottrell S
Coupland L
Cox A
Cox M
Crackett A
Craine N
Cranage A
Craven H
Craw S
Crawford L
Crawford M
Cross A
Crown MR
Crudgington D
Cumley N
Curran MD
Curran T
Cutts T
da Silva Filipe A
Dabrera G
Dabrowska M
Darby AC
Davidson RK
Davies A
Davies M
Davies R
Davies RM
Davis T
Dawson J
Day C
de Angelis D
De Lacy E
De Maio N
de Oliveira Martins L
de Silva TI
Debebe J
Densem A
Denton-Smith R
Dervisevic S
Dewar R
Dey J
Dias J
Dibling T
Dobie D
Dockree C
Dodd D
Dogga S
Dorman M
Dorman MJ
Dougan G
Dougherty M
Dove A
Downing F
Drummond L
Drury E
du Plessis L
Duckworth N
Dudek M
Durham J
Durrant L
Easthope E
Eastick K
Easton LJ
Eccles R
Eckert S
Edgeworth J
Edwards S
El Bouzidi K
Eldirdiri S
Ellaby N
Elliott S
Ellis P
Eltringham G
Ensell L
Erkiert MJ
Essex S
Evans C
Evans JM
Everson W
Fairley DJ
Fallon K
Fanaie A
Farr B
Farr BW
Fearn C
Feltwell T
Fenton M
Ferguson L
Ferrero M
Fina L
Flack N
Flaviani F
Fleming VM
Fordham H
Forrest S
Forsythe G
Foster-Nyarko E
Foulkes BH
Foulser L
Fragakis M
Frampton D
Francis M
Francois S
Fraser A
Fraser C
Freeman A
Freeman TM
Fryer H
Fuchs M
Fuller W
Funk S
Gajee K
Galai K
Gallagher A
Gallagher E
Gallagher MD
Gallis M
Galvin A
Garcia-Casado M
Gaskin A
Gatica-Wilcox B
Gedny A
Geidelberg L
Gemmell M
Georgana I
George RP
Gerstung M
Gifford L
Gilbert L
Girgis S
Girgis ST
Glaysher S
Glover J
Goater R
Goldman N
Goldstein EJ
Golubchik T
Gomes AN
Goncalves S
Gonçalves S
Gonçalves S
Goodfellow IG
Goodwin S
Goudarzi S
Gould O
Gourtovaia M
Graham C
Graham L
Grant PR
Gray A
Gray E
Green A
Green LR
Greenaway J
Gregory R
Griffiths C
Gu Y
Guerin F
Guest M
Gunson RN
Gupta RK
Gutierrez B
Haldenby ST
Hamilton W
Hamilton WL
Hanks H
Hansford SE
Haque T
Harris KA
Harrison E
Harrison EM
Harrison I
Harrott A
Harry E
Hart J
Hartley JA
Harvey M
Harvey WT
Harvison J
Hassan-Ibrahim MO
Heaney J
Heath P
Hellewell J
Helmer T
Henderson JH
Hernandez-Koutoucheva A
Hesketh AR
Hey J
Heyburn D
Higginson EE
Hill JD
Hill V
Hilson RA
Hilvers E
Hobbs R
Holden MTG
Holland D
Hollis A
Holmes AH
Holmes CW
Holmes N
Holmes S
Hopes R
Hornett G
Hornsby HR
Hosmillo M
Hough N
Houlihan C
Howson-Wells HC
Hsu SN
Hubb J
Huckle L
Huckson H
Hughes J
Hughes M
Hughes W
Hughes-Hallet L
Hunter A
Hutchings S
Idle G
Illingworth CJ
Impey R
Inglis S
Iqbal S
Irish-Tavares D
Iturriza-Gomara M
Izuagbe R
Jackson A
Jackson B
Jackson C
Jackson D
Jackson DK
Jackson KA
Jackson LM
Jahun AS
James K
James V
Jamrozy D
Jeanes C
Jeffries AR
Jeremiah S
Jermy A
John M
Johnson K
Johnson R
Johnston I
Jones CR
Jones H
Jones M
Jones N
Jones O
Jones S
Joseph A
Judges S
Jung AW
Kallepally K
Kane L
Kay GL
Kay K
Kay S
Keatley J
Keatley J-P
Keeley AJ
Keith A
Kenyon A
Kermack LM
Khakh M
Kidd SP
Kimuli M
King A
Kirk S
Kitchen C
Kitchin L
Kitchman K
Kleanthous M
Klimekova M
Knight BA
Korlevic P
Koshy C
Kraemer MUG
Krasheninnkova K
Kumziene-Summerhayes S
Kwiatkowski D
Kwiatkowski D
Lackenby A
Laing KG
Lampejo T
Lane G
Langford C
Langford CF
Laverack A
Lavin D
Law K
Lawniczak M
Lawton AI
Le-Viet T
Lee D
Lee JCD
Lensing S
Lensing SV
Leonard S
Letchford L
Levett LJ
Lewis J
Lewis K
Lewis-Wade A
Liddle J
Liddle J
Liggett S
Lillie PJ
Lin Q
Lindsay S
Lindsey BB
Linsdell S
Lister MM
Livett R
Lo S
Loman NJ
Long R
Loose MW
Louka SF
Lovell J
Lovell J
Loveson KF
Lowdon S
Lowe H
Lowe HL
Lucaci AO
Ludden C
Ludden C
Lynch J
Lyons RA
Lythgoe K
Machin NW
MacIntyre-Cockett G
Mack A
Mack J
Macklin B
Maclean A
Macnaughton E
Maddison M
Madona P
Maes M
Maftei L
Mahanama AIK
Mahungu TW
Mair D
Maksimovic J
Makunin A
Malone CS
Maloney D
Mamun I
Manesis N
Manley R
Mansfield J
Mantzouratou A
Marchbank A
Mariappan A
Marriott N
Martin M
Martincorena I
Martinez Nunez RT
Masters KM
Mather AE
Maxwell P
Mayhew M
Mayho M
Mbisa T
McCann CM
McCarthy S
McCarthy SA
McClintock J
McClure PC
McCrone JT
McGuigan S
McHugh MP
McHugh S
McKenna JP
McKerr C
McManus GM
McMinn L
McMurray C
McMurray CL
McNally A
Meadows C
Meadows L
Medd N
Megram O
Menegazzo M
Merrick I
Michell SL
Michelsen ML
Mirfenderesky M
Mirza J
Miskelly J
Mobley E
Moles-Garcia E
Moll R
Moll RJ
Molnar Z
Monahan IM
Mondani M
Monteiro TC
Mookerjee S
Moore C
Moore C
Moore J
Moore N
Morcrette H
Morgan M
Morgan S
Mori M
Morra M
Morriss A
Morrow L
Moses S
Mower C
Muir P
Mukaddas A
Munemo F
Munn R
Murie K
Murray A
Murray DR
Murray LJ
Mutingwende M
Myers R
Nash S
Nastouli E
Nathwani C
Naydenova P
Neaverson A
Nebbia G
Nelson A
Nelson C
Nelson R
Nerou E
Nguyen T
Nicholls S
Nichols J
Nicholson J
Nicodemi R
Nimz T
Noell GG
Nomikou K
Odedra M
Ohan V
Ohemeng-Kumi N
Oliver K
Olney C
Ormond D
Orton RJ
Osman H
Oszlanczi A
O’Brien S
O’Grady J
O’Meara S
O’Toole Á
Pacchiarini N
Padgett D
Page AJ
Palmer S
Pang YF
Panovska-Griffiths J
Pardubska B
Park EJ
Park N
Park NR
Parker MD
Parmar A
Parmar S
Partridge DG
Pascall D
Patel A
Patel B
Patel G
Patel M
Paterson S
Payne BAI
Payne M
Peacock S
Peacock SJ
Pearson C
Pelosi E
Percival B
Perkins J
Perry M
Petersen A
Pinckert ML
Platt S
Plowman D
Podplomyk O
Pohare M
Pond M
Pope CF
Poplawski R
Powell J
Poyner J
Preston T
Prestwood L
Price A
Price JR
Prieto JA
Pritchard DT
Prosolek SJ
Puethe C
Pugh G
Pusok M
Pybus OG
Pymont HM
Quail M
Quail MA
Quick J
Radulescu C
Raghwani J
Ragonnet-Cronin M
Rainbow L
Rajan D
Rajatileka S
Ramadan NA
Rambaut A
Ramble J
Rance R
Randell P
Randell PA
Ratcliffe L
Raviprakash V
Rawlings S
Raza M
Redshaw N
Redshaw NM
Rey S
Reynolds J
Reynolds M
Reynolds N
Rice S
Richardson M
Richter A
Roberts C
Robertson DL
Robinson D
Robinson E
Robinson K
Robinson M
Robson SC
Rogan F
Rogers H
Rojo EM
Rooke S
Roopra D
Rose M
Rowe W
Roy S
Rudd L
Rudder S
Ruis C
Rushton S
Sadri R
Saeed K
Saint C
Salmon N
Samaraweera B
Sambles CM
Sanderson R
Sanderson T
Sanderson T
Sang F
Sass T
Saul D
Scher E
Schwach F
Schwach F
Scott C
Scott G
Seekings P
Sehmi J
Shaaban S
Shah D
Shaw J
Shelest E
Shepherd JG
Sheridan LA
Sheriff N
Shirley L
Sillitoe J
Silviera S
Simms A
Simpson DA
Singh A
Singleton D
Sinnott M
Sinnott M
Sivadasan S
Siwek B
Sizer D
Skeldon K
Skelton J
Skvortsov T
Slater-Tunstill J
Sloan TJ
Sloper L
Sluga G
Smerdon N
Smith C
Smith C
Smith CP
Smith DL
Smith J
Smith K
Smith K
Smith KS
Smith L
Smith M
Smith N
Smith P
Smith S
Smith T
Smollett KL
Sneade L
Snell LB
Somassa T
Soria CD
Sousa C
Souster E
Southgate J
Sparkes A
Spellman K
Spencer Chapman MH
Spencer-Chapman M
Spurgin LG
Spyer MJ
Squares J
Stanley R
Stanley R
Stanley W
Stanton TD
Starinskij I
Steed C
Stickland T
Still I
Stockton J
Stonehouse S
Storey N
Stratton MR
Strickland M
Studholme DJ
Suciu M
Sudhanva M
Swann A
Swiatkowska A
Swift E
Swindells E
Sycamore N
Symons E
Szluha S
Taha Y
Taluy E
Tan NK
Tang JW
Tang M
Tao N
Taylor BEW
Taylor JF
Taylor K
Taylor S
Taylor S
Temperton B
Templeton KE
Thomas C
Thompson M
Thompson S
Thomson EC
Thomson L
Thomson M
Thomson N
Thornton A
Thurston S
Thurston SAJ
Todd JA
Tomb R
Tong L
Tonkin-Hill G
Toombs D
Topping B
Torok ME
Tovar-Corona J
Tovar-Corona JM
Trebes A
Trotter AJ
Tsatsani I
Turnbull R
Turtle L
Twohig KA
Umpleby H
Underwood AP
Ungureanu D
Uphill J
Urbanova J
Vamos EE
Van Vuuren PJ
Vancollie V
Vasylyeva TI
Vattipally S
Verdejo CJ
Vernet G
Vipond BB
Voak P
Volz E
Volz EM
Vöhringer HS
Walker D
Walker M
Waller M
Walsh S
Wang D
Ward G
Warne B
Warwick-Dugdale J
Wastnedge E
Watkins J
Watson LK
Waugh S
Weatherhogg C
Webb N
Webster HJ
Weldon D
Wells A
Wells E
Westwick E
Westwood L
Whalley T
Wheeler H
Whipp T
Whitehead M
Whiteley M
Whiteley T
Whitton G
Whitwham A
Widaa S
Wierzbicki C
Willford NJ
Williams C
Williams C
Williams C
Williams CA
Williams L-A
Williams M
Williams R
Williams RJ
Williams T
Williamson KA
Wilson M
Wilson-Davies E
Witele E
Withell KT
Witney AA
Wolverson P
Wong N
Workman T
Wright DW
Wright S
Wright V
Wyatt T
Wyllie S
Xu-McCrae L
Yavus M
Yaze G
Yeats CA
Yebra G
Yew WC
Young GR
Young J
Zamudio ME
Zarebski AE
Zhang P
Publication venue: Nature
Publication date: 01/01/2021
Field of study

The evolution of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus leads to new variants that warrant timely epidemiological characterization. Here we use the dense genomic surveillance data generated by the COVID-19 Genomics UK Consortium to reconstruct the dynamics of 71 different lineages in each of 315 English local authorities between September 2020 and June 2021. This analysis reveals a series of subepidemics that peaked in early autumn 2020, followed by a jump in transmissibility of the B.1.1.7/Alpha lineage. The Alpha variant grew when other lineages declined during the second national lockdown and regionally tiered restrictions between November and December 2020. A third more stringent national lockdown suppressed the Alpha variant and eliminated nearly all other lineages in early 2021. Yet a series of variants (most of which contained the spike E484K mutation) defied these trends and persisted at moderately increasing proportions. However, by accounting for sustained introductions, we found that the transmissibility of these variants is unlikely to have exceeded the transmissibility of the Alpha variant. Finally, B.1.617.2/Delta was repeatedly introduced in England and grew rapidly in early summer 2021, constituting approximately 98% of sampled SARS-CoV-2 genomes on 26 June 2021

LSHTM Research Online