47 research outputs found
Population Genomics of Mycobacterium tuberculosis in Ethiopia Contradicts the Virgin Soil Hypothesis for Human Tuberculosis in Sub-Saharan Africa
Colonial medical reports claimed that tuberculosis (TB) was largely unknown in Africa prior to European contact, providing a "virgin soil" for spread of TB in highly susceptible populations previously unexposed to the disease [1, 2]. This is in direct contrast to recent phylogenetic models which support an African origin for TB [3-6]. To address this apparent contradiction, we performed a broad genomic sampling of Mycobacterium tuberculosis in Ethiopia. All members of the M. tuberculosis complex (MTBC) arose from clonal expansion of a single common ancestor [7] with a proposed origin in East Africa [3, 4, 8]. Consistent with this proposal, MTBC lineage 7 is almost exclusively found in that region [9-11]. Although a detailed medical history of Ethiopia supports the view that TB was rare until the 20th century [12], over the last century Ethiopia has become a high-burden TB country [13]. Our results provide further support for an African origin for TB, with some genotypes already present on the continent well before European contact. Phylogenetic analyses reveal a pattern of serial introductions of multiple genotypes into Ethiopia in association with human migration and trade. In place of a "virgin soil" fostering the spread of TB in a previously naive population, we propose that increased TB mortality in Africa was driven by the introduction of European strains of M. tuberculosis alongside expansion of selected indigenous strains having biological characteristics that carry a fitness benefit in the urbanized settings of post-colonial Africa.
The SeqCOVID-Spain consortium: unravelling the dynamics of the COVID-19 first epidemic wave in Spain
Poster presented at the Applied Bioinformatics and Public Health Microbiology 2021 Virtual Conference, held 5-7 May 2021. The COVID-19 pandemic has shaken the world since the beginning of 2020. Spain was among the European countries with the highest incidence of the disease during the first pandemic wave. We established a multidisciplinary consortium to monitor and study the evolution of the epidemic, with the aim of contributing to decision making and stopping rapid spread across the country. We present the results for 2170 sequences from the first wave of the SARS-CoV-2 epidemic in Spain, representing 12% of diagnosed cases until 14th March. This effort allows us to document at least 500 initial introductions, between early February and March, from multiple international sources. Importantly, we document the early rise of two dominant genetic variants in Spain (Spanish Epidemic Clades), named SEC7 and SEC8, likely amplified by superspreading events. In sharp contrast to other non-Asian countries, those two variants were closely related to the initial variants of SARS-CoV-2 described in Asia and represented 40% of the genome sequences analyzed. The two dominant SECs were widely spread across the country compared to other genetic variants, with SEC8 reaching a 60% prevalence just before the lockdown. Employing Bayesian phylodynamic analysis, we inferred a reduction in the effective reproductive number of these two SECs from around 2.5 to below 0.5 after the implementation of strict public-health interventions in mid-March. The effects of lockdown on the genetic variants of the virus are reflected in the general replacement of pre-existing SECs by a new variant at the beginning of the summer season. Our results reveal a significant difference in the genetic makeup of the epidemic in Spain and support the effectiveness of lockdown measures in controlling virus spread even for the most successful genetic variants. This work was funded by the Instituto de Salud Carlos III project COV20/00140, Spanish National Research Council project CSIC-COV19-021, Ministerio de Ciencia PID2019-104477RB-I00 and ERC StG 638553 to IC, and BFU2017-89594R to FGC. MC is supported by Ramón y Cajal program from Ministerio de Ciencia and grants RTI2018-094399-A-I00 and SEJI/2019/011. Peer reviewed.
Spread of a SARS-CoV-2 variant through Europe in the summer of 2020
Following its emergence in late 2019, the spread of SARS-CoV-2 [1, 2] has been tracked by phylogenetic analysis of viral genome sequences in unprecedented detail [3-5]. Although the virus spread globally in early 2020 before borders closed, intercontinental travel has since been greatly reduced. However, travel within Europe resumed in the summer of 2020. Here we report on a SARS-CoV-2 variant, 20E (EU1), that was identified in Spain in early summer 2020 and subsequently spread across Europe. We find no evidence that this variant has increased transmissibility, but instead demonstrate how rising incidence in Spain, resumption of travel, and lack of effective screening and containment may explain the variant’s success. Despite travel restrictions, we estimate that 20E (EU1) was introduced hundreds of times to European countries by summertime travellers, which is likely to have undermined local efforts to minimize infection with SARS-CoV-2. Our results illustrate how a variant can rapidly become dominant even in the absence of a substantial transmission advantage in favourable epidemiological settings. Genomic surveillance is critical for understanding how travel can affect transmission of SARS-CoV-2, and thus for informing future containment strategies as travel resumes.
Geographical and temporal distribution of SARS-CoV-2 clades in the WHO European Region, January to June 2020
We show the distribution of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) genetic clades over time and between countries and outline potential genomic surveillance objectives. We applied three genomic nomenclature systems to all sequence data from the World Health Organization European Region available until 10 July 2020. We highlight the importance of real-time sequencing and data dissemination in a pandemic situation, compare the nomenclatures and lay a foundation for future European genomic surveillance of SARS-CoV-2.
The first wave of the COVID-19 epidemic in Spain was associated with early introductions and fast spread of a dominating genetic variant
SeqCOVID-Spain consortium:
Álvaro Chiner-Oms, Irving Cancino-Muñoz, Mariana G. López, Manuela Torres-Puente, Inmaculada Gómez-Navarro, Santiago Jiménez-Serrano, Jordi Pérez-Tur, Darío García de Viedma, Laura Pérez-Lago, Marta Herranz, Jon Sicilia, Pilar Catalán-Alonso, Julia Suárez González, Patricia Muñoz, Mireia Coscolla, Paula Ruiz-Rodríguez, Fernando González-Candelas, Iñaki Comas, Lidia Ruiz-Roldán, María Alma Bracho, Neris García-González, Llúcia Martínez Priego, Inmaculada Galán-Vendrell, Paula Ruiz-Hueso, Griselda De Marco, María Loreto Ferrús-Abad, Sandra Carbó-Ramírez, Giuseppe D’Auria, Galo Adrian Goig, Juan Alberola, Jose Miguel Nogueira, Juan José Camarena, David Navarro, Eliseo Albert, Ignacio Torres, Maitane Aranzamendi Zaldumbide, Óscar Martínez Expósito, Nerea Antona Urieta, María de Toro, María Pilar Bea-Escudero, Jose Antonio Boga, Cristian Castelló-Abietar, Susana Rojo-Alba, Marta Elena Álvarez-Argüelles, Santiago Melón, Elisa Martró, Antoni E. Bordoy, Anna Not, Adrián Antuori, Anabel Fernández-Navarro, Andrés Canut-Blasco, Silvia Hernáez Crespo, Maria Luz Cordón Rodríguez, Maria Concepción Lecaroz Agara, Carmen Gómez-González, Amaia Aguirre-Quiñonero, José Israel López-Mirones, Marina Fernández-Torres, Maria Rosario Almela-Ferrer, Ana Carvajal, Juan Miguel Fregeneda-Grandes, Héctor Argüello, Gustavo Cilla Eguiluz, Milagrosa Montes Ros, Luis Piñeiro Vázquez, Ane Sorarrain, José María Marimón, José J. 
Costa-Alcalde, Rocío Trastoy, Gema Barbeito Castiñeiras, Amparo Coira, María Luisa Pérez del Molino, Antonio Aguilera, Begoña Palop-Borrás, Inmaculada de Toro Peinado, Maria Concepción Mediavilla Gradolph, Mercedes Pérez-Ruiz, Mirian Fernández-Alonso, Jose Luis del Pozo, Oscar González-Recio, Mónica Gutiérrez-Rivas, Jovita Fernández-Pinero, Miguel Ángel Jiménez Clavero, Begoña Fuster Escrivá, Concepción Gimeno Cardona, María Dolores Ocete Mochón, Rafael Medina-Gonzalez, José Antonio Lepe, Verónica González Galán, Ángel Rodríguez-Villodres, Nieves Gonzalo Jiménez, Jordi Reina, Carla López-Causapé, Maria Dolores Gómez-Ruiz, Eva M. Gonzalez-Barbera, José Luis López-Hontangas, Vicente Martín, Antonio J. Molina, Tania Fernandez-Villa, Ana Milagro Beamonte, Nieves Felisa Martínez-Cameo, Yolanda Gracia-Grataloup, Rosario Moreno-Muñoz, Maria Dolores Tirado Balaguer, José María Navarro-Marí, Irene Pedrosa-Corral, Sara Sanbonmatsu-Gámez, Antonio Oliver, Mónica Parra Grande, Bárbara Gómez Alonso, Francisco José Arjona Zaragozí, Maria Carmen Pérez González, Francisco Javier Chamizo López, Ana Bordes-Benítez, Núria Rabella, Ferran Navarro, Elisenda Miró, Antonio Rezusta, Alexander Tristancho, Encarnación Simarro Córdoba, Julia Lozano-Serra, Lorena Robles Fonseca, Álex Soriano, Francisco Javier Roig Sena, Hermelinda Vanaclocha Luna, Isabel Sanmartín, Daniel García-Souto, Ana Pequeño-Valtierra, Jose M. C. Tubio, Javier Temes, Jorge Rodríguez-Castro, Martín Santamarina García, Manuel Rodríguez-Iglesias, Fátima Galán-Sanchez, Salud Rodríguez-Pallares, José Manuel Azcona-Gutiérrez, Miriam Blasco-Alberdi, Alfredo Mayor, Alberto L. García-Basteiro, Gemma Moncunill, Carlota Dobaño, Pau Cisteró, Oriol Mitjà, Camila González-Beiras, Martí Vall-Mayans, Marc Corbacho-Monné, Andrea Alemany, Cristina Muñoz-Cuevas, Guadalupe Rodríguez-Rodríguez, Rafael Benito, Sonia Algarate, Jessica Bueno, Andrea Vergara-Gómez, Miguel J. 
Martínez, Jordi Vila, Elisa Rubio, Aida Peiró-Mestres, Jessica Navero-Castillejos, David Posada, Diana Valverde, Nuria Estévez, Iria Fernández-Silva, Loretta de Chiara, Pilar Gallego-García, Nair Varela, Ulises Gómez-Pinedo, Mónica Gozalo-Margüello, Maria Eliecer Cano García, José Manuel Méndez-Legaza, Jesus Rodríguez-Lozano, María Siller, Daniel Pablo-Marcos, Maria Montserrat Ruiz-García, Antonio Galiana, Judith Sánchez-Almendro, Maria Isabel Gascón Ros, Cristina Juana Torregrosa-Hetland, Eva María Pastor Boix, Paloma Cascales Ramos, Pedro Luis Garcinuño Enríquez, Salvador Raga Borja, Julia González Cantó, Olalla Martínez Macias, Adolfo de Salazar, Laura Viñuela González, Natalia Chueca, Federico García, Cristina Gómez-Camarasa, Amparo Farga Martí, Rocío Falcón, Victoria Domínguez-Márquez, Anna M. Planas, Israel Fernández-Cádenas, Maria Ángeles Marcos, Carmen Ezpeleta, Ana Navascués, Ana Miqueleiz Zapatero, Manuel Segovia, Antonio Moreno-Docón, Esther Viedma, Raúl Recio Martínez, Irene Muñoz-Gallego, Sara Gonzalez-Bodi, Maria Dolores Folgueira, Jesús Mingorance, Elias Dahdouh, Fernando Lázaro-Perona, María Rodríguez-Tejedor, María Pilar Romero-Gómez, Julio García-Rodríguez, Juan Carlos Galán, Mario Rodríguez-Dominguez, Laura Martínez-García, Melanie Abreu Di Berardino, Manuel Ponce-Alonso, Jose Maria González-Alba, Ivan Sanz-Muñoz, Diana Pérez San José, Maria Gil Fortuño, Juan B. 
Bellido-Blasco, Alberto Yagüe Muñoz, Noelia Hernández Pérez, Helena Buj Jordá, Óscar Pérez Olaso, Alejandro González Praetorius, Nora Mariela Martínez Ramírez, Aida Ramírez Marinero, Eduardo Padilla León, Alba Vilas Basil, Mireia Canal Aranda, Albert Bernet Sánchez, Alba Bellés Bellés, Eric López González, Iván Prats Sánchez, Mercè García-González, Miguel José Martínez-Lirola, Manuel Ángel Rodríguez Maresca, Maria Teresa Cabezas Fernández, María Eugenia Carrillo Gil, Maria Paz Ventero Martín, Carmen Molina Pardines, Nieves Orta Mira, María Navarro Cots, Inmaculada Vidal Catalá, Isabel García Nava, Soledad Illescas Fernández-Bermejo, José Martínez-Alarcón, Marta Torres-Narbona, Cristina Colmenarejo, Lidia García-Agudo, Jorge A. Pérez García, Martín Yago López, María Ángeles Goberna Bravo, Victoria Simón García, Gonzalo Llop Furquet, Agustín Iranzo Tatay, Sandra Moreno-Marro, Noelia Lozano Rodríguez, Amparo Broseta Tamarit, Juan José Badiola Díez, Amparo Martínez-Ramírez, Ana Dopazo, Sergio Callejas, Alberto Benguría, Begoña Aguado, Antonio Alcamí, Marta Bermejo Bermejo, Ricardo Ramos-Ruíz, Víctor Manuel Fernández Soria, Fernando Simón Soria & Mercedes Roig Cardells
The coronavirus disease 2019 (COVID-19) pandemic has affected the world radically since 2020. Spain was one of the European countries with the highest incidence during the first wave. As a part of a consortium to monitor and study the evolution of the epidemic, we sequenced 2,170 samples, diagnosed mostly before lockdown measures. Here, we identified at least 500 introductions from multiple international sources and documented the early rise of two dominant Spanish epidemic clades (SECs), probably amplified by superspreading events. Both SECs were related closely to the initial Asian variants of SARS-CoV-2 and spread widely across Spain. We inferred a substantial reduction in the effective reproductive number of both SECs due to public-health interventions (Re < 1), also reflected in the replacement of SECs by a new variant over the summer of 2020. In summary, we reveal a notable difference in the initial genetic makeup of SARS-CoV-2 in Spain compared with other European countries and show evidence to support the effectiveness of lockdown measures in controlling virus spread, even for the most successful genetic variants.
This work was mainly funded by the Instituto de Salud Carlos III project COV20/00140, with additional funding by Spanish National Research Council project CSIC-COV19-021, Ministerio de Ciencia project PID2019-104477RB-100, ERC StG 638553 and ERC CoG 101001038 to I.C., and BFU2017-89594R to F.G.C. M.C. is supported by Ramón y Cajal program from Ministerio de Ciencia and grants RTI2018-094399-A-I00 and Generalitat Valenciana (Regional Government) project SEJI/2019/011. We gratefully acknowledge Hospital Universitari Vall d’Hebron, Instituto de Salud Carlos III, IrsiCaixa AIDS Research Lab and all the international researchers and institutions that submitted sequenced SARS-CoV-2 genomes to the GISAID’s EpiCov Database (Supplementary Table 1), as an important part of our analyses has been made possible by the sharing of their work. We also thank Unidad de Bioinformática y Estadística, Centro de Investigación Príncipe Felipe, for allowing us to use the Computer Cluster to perform some of the bioinformatic analysis. Peer reviewed.
Contaminant DNA in bacterial sequencing experiments is a major source of false genetic variability
Background: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic repositories. Strikingly, analysis workflows for whole-genome sequencing (WGS) data commonly do not account for errors potentially introduced by contamination, which could lead to the wrong assessment of allele frequency both in basic and clinical research.
Results: We used a taxonomic filter to remove contaminant reads from more than 4000 bacterial samples from 20 different studies and performed a comprehensive evaluation of the extent and impact of contaminant DNA in WGS. We found that contamination is pervasive and can introduce large biases in variant analysis. We showed that these biases can result in hundreds of false positive and negative SNPs, even for samples with slight contamination. Studies investigating complex biological traits from sequencing data can be completely biased if contamination is neglected during the bioinformatic analysis, and we demonstrate that removing contaminant reads with a taxonomic classifier permits more accurate variant calling. We used both real and simulated data to evaluate and implement reliable, contamination-aware analysis pipelines.
Conclusion: As sequencing technologies consolidate as precision tools that are increasingly adopted in the research and clinical context, our results call for the implementation of contamination-aware analysis pipelines. Taxonomic classifiers are a powerful tool to implement such pipelines.
A high resolution picture of tuberculosis transmission obtained from direct-from-sputum whole-genome sequencing
Abstract of the oral communication presented at the Scientific Meeting on Mycobacteria, MycoPORTO 2019, Porto (Portugal), 19-20 September 2019. Whole-genome sequencing of M. tuberculosis (MTB) directly from clinical specimens (dWGS) will be a major breakthrough in tuberculosis diagnosis and control. To date, different strategies have been followed to sequence MTB from sputum samples and accurately predict drug resistance. However, dWGS of MTB has never been used in genomic epidemiology. Here, we test and optimize both the laboratory and computational protocols to implement a dWGS pipeline able to produce a detailed genomic analysis from sputum samples in a week. We used dWGS on clinical specimens of 27 TB patients to evaluate its performance for drug-resistance prediction and genomic epidemiology. We were able to predict full drug resistance profiles and epidemiological links for 28 of the 37 specimens analyzed. In these samples, the agreement between dWGS and WGS from matching cultures was 100% for both drug-resistance prediction and epidemiological clusters. In our work, we evaluate the state of the art of dWGS for MTB and provide a cost-effective diagnostic algorithm for tuberculosis based on dWGS. We demonstrate that dWGS is a powerful tool to conduct high-precision genomic epidemiology in real time. Peer reviewed.
Whole-genome sequencing of Mycobacterium tuberculosis directly from clinical samples for high-resolution genomic epidemiology and drug resistance surveillance: an observational study
9 pages, 3 figures.
Background: Direct whole-genome sequencing of Mycobacterium tuberculosis from clinical specimens will be a major breakthrough in tuberculosis diagnosis and control. To date, direct whole-genome sequencing has never been used in genomic epidemiology, and its accuracy in transmission inference remains unknown. We investigated the technical challenges imposed by direct whole-genome sequencing, and used it to infer transmission clusters and predict drug resistance.
Methods: Using an optimised workflow, we did direct whole-genome sequencing for 37 clinical specimens from 23 tuberculosis patients. Nine sputum samples from nine patients who were infected with different non-tuberculous mycobacteria and culture-negative for tuberculosis were used as controls in the qPCR assays and pre-sequencing runs. Additionally, 780 clinical isolates in the region of Comunidad Valenciana (Spain) were whole-genome sequenced between Jan 1, 2014, and Dec 31, 2016. We analysed the genomic variants to build a tuberculosis transmission network for the region, including the clinical specimens, and to predict drug susceptibility profiles.
Findings: After sequencing 37 clinical specimens, 28 specimens (22 [85%] of 26 smear-positive and six [55%] of 11 smear-negative) met the quality criteria for downstream analysis. All 28 clinical specimens clustered with their matching culture isolates, with a median distance of 0 single nucleotide polymorphisms. Of the 28 clinical specimens, 16 (57%) were accurately assigned to ten transmission clusters in the region, and 12 (43%) were unique cases. Transmission inferences and drug-susceptibility predictions from direct whole-genome sequencing data were concordant with sequences from corresponding cultures and phenotypic drug-susceptibility testing. Complete genomic analysis, within a week of specimen receipt, cost €217 per sample (excluding personnel costs).
Interpretation: Direct whole-genome sequencing could be used to accurately delineate transmission clusters of tuberculosis and conduct culture-independent surveillance. Compared with conventional approaches, direct whole-genome sequencing allows researchers to do real-time genomic epidemiology and drug resistance surveillance in settings where culture and drug susceptibility testing are not available.
This work was supported by projects of the European Research Council (638553-TB-ACCELERATE), Ministerio de Economía y Competitividad, and Ministerio de Ciencia, Innovación y Universidades (Spanish Government; SAF2013-43521-R, SAF2016-77346-R, and SAF2017-92345-EXP [to IC], and BES-2014-071066 [to GAG]). Peer reviewed.
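The percentages quoted in the Findings can be rechecked from the counts given there. The following is a minimal sketch of that arithmetic only; the variable names and the helper `percent` are illustrative assumptions and are not part of the study's own analysis.

```python
# Counts as reported in the Findings paragraph of the abstract.
sequenced = {"smear-positive": 26, "smear-negative": 11}
passed_qc = {"smear-positive": 22, "smear-negative": 6}


def percent(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


# Share of specimens meeting the quality criteria, by smear status.
yield_by_smear = {k: percent(passed_qc[k], sequenced[k]) for k in sequenced}

# Totals across both smear categories.
total_sequenced = sum(sequenced.values())
total_passed = sum(passed_qc.values())

# Specimens assigned to transmission clusters versus unique cases.
clustered, unique = 16, 12
cluster_share = percent(clustered, total_passed)
unique_share = percent(unique, total_passed)

print(yield_by_smear, total_sequenced, total_passed, cluster_share, unique_share)
```

The rounded values agree with the figures stated in the abstract (85%, 55%, 57% and 43%), and the per-category counts sum to the 37 sequenced and 28 retained specimens.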
New insights in the genome content of the bacterium Mycobacterium brumae
Abstract of the poster presented at the 41st Annual Congress of the European Society of Mycobacteriology (ESM), held virtually 28-29 June 2019. Mycobacterium brumae is a fast-growing, non-pathogenic Mycobacterium species, originally isolated from environmental and human samples in Barcelona, Spain. Previous studies have shown that the use of non-pathogenic mycobacteria, such as Mycobacterium bovis BCG, can improve the treatment of high-risk non-muscle invasive bladder cancer (BC) by intravesical administration. M. brumae has been shown to be non-pathogenic and its phenotype and immunogenic effect have been well characterized. However, the knowledge of its underlying genetic composition is still incomplete. In this study we have sequenced the genome of the M. brumae strain CR-270 using PacBio, obtaining the most complete assembly to date. We describe its genetic content by showing evolutionary relationships between different mycobacteria and we compare its virulence gene content with that of virulent mycobacteria such as the H37Rv reference strain. Furthermore, we describe the genetic variability of M. brumae by comparative genomics using Illumina sequences. Our results increase the knowledge of the genetic bases that explain the non-pathogenic phenotype of this bacterium with therapeutic potential. Peer reviewed.
Signatures of transmission in within-host M. tuberculosis variation
24 pages, 4 figures.
Background: Because M. tuberculosis evolves slowly, transmission clusters often contain multiple individuals with identical consensus genomes, making it difficult to reconstruct transmission chains. Finding additional sources of shared M. tuberculosis variation could help overcome this problem. Previous studies have reported M. tuberculosis diversity within infected individuals; however, whether within-host variation improves transmission inferences remains unclear.
Methods: To evaluate the transmission information present in within-host M. tuberculosis variation, we re-analyzed publicly available sequence data from three household transmission studies, using household membership as a proxy for transmission linkage between donor-recipient pairs.
Findings: We found moderate levels of minority variation present in M. tuberculosis sequence data from cultured isolates that varied significantly across studies (mean: 6, 7, and 170 minority variants above a 1% minor allele frequency threshold, outside of PE/PPE genes). Isolates from household members shared more minority variants than did isolates from unlinked individuals in the three studies (mean 98 shared minority variants vs. 10; 0.8 vs. 0.2, and 0.7 vs. 0.2, respectively). Shared within-host variation was significantly associated with household membership (OR: 1.51 [1.30,1.71], for one standard deviation increase in shared minority variants). Models that included shared within-host variation improved the accuracy of predicting household membership in all three studies as compared to models without within-host variation (AUC: 0.95 versus 0.92, 0.99 versus 0.95, and 0.93 versus 0.91).
Interpretation: Within-host M. tuberculosis variation persists through culture and could enhance the resolution of transmission inferences. The substantial differences in minority variation recovered across studies highlight the need to optimize approaches to recover and incorporate within-host variation into automated phylogenetic and transmission inference.
This study was funded by NIAID, grant 5K01AI173385. Peer reviewed.