Final report of ENGAGE ‐ Establishing Next Generation sequencing Ability for Genomic analysis in Europe

Modeling Bacterial Species: Using Sequence Similarity with Clustering Techniques

Author: García Barriocanal María Elena
González García Lino
Mora Cantallops Marçal
Sicilia Urbán Miguel Ángel
Sánchez Alonso Salvador
Publication date: 13/04/2021

Existing studies have challenged the current definition of named bacterial species, especially in the case of highly recombinogenic bacteria. This has prompted the use of computational procedures to examine potential bacterial clusters that species naming does not identify. This paper describes the use of sequence data obtained from MLST databases as input for a k-means algorithm extended to use housekeeping gene sequences as the similarity metric for the clustering process. An implementation of the k-means algorithm has been developed from an existing source code implementation and evaluated against MLST data. Results point to potential bacterial clusters that are close to more than one named species and thus may become candidates for alternative classifications accounting for genotypic information. The use of hierarchical clustering with sequence comparison as the similarity metric has the potential to find clusters different from named species by using a more informed cluster formation strategy than a conventional nominal variant of the algorithm.
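
As an illustration of clustering driven by a sequence-based dissimilarity, the sketch below groups aligned sequences around representative members. It is not the authors' implementation: it substitutes k-medoids for k-means (a mean of sequences is undefined, so each cluster is represented by one of its own members), and it assumes Hamming distance over equal-length aligned sequences as the metric. The names `hamming` and `k_medoids` are hypothetical.

```python
import random


def hamming(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b))


def k_medoids(seqs, k, iters=20, seed=0):
    """Cluster sequences around k medoids; returns (medoid indices, labels)."""
    rng = random.Random(seed)
    medoids = rng.sample(range(len(seqs)), k)
    labels = []
    for _ in range(iters):
        # Assignment step: each sequence joins its nearest medoid.
        labels = [
            min(range(k), key=lambda c: hamming(s, seqs[medoids[c]]))
            for s in seqs
        ]
        # Update step: the new medoid minimises total distance within its cluster.
        new_medoids = []
        for c in range(k):
            members = [i for i, lab in enumerate(labels) if lab == c]
            if not members:
                new_medoids.append(medoids[c])  # keep the old medoid for an empty cluster
                continue
            best = min(
                members,
                key=lambda i: sum(hamming(seqs[i], seqs[j]) for j in members),
            )
            new_medoids.append(best)
        if new_medoids == medoids:
            break  # converged: labels already match these medoids
        medoids = new_medoids
    return medoids, labels
```

In practice the concatenated housekeeping gene fragments of each isolate would take the place of the toy strings, and an alignment-aware distance could replace `hamming`.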

e_Buah - Biblioteca Digital de la Universidad de Alcalá

Bayesian Non-Exhaustive Classification A Case Study: Online Name Disambiguation using Temporal Record Streams

Author: Bunescu R.
Chen P.-Y.
Davis A.
de Carvalho A. P.
Dundar M.
Lee D. D.
Michaud D. J.
Sethuraman J.
Zhang B.
Publication date: 01/09/2016

The name entity disambiguation task aims to partition the records of multiple real-life persons so that each partition contains records pertaining to a unique person. Most existing solutions for this task operate in batch mode, where all records to be disambiguated are initially available to the algorithm. However, more realistic settings require that name disambiguation be performed in an online fashion and that the method be able to identify records of new ambiguous entities that have no preexisting records. In this work, we propose a Bayesian non-exhaustive classification framework for solving the online name disambiguation task. Our proposed method uses a Dirichlet process prior with a Normal × Normal × Inverse Wishart data model, which enables identification of new ambiguous entities that have no records in the training data. For online classification, we use a one-sweep Gibbs sampler, which is very efficient and effective. As a case study, we consider bibliographic data in a temporal stream format and disambiguate authors by partitioning their papers into homogeneous groups. Our experimental results demonstrate that the proposed method performs better than existing methods for the online name disambiguation task. Comment: to appear in CIKM 201

arXiv.org e-Print Archive

Crossref

IUPUIScholarWorks

The Humoral Response against Salmonella Typhi Protein Antigens During Acute, Convalescent, and Chronic Typhoid Fever

Author: Tran Vu Thieu Nga
Publication date: 08/07/2019

Enteric (typhoid) fever is a life-threatening disease caused by the Salmonella enterica subspecies enterica serovars Typhi (S. Typhi) and Paratyphi A, B, and C (S. Paratyphi A, B, and C). The disease still causes major public health problems in low- and middle-income countries, principally in Asia and Africa. The increasing frequency of multidrug-resistant (MDR) and extensively drug-resistant (XDR) isolates of S. Typhi and an increasing incidence of S. Paratyphi A mean that the international dynamics of enteric fever are changing. These changes add urgency to the demand for more efficient enteric fever control campaigns. The aim of this thesis was to assess control measures for enteric fever in Vietnam and to develop techniques that can be used as further control methods. I first systematically reviewed retrospective information regarding enteric fever in Vietnam and combined these data with data on economic development. This investigation revealed that national economic growth, the provision of improved quality drinking water, and better sanitation were likely the greatest contributors to the decline and ultimate elimination of enteric fever in Vietnam. My work then evaluated the serodiagnostic potential of a panel of novel S. Typhi protein antigens and the Vi capsular polysaccharide (Vi) in a group of patients with febrile diseases in Bangladesh. These data demonstrated the utility of serology for typhoid diagnostics when exploiting a combination of Vi and at least one protein antigen. I then assessed the acquisition of antibody against typhoid toxin during natural S. Typhi and S. Paratyphi A infections and measured the capability of these antibodies to neutralise the toxin. The data provided supporting evidence for generating an antitoxin treatment for enteric fever (caused by both S. Typhi and S. Paratyphi A) and potentially encourage the use of typhoid toxin in vaccine formulations.
In the search for novel vaccine candidates, my work further identified a panel of immunogenic antigens shared between S. Typhi and S. Paratyphi A that can stimulate an antibody response that can instigate bactericidal killing during natural infection. Finally, by exploiting the unique immunological profiles of S. Typhi carriers (cytokines and antibody), I developed a method of identifying S. Typhi carriers and estimating the prevalence of S. Typhi carriage in a typhoid endemic population. My findings will potentially lead to the development of novel enteric fever control strategies. I conclude that improved case detection and widespread vaccination campaigns using polyvalent Salmonella vaccines should be initiated to reduce the burden of enteric fever in endemic areas.

Open Research Online (The Open University)

An image classification approach to analyze the suppression of plant immunity by the human pathogen Salmonella Typhimurium

Background: The enteric pathogen Salmonella is the causative agent of the majority of food-borne bacterial poisonings. Recent research revealed that colonization of plants by Salmonella is an active infection process. Salmonella changes the metabolism and adjusts the plant host by suppressing the defense mechanisms. In this report we developed an automatic algorithm to quantify the symptoms caused by Salmonella infection on Arabidopsis. Results: The algorithm is designed to assign image pixels to one of two classes: healthy and unhealthy. The task is solved in three steps. First, we perform segmentation to divide the image into foreground and background. In the second step, a support vector machine (SVM) is applied to predict the class of each pixel belonging to the foreground. Finally, we refine the result with a neighborhood check in order to omit all falsely classified pixels from the second step. The developed algorithm was tested on infection with the non-pathogenic E. coli and the plant pathogen Pseudomonas syringae and used to study the interaction between plants and Salmonella wild type and T3SS mutants. We proved that T3SS mutants of Salmonella are unable to suppress the plant defenses. Results obtained through the automatic analyses were further verified on biochemical and transcriptome levels. Conclusion: This report presents an automatic pixel-based classification method for detecting "unhealthy" regions in leaf images. The proposed method was compared to an existing method and showed a higher accuracy. We used this algorithm to study the impact of the human pathogenic bacterium Salmonella Typhimurium on the plant immune system. The comparison between wild type bacteria and T3SS mutants showed similarity in the infection process in animals and in plants.
Plant epidemiology is only one possible application of the proposed algorithm; it can be easily extended to other detection tasks that also rely on color information, or even extended to other features.
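
The third step of the pipeline described in this abstract, the neighborhood check, can be sketched as follows. The abstract does not give the exact rule, so everything here is an assumption made for illustration: labels are 1 for unhealthy, 0 for healthy and -1 for background, and an unhealthy pixel is kept only when at least `min_agree` of its eight neighbours are also unhealthy. The function name `refine_labels` is hypothetical.

```python
def refine_labels(labels, min_agree=3):
    """Return a copy of a 2D label grid with weakly supported 'unhealthy' pixels reset.

    Assumed encoding: 1 = unhealthy, 0 = healthy, -1 = background (left untouched).
    """
    rows, cols = len(labels), len(labels[0])
    out = [row[:] for row in labels]  # work on a copy; the input grid is not modified
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != 1:
                continue
            # Count unhealthy pixels among the (up to) eight neighbours.
            agree = 0
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    if dr == 0 and dc == 0:
                        continue
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols and labels[rr][cc] == 1:
                        agree += 1
            if agree < min_agree:
                out[r][c] = 0  # treat the isolated pixel as a false positive
    return out
```

Counting neighbours on the original grid rather than on the partially refined copy keeps the result independent of scan order.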

HAL Evry

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Fraunhofer-ePrints

PubMed Central

ProdInra

Computational Identification of the Plausible Molecular Vaccine Candidates of Multidrug-Resistant Salmonella enterica

Author: El-Aal Amr Adel Ahmed Abd
Lahiri Chandrajit
Mishra Rohit
Tan Yong Chiang
Publication venue: 'IntechOpen'
Publication date: 23/04/2021

Salmonella enterica serovars are responsible for life-threatening, fatal, invasive diseases that are common in children and young adults. According to the most recent estimates, there are approximately 11–20 million cases and between 128,000 and 161,000 deaths globally per year. The high incidence rates of diseases like typhoid, caused by the serovars Typhi and Paratyphi, and gastroenteritis, caused by the non-typhoidal Salmonellae, have worsened as a growing number of pathogenic strains become resistant to fluoroquinolones, such as ciprofloxacin, and even to third-generation cephalosporins, such as ceftriaxone. With vaccination still being one of the chosen methods of eradicating this disease, identification of candidate proteins to be utilized for effective molecular vaccines has remained a challenging issue. In this study, we describe the use of computational tools to analyze and predict potential vaccine candidate(s) for the multidrug-resistant serovars of S. enterica.

IntechOpen

Multilocus Sequence Typing as a Replacement for Serotyping in Salmonella enterica

Author: A Miko
AJ Baumler
AJ Gatto
Alexandra Uesbeck
B Linz
B Malorny
B Swaminathan
BA Lindstedt
BJ Shapiro
BT Leader
C Dauga
C Fitzgerald
C Kidgell
C Yoshida
CH Chiu
D Falush
Debra E. Bessen
DL Baggesen
DL Swofford
EF Boyd
EJ Feil
EL Best
EM Daniels
F Kauffmann
F Walsh
FJ Cooke
FM Cohan
François-Xavier Weill
FX Weill
G Moran
G Morelli
GG Perron
Gordon Dougan
H Harbottle
H Levy
HC den Bakker
Heather Harbottle
J Corander
J Li
James L. Hale
JE Cooper
John Wain
JR McQuiston
K Tamura
KA Jolley
KE Holt
KH Han
KM Turner
L Le Minor
LA Hughes
Lee H. Harrison
M Achtman
M Eppinger
M Torpdahl
M Wiesner
Mark Achtman
Mary G. Krauland
MC Maiden
MCJ Maiden
MG Krauland
NH Smith
NR Thomson
P Beltran
P Petrov
P Roumagnac
PA Barrow
PA Grimont
R Dieckmann
R Hershberg
R Laukkanen-Ninios
R Okinaka
R Prager
RA Kingsley
RK Selander
S Guindon
S Herrera-Leon
S Nair
S Trupschuch
S Uzzau
Satheesh Nair
SK Parsons
SK Sheppard
SM Tennant
SS Abby
Sylvain Brisse
T Wirth
V Sangal
Vartul Sangal
W Rabsch
X Didelot
Y Moodley
Y Soyer
Zhemin Zhou
Publication venue: Public Library of Science
Publication date: 01/01/2012

Salmonella enterica subspecies enterica is traditionally subdivided into serovars by serological and nutritional characteristics. We used Multilocus Sequence Typing (MLST) to assign 4,257 isolates from 554 serovars to 1,092 sequence types (STs). The majority of the isolates and many STs were grouped into 138 genetically closely related clusters called eBurstGroups (eBGs). Many eBGs correspond to a serovar, for example most Typhimurium are in eBG1 and most Enteritidis are in eBG4, but many eBGs contained more than one serovar. Furthermore, most serovars were polyphyletic and are distributed across multiple unrelated eBGs. Thus, serovar designations confounded genetically unrelated isolates and failed to recognize natural evolutionary groupings. An inability of serotyping to correctly group isolates was most apparent for Paratyphi B and its variant Java. Most Paratyphi B were included within a sub-cluster of STs belonging to eBG5, which also encompasses a separate sub-cluster of Java STs. However, diphasic Java variants were also found in two other eBGs and monophasic Java variants were in four other eBGs or STs, one of which is in subspecies salamae and a second of which includes isolates assigned to Enteritidis, Dublin and monophasic Paratyphi B. Similarly, Choleraesuis was found in eBG6 and is closely related to Paratyphi C, which is in eBG20. However, Choleraesuis var. Decatur consists of isolates from seven other, unrelated eBGs or STs. The serological assignment of these Decatur isolates to Choleraesuis likely reflects lateral gene transfer of flagellar genes between unrelated bacteria plus purifying selection. By confounding multiple evolutionary groups, serotyping can be misleading about the disease potential of S. enterica. Unlike serotyping, MLST recognizes evolutionary groupings and we recommend that Salmonella classification by serotyping should be replaced by MLST or its equivalents.
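
The grouping of sequence types into related clusters can be illustrated with a small sketch. The abstract does not state the rule used to form eBGs, so the code below assumes a common eBURST-style criterion: two STs are linked when their allele profiles agree at no fewer than `min_shared` loci (six of seven here), and groups are the connected components of those links. The ST numbers and allele profiles in the usage example are invented, and `group_sequence_types` is a hypothetical name.

```python
from itertools import combinations


def shared_alleles(p, q):
    """Number of loci at which two allele profiles carry the same allele."""
    return sum(a == b for a, b in zip(p, q))


def group_sequence_types(profiles, min_shared=6):
    """Group STs into connected components of profiles sharing >= min_shared alleles.

    profiles maps an ST identifier to a tuple of allele numbers, one per locus.
    """
    parent = {st: st for st in profiles}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(profiles, 2):
        if shared_alleles(profiles[a], profiles[b]) >= min_shared:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for st in profiles:
        groups.setdefault(find(st), set()).add(st)
    return list(groups.values())


# Invented seven-locus profiles for illustration only.
example_profiles = {
    1: (1, 1, 1, 1, 1, 1, 1),
    2: (1, 1, 1, 1, 1, 1, 2),  # differs from ST 1 at one locus
    3: (1, 1, 1, 1, 1, 3, 2),  # differs from ST 2 at one locus
    4: (9, 9, 9, 9, 9, 9, 9),  # unrelated to the others
}
example_groups = group_sequence_types(example_profiles)
```

Because linkage is transitive, ST 1 and ST 3 end up in the same group through ST 2 even though they share only five alleles directly.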

Northumbria Research Link

Directory of Open Access Journals

HAL Descartes

Warwick Research Archives Portal Repository

D-Scholarship@Pitt

MPG.PuRe

FigShare

Public Library of Science (PLOS)

Crossref

PubMed Central

University of Melbourne Institutional Repository

HAL-Pasteur
