Search CORE

REDHORSE-REcombination and Double crossover detection in Haploid Organisms using next-geneRation SEquencing data

Author: Beverley Stephen M
Khan Asis
Shaik Jahangheer S
Sibley L. David
Publication venue: Digital Commons@Becker
Publication date: 01/01/2015
Field of study

BACKGROUND: Next-generation sequencing technology provides a means to study genetic exchange at a higher resolution than was possible using earlier technologies. However, this improvement presents challenges as the alignments of next generation sequence data to a reference genome cannot be directly used as input to existing detection algorithms, which instead typically use multiple sequence alignments as input. We therefore designed a software suite called REDHORSE that uses genomic alignments, extracts genetic markers, and generates multiple sequence alignments that can be used as input to existing recombination detection algorithms. In addition, REDHORSE implements a custom recombination detection algorithm that makes use of sequence information and genomic positions to accurately detect crossovers. REDHORSE is a portable and platform independent suite that provides efficient analysis of genetic crosses based on Next-generation sequencing data. RESULTS: We demonstrated the utility of REDHORSE using simulated data and real Next-generation sequencing data. The simulated dataset mimicked recombination between two known haploid parental strains and allowed comparison of detected break points against known true break points to assess performance of recombination detection algorithms. A newly generated NGS dataset from a genetic cross of Toxoplasma gondii allowed us to demonstrate our pipeline. REDHORSE successfully extracted the relevant genetic markers and was able to transform the read alignments from NGS to the genome to generate multiple sequence alignments. Recombination detection algorithm in REDHORSE was able to detect conventional crossovers and double crossovers typically associated with gene conversions whilst filtering out artifacts that might have been introduced during sequencing or alignment. REDHORSE outperformed other commonly used recombination detection algorithms in finding conventional crossovers. In addition, REDHORSE was the only algorithm that was able to detect double crossovers. CONCLUSION: REDHORSE is an efficient analytical pipeline that serves as a bridge between genomic alignments and existing recombination detection algorithms. Moreover, REDHORSE is equipped with a recombination detection algorithm specifically designed for Next-generation sequencing data. REDHORSE is portable, platform independent Java based utility that provides efficient analysis of genetic crosses based on Next-generation sequencing data. REDHORSE is available at http://redhorse.sourceforge.net/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1309-7) contains supplementary material, which is available to authorized users

Leishmania sexual reproductive strategies as resolved through computational methods designed for aneuploid genomes

Author: Beverley Stephen M
Dobson Deborah E
Sacks David L
Shaik Jahangheer S
Publication venue: Digital Commons@Becker
Publication date: 01/01/2021
Field of study

A cryptic sexual reproductive cycle i

Identification of transcriptional regulatory networks specific to pilocytic astrocytoma.

Author: Deshmukh Hrishikesh
Gutmann David H
MacDonald Tobey J
Nagarajan Rakesh
Payton Jacqueline E
Perry Arie
Shaik Jahangheer
Watson Mark A
Yu Jinsheng
Publication venue: eScholarship, University of California
Publication date: 01/01/2011
Field of study

BackgroundPilocytic Astrocytomas (PAs) are common low-grade central nervous system malignancies for which few recurrent and specific genetic alterations have been identified. In an effort to better understand the molecular biology underlying the pathogenesis of these pediatric brain tumors, we performed higher-order transcriptional network analysis of a large gene expression dataset to identify gene regulatory pathways that are specific to this tumor type, relative to other, more aggressive glial or histologically distinct brain tumours.MethodsRNA derived from frozen human PA tumours was subjected to microarray-based gene expression profiling, using Affymetrix U133Plus2 GeneChip microarrays. This data set was compared to similar data sets previously generated from non-malignant human brain tissue and other brain tumour types, after appropriate normalization.ResultsIn this study, we examined gene expression in 66 PA tumors compared to 15 non-malignant cortical brain tissues, and identified 792 genes that demonstrated consistent differential expression between independent sets of PA and non-malignant specimens. From this entire 792 gene set, we used the previously described PAP tool to assemble a core transcriptional regulatory network composed of 6 transcription factor genes (TFs) and 24 target genes, for a total of 55 interactions. A similar analysis of oligodendroglioma and glioblastoma multiforme (GBM) gene expression data sets identified distinct, but overlapping, networks. Most importantly, comparison of each of the brain tumor type-specific networks revealed a network unique to PA that included repressed expression of ONECUT2, a gene frequently methylated in other tumor types, and 13 other uniquely predicted TF-gene interactions.ConclusionsThese results suggest specific transcriptional pathways that may operate to create the unique molecular phenotype of PA and thus opportunities for corresponding targeted therapeutic intervention. Moreover, this study also demonstrates how integration of gene expression data with TF-gene and TF-TF interaction data is a powerful approach to generating testable hypotheses to better understand cell-type specific genetic programs relevant to cancer

eScholarship - University of California

Self-hybridization in Leishmania major

Author: Beverley Stephen M
Dobson Deborah E
Ferreira Tiago R
Ghosh Kashinath
Inbar Ehud
Jeffrey Brendan M
Sacks David
Shaik Jahangheer
Publication venue: 'American Society for Microbiology'
Publication date: 17/11/2022
Field of study

Genetic exchange between differen

Whole genome sequencing of experimental hybrids supports meiosis-like sexual recombination in Leishmania

Author: Beverley Stephen M
Cotton James A
Dobson Deborah
Grigg Michael E
Iantorno Stefano A
Inbar Ehud
Nzelu Chukwunonso O
Owens Katherine
Romano Audrey
Sacks David
Sanders Mandy J
Shaik Jahangheer
Publication venue: Digital Commons@Becker
Publication date: 01/01/2019
Field of study

Hybrid genotypes have been repeatedly described among natural isolates of Leishmania, and the recovery of experimental hybrids from sand flies co-infected with different strains or species of Leishmania has formally demonstrated that members of the genus possess the machinery for genetic exchange. As neither gamete stages nor cell fusion events have been directly observed during parasite development in the vector, we have relied on a classical genetic analysis to determine if Leishmania has a true sexual cycle. Here, we used whole genome sequencing to follow the chromosomal inheritance patterns of experimental hybrids generated within and between different strains of L. major and L. infantum. We also generated and sequenced the first experimental hybrids in L. tropica. We found that in each case the parental somy and allele contributions matched the inheritance patterns expected under meiosis 97–99% of the time. The hybrids were equivalent to F1 progeny, heterozygous throughout most of the genome for the markers that were homozygous and different between the parents. Rare, non-Mendelian patterns of chromosomal inheritance were observed, including a gain or loss of somy, and loss of heterozygosity, that likely arose during meiosis or during mitotic divisions of the progeny clones in the fly or culture. While the interspecies hybrids appeared to be sterile, the intraspecies hybrids were able to produce backcross and outcross progeny. Analysis of 5 backcross and outcross progeny clones generated from an L. major F1 hybrid, as well as 17 progeny clones generated from backcrosses involving a natural hybrid of L. tropica, revealed genome wide patterns of recombination, demonstrating that classical crossing over occurs at meiosis, and allowed us to construct the first physical and genetic maps in Leishmania. Altogether, the findings provide strong evidence for meiosis-like sexual recombination in Leishmania, presenting clear opportunities for forward genetic analysis and positional cloning of important genes.</div

The Francis Crick Institute

Enlighten

A unified framework for finding differentially expressed genes from microarray experiments

Author: C Tang
C Zhang
D Stekel
DL Davies
G Casella
G Getz
GJ McLachlan
H Hui-Huang
H Sahai
I Guyon
I Lonnstedt
IB Jeffery
J Shaik
J Shaik
J Shaik
J Shaik
J Shaik
Jahangheer S Shaik
JD Storey
Mohammed Yeasin
P Tamayo
RA Fisher
RL Fernando
RM Miller
RO Duda
S Mukherjee
S Tavazoie
T Li
TR Golub
U Alon
VG Tusher
X Chen
Y Benjamini
Y Benjamini
Y Su
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background This paper presents a unified framework for finding differentially expressed genes (DEGs) from the microarray data. The proposed framework has three interrelated modules: (i) gene ranking, ii) significance analysis of genes and (iii) validation. The first module uses two gene selection algorithms, namely, a) two-way clustering and b) combined adaptive ranking to rank the genes. The second module converts the gene ranks into p-values using an R-test and fuses the two sets of p-values using the Fisher's omnibus criterion. The DEGs are selected using the FDR analysis. The third module performs three fold validations of the obtained DEGs. The robustness of the proposed unified framework in gene selection is first illustrated using false discovery rate analysis. In addition, the clustering-based validation of the DEGs is performed by employing an adaptive subspace-based clustering algorithm on the training and the test datasets. Finally, a projection-based visualization is performed to validate the DEGs obtained using the unified framework. Results The performance of the unified framework is compared with well-known ranking algorithms such as t-statistics, Significance Analysis of Microarrays (SAM), Adaptive Ranking, Combined Adaptive Ranking and Two-way Clustering. The performance curves obtained using 50 simulated microarray datasets each following two different distributions indicate the superiority of the unified framework over the other reported algorithms. Further analyses on 3 real cancer datasets and 3 Parkinson's datasets show the similar improvement in performance. First, a 3 fold validation process is provided for the two-sample cancer datasets. In addition, the analysis on 3 sets of Parkinson's data is performed to demonstrate the scalability of the proposed method to multi-sample microarray datasets. Conclusion This paper presents a unified framework for the robust selection of genes from the two-sample as well as multi-sample microarray experiments. Two different ranking methods used in module 1 bring diversity in the selection of genes. The conversion of ranks to p-values, the fusion of p-values and FDR analysis aid in the identification of significant genes which cannot be judged based on gene ranking alone. The 3 fold validation, namely, robustness in selection of genes using FDR analysis, clustering, and visualization demonstrate the relevance of the DEGs. Empirical analyses on 50 artificial datasets and 6 real microarray datasets illustrate the efficacy of the proposed approach. The analyses on 3 cancer datasets demonstrate the utility of the proposed approach on microarray datasets with two classes of samples. The scalability of the proposed unified approach to multi-sample (more than two sample classes) microarray datasets is addressed using three sets of Parkinson's Data. Empirical analyses show that the unified framework outperformed other gene selection methods in selecting differentially expressed genes from microarray data.</p

University of Memphis Digital Commons

The mating competence of geographically diverse Leishmania major strains in their natural and unnatural sand fly vectors

Author: Akopyants Natalia S
Barhoumi Mourad
Beverley Stephen M
Charmoy Melanie
Dobson Deborah E
Elnaiem Dia-Eldin A
Fay Michael
Grigg Michael
Inbar Ehud
Kauffmann Florence
Lawyer Phillip
Owens Katherine
Romano Audrey
Sacks David
Shaik Jahangheer
Publication venue: Digital Commons@Becker
Publication date: 01/01/2013
Field of study

Invertebrate stages of Leishmania are capable of genetic exchange during their extracellular growth and development in the sand fly vector. Here we explore two variables: the ability of diverse L. major strains from across its natural range to undergo mating in pairwise tests; and the timing of the appearance of hybrids and their developmental stage associations within both natural (Phlebotomus duboscqi) and unnatural (Lutzomyia longipalpis) sand fly vectors. Following co-infection of flies with parental lines bearing independent drug markers, doubly-drug resistant hybrid progeny were selected, from which 96 clonal lines were analyzed for DNA content and genotyped for parent alleles at 4-6 unlinked nuclear loci as well as the maxicircle DNA. As seen previously, the majority of hybrids showed '2n' DNA contents, but with a significant number of '3n' and one '4n' offspring. In the natural vector, 97% of the nuclear loci showed both parental alleles; however, 3% (4/150) showed only one parental allele. In the unnatural vector, the frequency of uniparental inheritance rose to 10% (27/275). We attribute this to loss of heterozygosity after mating, most likely arising from aneuploidy which is both common and temporally variable in Leishmania. As seen previously, only uniparental inheritance of maxicircle kDNA was observed. Hybrids were recovered at similar efficiencies in all pairwise crosses tested, suggesting that L. major lacks detectable 'mating types' that limit free genetic exchange. In the natural vector, comparisons of the timing of hybrid formation with the presence of developmental stages suggest nectomonads as the most likely sexually competent stage, with hybrids emerging well before the first appearance of metacyclic promastigotes. These studies provide an important perspective on the prevalence of genetic exchange in natural populations of L. major and a guide for experimental studies to understand the biology of mating

The Francis Crick Institute

NextGen sequencing reveals short double crossovers contribute disproportionately to genetic diversity in Toxoplasma gondii

Author: Asis Khan
Benjamin M Rosenthal
Hernan A Lorenzi
Jahangheer S Shaik
James W Ajioka
Jitender P Dubey
L Sibley
Michael Behnke
Qiuling Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

BACKGROUND: Toxoplasma gondii is a widespread protozoan parasite of animals that causes zoonotic disease in humans. Three clonal variants predominate in North America and Europe, while South American strains are genetically diverse, and undergo more frequent recombination. All three northern clonal variants share a monomorphic version of chromosome Ia (ChrIa), which is also found in unrelated, but successful southern lineages. Although this pattern could reflect a selective advantage, it might also arise from non-Mendelian segregation during meiosis. To understand the inheritance of ChrIa, we performed a genetic cross between the northern clonal type 2 ME49 strain and a divergent southern type 10 strain called VAND, which harbors a divergent ChrIa. RESULTS: NextGen sequencing of haploid F1 progeny was used to generate a genetic map revealing a low level of conventional recombination, with an unexpectedly high frequency of short, double crossovers. Notably, both the monomorphic and divergent versions of ChrIa were isolated with equal frequency. As well, ChrIa showed no evidence of being a sex chromosome, of harboring an inversion, or distorting patterns of segregation. Although VAND was unable to self fertilize in the cat, it underwent successful out-crossing with ME49 and hybrid survival was strongly associated with inheritance of ChrIII from ME49 and ChrIb from VAND. CONCLUSIONS: Our findings suggest that the successful spread of the monomorphic ChrIa in the wild has not been driven by meiotic drive or related processes, but rather is due to a fitness advantage. As well, the high frequency of short double crossovers is expected to greatly increase genetic diversity among progeny from genetic crosses, thereby providing an unexpected and likely important source of diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-1168) contains supplementary material, which is available to authorized users