48 research outputs found

    Identifying dramatic selection shifts in phylogenetic trees

    Get PDF
    BACKGROUND: The rate of evolution varies spatially along genomes and temporally in time. The presence of evolutionary rate variation is an informative signal that often marks functional regions of genomes and historical selection events. There exist many tests for temporal rate variation, or heterotachy, that start by partitioning sampled sequences into two or more groups and testing rate homogeneity among the groups. I develop a Bayesian method to infer phylogenetic trees with a divergence point, or dramatic temporal shifts in selection pressure that affect many nucleotide sites simultaneously, located at an unknown position in the tree. RESULTS: Simulation demonstrates that the method is most able to detect divergence points when rate variation and the number of affected sites is high, but not beyond biologically relevant values. The method is applied to two viral data sets. A divergence point is identified separating the B and C subtypes, two genetically distinct variants of HIV that have spread into different human populations with the AIDS epidemic. In contrast, no strong signal of temporal rate variation is found in a sample of F and H genotypes, two genetic variants of HBV that have likely evolved with humans during their immigration and expansion into the Americas. CONCLUSION: Temporal shifts in evolutionary rate of sufficient magnitude are detectable in the history of sampled sequences. The ability to detect such divergence points without the need to specify a prior hypothesis about the location or timing of the divergence point should help scientists identify historically important selection events and decipher mechanisms of evolution

    An Efficient kk-modes Algorithm for Clustering Categorical Datasets

    Get PDF
    Mining clusters from data is an important endeavor in many applications. The kk-means method is a popular, efficient, and distribution-free approach for clustering numerical-valued data, but does not apply for categorical-valued observations. The kk-modes method addresses this lacuna by replacing the Euclidean with the Hamming distance and the means with the modes in the kk-means objective function. We provide a novel, computationally efficient implementation of kk-modes, called OTQT. We prove that OTQT finds updates to improve the objective function that are undetectable to existing kk-modes algorithms. Although slightly slower per iteration due to algorithmic complexity, OTQT is always more accurate per iteration and almost always faster (and only barely slower on some datasets) to the final optimum. Thus, we recommend OTQT as the preferred, default algorithm for kk-modes optimization.Comment: 16 pages, 10 figures, 5 table

    In the Garden of Branching Processes

    Get PDF
    The current paper surveys and develops numerical methods for Markovian multitype branching processes in continuous time. Particular attention is paid to the calculation of means, variances, extinction probabilities, and marginal distributions in the presence of a Poisson stream of immigrant particles. The Poisson process assumption allows for temporally complex patterns of immigration and facilitates application of marked Poisson processes and Campbell’s formulas. The methods and formulas derived are applied to four models: two population genetics models, a model for vaccination against an infectious disease in a community of households, and a model for the growth of resistant HIV virus in patients undergoing drug therap

    Rev Variation during Persistent Lentivirus Infection

    Get PDF
    The ability of lentiviruses to continually evolve and escape immune control is the central impediment in developing an effective vaccine for HIV-1 and other lentiviruses. Equine infectious anemia virus (EIAV) is considered a useful model for immune control of lentivirus infection. Virus-specific cytotoxic T lymphocytes (CTL) and broadly neutralizing antibody effectively control EIAV replication during inapparent stages of disease, but after years of low-level replication, the virus is still able to produce evasion genotypes that lead to late re-emergence of disease. There is a high rate of genetic variation in the EIAV surface envelope glycoprotein (SU) and in the region of the transmembrane protein (TM) overlapped by the major exon of Rev. This review examines genetic and phenotypic variation in Rev during EIAV disease and a possible role for Rev in immune evasion and virus persistence

    Genomic Sequence and Phylogenetic Analysis of Culex Flavivirus, an Insect-Specific Flavivirus, Isolated From Culex pipiens (Diptera: Culicidae) in Iowa

    Get PDF
    Adult mosquitoes (Diptera: Culicidae) were collected in 2007 and tested for specific viruses, including West Nile virus, as part of the ongoing arbovirus surveillance efforts in the state of Iowa. A subset of these mosquitoes (6,061 individuals in 340 pools) was further tested by reverse transcription-polymerase chain reaction (RT-PCR) using flavivirus universal primers. Of the 211 pools of Culex pipiens (L.) tested, 50 were positive. One of 51 pools of Culex tarsalis Coquillet was also positive. The flavivirus minimum infection rates (expressed as the number of positive mosquito pools per 1,000 mosquitoes tested) for Cx. pipiens and Cx. tarsalis were 10.3 and 1.2, respectively. Flavivirus RNA was not detected in Aedes triseriatus (Say) (52 pools), Culex erraticus (Dyar & Knab) (25 pools), or Culex territans Walker (one pool). Sequence analysis of all RT-PCR products revealed that the mosquitoes had been infected with Culex flavivirus (CxFV), an insect-specific virus previously isolated in Japan, Indonesia, Texas, Mexico, Guatemala and Trinidad. The complete genome of one isolate was sequenced, as were the envelope protein genes of eight other isolates. Phylogenetic analysis revealed that CxFV isolates from the United States (Iowa and Texas) are more closely related to CxFV isolates from Asia than those from Mexico, Guatemala, and Trinidad

    Universal primers for the amplification and sequence analysis pf actin-1 from diverse mosquito species

    Get PDF
    We report the development of universal primers for the reverse-transcription polymerase chain reaction (RT-PCR) amplification and nucleotide sequence analysis of actin cDNAs from taxonomically diverse mosquito species. Primers specific to conserved regions of the invertebrate actin-1 gene were designed after actin cDNA sequences of Anopheles gambiae, Bombyx mori, Drosophila melanogaster, and Caenorhabditis elegans. The efficacy of these primers was determined by RT-PCR with the use of total RNA from mosquitoes belonging to 30 species and 8 genera (Aedes, Anopheles, Culex, Deinocerites, Mansonia, Psorophora, Toxorhynchites, and Wyeomyia). The RT-PCR products were sequenced, and sequence data were used to design additional primers. One primer pair, denoted as Act-2F (5′-ATGGTCGGYATGGGNCAGAAGGACTC-3′) and Act-8R (5′-GATTCCATACCCAGGAAG-GADGG-3′), successfully amplified an RT-PCR product of the expected size (683-nt) in all mosquito spp. tested. We propose that this primer pair can be used as an internal control to test the quality of RNA from mosquitoes collected in vector surveillance studies. These primers can also be used in molecular experiments in which the detection, amplification or silencing of a ubiquitously expressed mosquito housekeeping gene is necessary. Sequence and phylogenetic data are also presented in this report

    Detection of RNA from a Novel West Nile-like Virus and High Prevalence of an Insect-specific Flavivirus in Mosquitoes in the Yucatan Peninsula of Mexico

    Get PDF
    As part of our ongoing surveillance efforts for West Nile virus (WNV) in the Yucatan Peninsula of Mexico, 96,687 mosquitoes collected from January through December 2007 were assayed by virus isolation in mammalian cells. Three mosquito pools caused cytopathic effect. Two isolates were orthobunyaviruses (Cache Valley virus and Kairi virus) and the identity of the third infectious agent was not determined. A subset of mosquitoes was also tested by reverse transcription-polymerase chain reaction (RT-PCR) using WNV-, flavivirus-, alphavirus-, and orthobunyavirus-specific primers. A total of 7,009 Culex quinquefasciatus in 210 pools were analyzed. Flavivirus RNA was detected in 146 (70%) pools, and all PCR products were sequenced. The nucleotide sequence of one PCR product was most closely related (71-73% identity) with homologous regions of several other flaviviruses, including WNV, St. Louis encephalitis virus, and Ilheus virus. These data suggest that a novel flavivirus (tentatively named T\u27Ho virus) is present in Mexico. The other 145 PCR products correspond to Culex flavivirus, an insect-specific flavivirus first isolated in Japan in 2003. Culex flavivirus was isolated in mosquito cells from approximately one in four homogenates tested. The genomic sequence of one isolate was determined. Surprisingly, heterogeneous sequences were identified at the distal end of the 5\u27 untranslated region

    Fate of the H-NS–Repressed bgl Operon in Evolution of Escherichia coli

    Get PDF
    In the enterobacterial species Escherichia coli and Salmonella enterica, expression of horizontally acquired genes with a higher than average AT content is repressed by the nucleoid-associated protein H-NS. A classical example of an H-NS–repressed locus is the bgl (aryl-β,D-glucoside) operon of E. coli. This locus is “cryptic,” as no laboratory growth conditions are known to relieve repression of bgl by H-NS in E. coli K12. However, repression can be relieved by spontaneous mutations. Here, we investigated the phylogeny of the bgl operon. Typing of bgl in a representative collection of E. coli demonstrated that it evolved clonally and that it is present in strains of the phylogenetic groups A, B1, and B2, while it is presumably replaced by a cluster of ORFans in the phylogenetic group D. Interestingly, the bgl operon is mutated in 20% of the strains of phylogenetic groups A and B1, suggesting erosion of bgl in these groups. However, bgl is functional in almost all B2 isolates and, in approximately 50% of them, it is weakly expressed at laboratory growth conditions. Homologs of bgl genes exist in Klebsiella, Enterobacter, and Erwinia species and also in low GC-content Gram-positive bacteria, while absent in E. albertii and Salmonella sp. This suggests horizontal transfer of bgl genes to an ancestral Enterobacterium. Conservation and weak expression of bgl in isolates of phylogenetic group B2 may indicate a functional role of bgl in extraintestinal pathogenic E. coli

    CAPG

    No full text
    Data used in release of CAP
    corecore