    Genomics: From Microbes to Man

    Sir John Crawford Memorial Lecture delivered by J. Craig Venter, President and Chief Scientific Officer of Celera Genomics Corporation and founder and Chairman of the Board of The Institute for Genomic Research (TIGR), during CGIAR International Centers Week 2000. Venter describes the development of new technologies and methods for rapidly characterizing and sequencing genomes. He refers to the collaboration between TIGR and ILRI on sequencing the genome of Theileria parva, the tick-borne parasite that causes East Coast Fever, as a step toward development of a vaccine. More generally, he discusses the significance of comparing genes across species, and the potential applications of this knowledge in developing vaccines and predicting individual susceptibility to specific diseases.

    An Efficient Algorithm For Chinese Postman Walk on Bi-directed de Bruijn Graphs

    Sequence assembly from short reads is an important problem in biology. It is known that solving the sequence assembly problem exactly on a bi-directed de Bruijn graph or a string graph is intractable. However, finding a Shortest Double-stranded DNA string (SDDNA) containing all the k-long words in the reads seems to be a good heuristic for getting close to the original genome. This problem is equivalent to finding a cyclic Chinese Postman (CP) walk on the underlying un-weighted bi-directed de Bruijn graph built from the reads. The Chinese Postman walk Problem (CPP) is solved by reducing it to a general bi-directed flow on this graph, which runs in $O(|E|^2 \log^2(|V|))$ time. In this paper we show that the cyclic CPP on bi-directed graphs can be solved without reducing it to bi-directed flow. We present a $\Theta(p(|V| + |E|) \log(|V|) + (d_{\max} p)^3)$ time algorithm to solve the cyclic CPP on a weighted bi-directed de Bruijn graph, where $p = \max\{|\{v \mid d_{in}(v) - d_{out}(v) > 0\}|, |\{v \mid d_{in}(v) - d_{out}(v) < 0\}|\}$ and $d_{\max} = \max\{|d_{in}(v) - d_{out}(v)|\}$. Our algorithm performs asymptotically better than the bi-directed flow algorithm when the number of imbalanced nodes p is much smaller than the number of nodes in the bi-directed graph. In our experimental results on various datasets, the value of p/|V| lies between 0.08% and 0.13% with 95% probability.
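
The quantities p and d_max in the abstract above depend only on the in-degree and out-degree of each node, so they can be illustrated with a small sketch. The Python below is a simplification and not the paper's algorithm: it builds an ordinary directed (single-stranded) de Bruijn graph from the distinct k-mers in a set of reads, ignoring reverse complements and the bi-directed edge structure. The function names `de_bruijn_edges` and `imbalance_stats` are hypothetical.

```python
from collections import Counter


def de_bruijn_edges(reads, k):
    """Return one directed edge (prefix, suffix) per distinct k-mer.

    Assumption: single-stranded graph; reverse complements are ignored,
    so this is not the bi-directed graph used in the paper.
    """
    kmers = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmers.add(read[i:i + k])
    # Each k-mer links its (k-1)-prefix to its (k-1)-suffix.
    return sorted((kmer[:-1], kmer[1:]) for kmer in kmers)


def imbalance_stats(edges):
    """Return (p, d_max, number_of_nodes) for a directed edge list.

    p     = max(#nodes with din - dout > 0, #nodes with din - dout < 0)
    d_max = max over nodes of |din - dout|
    """
    din = Counter(v for _, v in edges)
    dout = Counter(u for u, _ in edges)
    nodes = set(din) | set(dout)
    diffs = [din[n] - dout[n] for n in nodes]
    surplus = sum(1 for d in diffs if d > 0)
    deficit = sum(1 for d in diffs if d < 0)
    d_max = max((abs(d) for d in diffs), default=0)
    return max(surplus, deficit), d_max, len(nodes)


if __name__ == "__main__":
    p, d_max, n = imbalance_stats(de_bruijn_edges(["ACGT"], 3))
    print(p, d_max, n)
```

For a linear read such as "ACGT" with k = 3, only the first and last nodes are imbalanced, whereas a read that closes a cycle in the graph (for example "ACGTAC") leaves every node balanced. The ratio p divided by the number of nodes is the quantity the abstract reports as being very small in practice.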

    Meeting the requirements of both classroom-based and systemic assessment of mathematics proficiency: the potential of Rasch measurement theory

    The challenges inherent in assessing mathematical proficiency depend on a number of factors, amongst which are an explicit view of what constitutes mathematical proficiency, an understanding of how children learn, and the purpose and function of teaching. All of these factors affect the choice of approach to assessment. In this article we distinguish between two broad types of assessment: classroom-based and systemic. We argue that the process of assessment informed by Rasch measurement theory (RMT) can potentially support the demands of both classroom-based and systemic assessment, particularly if a developmental approach to learning is adopted and an underlying model of developing mathematical proficiency is explicit in the assessment instruments and their supporting material. An example of a mathematics instrument and its analysis, which illustrates this approach, is presented. We note that the role of assessment in the 21st century is potentially powerful. This influential role can only be justified if the assessments are of high quality and can be selected to match suitable moments in learning progress and the teaching process. Users of assessment data must have sufficient knowledge and insight to interpret the resulting numbers validly, and sufficient discernment to make considered educational inferences from the data for teaching and learning responses.
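
For readers unfamiliar with Rasch measurement theory, the core of the dichotomous Rasch model is a single formula: the probability of a correct response depends only on the difference between a person's ability and an item's difficulty, both expressed in logits. The sketch below shows that formula only; it is not the instrument or analysis described in the article, and the function names and example values are hypothetical.

```python
import math


def rasch_probability(ability, difficulty):
    """Probability of a correct response under the dichotomous Rasch model.

    Both arguments are in logits; P = 1 / (1 + exp(-(ability - difficulty))).
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def expected_score(ability, difficulties):
    """Expected number of correct responses on a set of items."""
    return sum(rasch_probability(ability, b) for b in difficulties)


if __name__ == "__main__":
    # Hypothetical item difficulties, from easy to hard.
    items = [-1.0, 0.0, 1.0]
    print(rasch_probability(0.0, 0.0))
    print(expected_score(0.0, items))
```

Because person and item parameters sit on the same scale, an item whose difficulty equals a learner's ability is answered correctly with probability 0.5, which is one way to read the abstract's point about matching assessments to moments in learning progress.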

    Evaluating African horse sickness virus in horses and field-caught Culicoides biting midges on the East Rand, Gauteng Province, South Africa

    A prospective study was undertaken during 2013 and 2014 to determine the prevalence of African horse sickness virus (AHSV) in Culicoides midges and the incidence of infection caused by the virus in 28 vaccinated resident horses on two equine establishments on the East Rand, Gauteng Province, South Africa. Field-caught Culicoides midges, together with whole blood samples from participating horses, were collected every two weeks at each establishment. Culicoides midges and blood samples were tested for the presence of AHSV RNA by real-time quantitative reverse transcription polymerase chain reaction. Nine immunised horses became infected with AHSV during the study period, although the infections were subclinical. African horse sickness virus was also identified in a field-collected midge pool. The observations recapitulate previously published data in another setting, where further investigation is warranted to determine what role subclinical infection plays in the disease's epidemiology.

    Evolution of allostery in the cyclic nucleotide binding module

    Analysis of cyclic nucleotide binding (CNB) domains shows that they have evolved to sense a wide variety of second messenger signals; a mechanism for allosteric regulation by CNB domains is proposed.

    Stalking the Fourth Domain in Metagenomic Data: Searching for, Discovering, and Interpreting Novel, Deep Branches in Marker Gene Phylogenetic Trees

    BACKGROUND: Most of our knowledge about the ancient evolutionary history of organisms has been derived from data associated with specific known organisms (i.e., organisms that we can study directly such as plants, metazoans, and culturable microbes). Recently, however, a new source of data for such studies has arrived: DNA sequence data generated directly from environmental samples. Such metagenomic data has enormous potential in a variety of areas including, as we argue here, in studies of very early events in the evolution of gene families and of species. METHODOLOGY/PRINCIPAL FINDINGS: We designed and implemented new methods for analyzing metagenomic data and used them to search the Global Ocean Sampling (GOS) expedition data set for novel lineages in three gene families commonly used in phylogenetic studies of known and unknown organisms: small subunit rRNA and the recA and rpoB superfamilies. Though the methods available could not accurately identify very deeply branched ss-rRNAs (largely due to difficulties in making robust sequence alignments for novel rRNA fragments), our analysis revealed the existence of multiple novel branches in the recA and rpoB gene families. Analysis of available sequence data likely from the same genomes as these novel recA and rpoB homologs was then used to further characterize the possible organismal source of the novel sequences. CONCLUSIONS/SIGNIFICANCE: Of the novel recA and rpoB homologs identified in the metagenomic data, some likely come from uncharacterized viruses while others may represent ancient paralogs not yet seen in any cultured organism. A third possibility is that some come from novel cellular lineages that are only distantly related to any organisms for which sequence data is currently available. If there exist any major, but so-far-undiscovered, deeply branching lineages in the tree of life, we suggest that methods such as those described herein currently offer the best way to search for them

    Observable dynamic models of reagent effects for model based froth flotation control

    This article demonstrates the feasibility of including simple reagent addition models in an existing observable dynamic model of a froth flotation circuit. The existing model has full state observability and parameter identifiability using measurements that are commonly available on flotation circuits. This article qualitatively evaluates the possible impact of varying frother dosage on the model parameters. A Sobol sensitivity analysis indicates that the air recovery model parameters are most influential in the determination of grade and recovery. The model is expanded with two different reagent effect models. Both expansions include mass balance models of the frother concentration in each cell. The first model expands an empirical parameter in the air recovery model, related to the froth height at which peak air recovery (PAR) is achieved, as a linear function of frother concentration. The second model adds a linear frother concentration term to the existing air recovery model to modify the steady-state air recovery directly. Observability analyses of the expanded models show that all states and the important time-varying model parameters are observable (and identifiable) from the available on-line measurements. Most importantly, the frother concentrations are shown to be observable without concentration measurements. Simulations of the model expansions show that the second model can qualitatively predict the impact of increased frother dosage on air recovery, grade, and recovery, while the first model can only predict the correct effect under certain conditions.
    https://www.journals.elsevier.com/ifac-papersonline
    Electrical, Electronic and Computer Engineering
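
The abstract above mentions two ingredients: a mass balance of frother concentration in each cell, and a linear frother term added to an air recovery model. The Python sketch below illustrates the general idea only and makes strong assumptions that are not taken from the article: each cell is treated as perfectly mixed with a constant volume and a single through-flow, the cells are in series, and the air recovery correction is a bare linear offset. All names (`simulate_frother`, `air_recovery_with_frother`, `gain`) and all numbers are hypothetical.

```python
def simulate_frother(c_feed, flow, volumes, dt, steps):
    """Explicit Euler integration of a well-mixed mass balance per cell.

    Assumed model (not from the article), for cells in series:
        dC_i/dt = (flow / V_i) * (C_{i-1} - C_i),  with C_0 = c_feed
    Returns the list of cell concentrations after `steps` time steps.
    """
    conc = [0.0] * len(volumes)
    for _ in range(steps):
        # Each cell is fed by the previous cell; the first by the feed.
        upstream = [c_feed] + conc[:-1]
        conc = [
            c + dt * (flow / v) * (u - c)
            for c, u, v in zip(conc, upstream, volumes)
        ]
    return conc


def air_recovery_with_frother(base_air_recovery, gain, frother_conc):
    """Add a linear frother-concentration term to a base air recovery.

    `gain` is a hypothetical sensitivity parameter, not a value from
    the article.
    """
    return base_air_recovery + gain * frother_conc


if __name__ == "__main__":
    final = simulate_frother(c_feed=10.0, flow=1.0,
                             volumes=[2.0, 2.0], dt=0.1, steps=2000)
    print(final)
    print(air_recovery_with_frother(0.5, 0.01, final[-1]))
```

In this simplified setting every cell converges to the feed concentration at steady state, so a step change in dosage propagates down the bank with a lag set by each cell's volume-to-flow ratio.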

    Designer diatom episomes delivered by bacterial conjugation

    Eukaryotic microalgae hold great promise for the bioproduction of fuels and higher-value chemicals. However, compared with model genetic organisms such as Escherichia coli and Saccharomyces cerevisiae, characterization of the complex biology and biochemistry of algae, as well as strain improvement, has been hampered by inefficient genetic tools. To date, many algal species are transformable only via particle bombardment, and the introduced DNA is integrated randomly into the nuclear genome. Here we describe the first nuclear episomal vector for diatoms and a plasmid delivery method via conjugation from Escherichia coli to the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana. We identify a yeast-derived sequence that enables stable episome replication in these diatoms even in the absence of antibiotic selection, and show that episomes are maintained as closed circles at a copy number equivalent to that of native chromosomes. This highly efficient genetic system facilitates high-throughput functional characterization of algal genes and accelerates molecular phytoplankton research.