89 research outputs found
Resilience development in the new generation of production mining employees
Abstract: This paper proposes that at least as much effort as is placed on technical development of youth, should be placed on values development. In the context of the minerals industry in the developing world and particularly in Africa, the importance of values development is grossly under emphasised. With commodity prices falling and extraction costs soaring it is vital that meaningful and practical effort is made to counter the despondency that accompanies such challenges. As a contribution to such resilience building, the University of Johannesburg Mining Department introduced the “Stairs Project.” This project is a practical way not only of developing physical fitness but also developing personal values of resilience, dedication, commitment, integrity, and many more. Such values are essential if sustainability is to be achieved in the minerals industry during the challenging times currently being experienced. The authors propose that without strong focus on development of resilience of the individual; the resilience and survival of the industry and the economy itself, will be severely threatened. The appropriate time to start such developmental intervention is “yesterday” and in “infancy” but the best we can do is to do it “now” with the “stock of youth” that we have entering the industry
Inferring bona fide transfrags in RNA-Seq derived-transcriptome assemblies of non-model organisms
Background: De novo transcriptome assembly of short transcribed fragments (transfrags) produced from sequencing-by-synthesis technologies often results in redundant datasets with differing levels of unassembled, partially assembled or mis-assembled transcripts. Post-assembly processing intended to reduce redundancy typically involves reassembly or clustering of assembled sequences. However, these approaches are mostly based on common word heuristics and often create clusters of biologically unrelated sequences, resulting in loss of unique
transfrags annotations and propagation of mis-assemblies.
Results: Here, we propose a structured framework that consists of a few steps in pipeline architecture for Inferring Functionally Relevant Assembly-derived Transcripts (IFRAT). IFRAT combines 1) removal of identical subsequences,
2) error tolerant CDS prediction, 3) identification of coding potential, and 4) complements BLAST with a multiple domain architecture annotation that reduces non-specific domain annotation. We demonstrate that independent of the assembler, IFRAT selects bona fide transfrags (with CDS and coding potential) from the transcriptome assembly of a model organism without relying on post-assembly clustering or reassembly. The robustness of IFRAT is inferred on RNA-Seq data of Neurospora crassa assembled using de Bruijn graph-based assemblers, in single (Trinity and Oases-25) and multiple (Oases-Merge and additive or pooled) k-mer modes. Single k-mer assemblies contained fewer transfrags compared to the multiple k-mer assemblies. However, Trinity identified a comparable number of
predicted coding sequence and gene loci to Oases pooled assembly. IFRAT selects bona fide transfrags representing over 94% of cumulative BLAST-derived functional annotations of the unfiltered assemblies. Between 4-6% are lost when orphan transfrags are excluded and this represents only a tiny fraction of annotation derived from functional transference by sequence similarity. The median length of bona fide transfrags ranged from 1.5kb (Trinity) to 2kb (Oases), which is consistent with the average coding sequence length in fungi. The fraction of transfrags that could be associated with gene ontology terms ranged from 33-50%, which is also high for domain based annotation. We showed that unselected transfrags were mostly truncated and represent sequences from intronic, untranslated
(5′ and 3′) regions and non-coding gene loci.
Conclusions: IFRAT simplifies post-assembly processing providing a reference transcriptome enriched with functionally relevant assembly-derived transcripts for non-model organism.Department of Science and Technology
National Research Foundation
South African Research Chair initiativeWeb of Scienc
Chromosomal-level assembly of the Asian seabass genome using long sequence reads and multi-layered scaffolding
We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a
tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical
and genetic mapping along with shared synteny from closely related fish species to
derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50
size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various
regions across the species’ native range. SNP analyses identified high levels of genetic
diversity and confirmed earlier indications of a population stratification comprising three
clades with signs of admixture apparent in the South-East Asian population. The quality of
the Asian seabass genome assembly far exceeds that of any other fish species, and will
serve as a new standard for fish genomics.Web of Scienc
Establishing minion sequencing and genome assembly procedures for the analysis of the rooibos (aspalathus linearis) genome
While plant genome analysis is gaining speed worldwide, few plant genomes have been
sequenced and analyzed on the African continent. Yet, this information holds the potential to
transform diverse industries as it unlocks medicinally and industrially relevant biosynthesis pathways
for bioprospecting. Considering that South Africa is home to the highly diverse Cape Floristic
Region, local establishment of methods for plant genome analysis is essential. Long-read sequencing
is becoming standard procedure for plant genome research, as these reads can span repetitive
regions of the DNA, substantially facilitating reassembly of a contiguous genome. With the MinION,
Oxford Nanopore offers a cost-efficient sequencing method to generate long reads; however, DNA
purification protocols must be adapted for each plant species to generate ultra-pure DNA, essential
for these analyses
The combat-tb workbench: Making powerful Mycobacterium tuberculosis bioinformatics accessible
Whole-genome sequencing (WGS) is a powerful method for detecting drug resistance, genetic diversity, and transmission dynamics of Mycobacterium tuberculosis. Implementation of WGS in public health microbiology laboratories is impeded by a lack of user-friendly, automated, and semiautomated pipelines. We present the COMBAT-TB Workbench, a modular, easy-to-install application that provides a web-based environment for Mycobacterium tuberculosis bioinformatics. The COMBAT-TB Workbench is built using two main software components: the IRIDA platform for its web-based user interface and data management capabilities and the Galaxy bioinformatics workflow platform for workflow execution
Galaxy training: A powerful framework for teaching!
There is an ongoing explosion of scientific datasets being generated, brought on by recent technological advances in many areas of the natural sciences. As a result, the life sciences have become increasingly computational in nature, and bioinformatics has taken on a central role in research studies. However, basic computational skills, data analysis, and stewardship are still rarely taught in life science educational programs, resulting in a skills gap in many of the researchers tasked with analysing these big datasets. In order to address this skills gap and empower researchers to perform their own data analyses, the Galaxy Training Network (GTN) has previously developed the Galaxy Training Platform (https://training.galaxyproject.org), an open access, community-driven framework for the collection of FAIR (Findable, Accessible, Interoperable, Reusable) training materials for data analysis utilizing the user-friendly Galaxy framework as its primary data analysis platform
Taste and odorant receptors of the coelecanth- a gene repertoire in transition
G-protein coupled chemosensory receptors (GPCR-CRs) aid in the perception of odors and tastes in vertebrates. So far, six GPCR-CR families have been identified that are conserved in most vertebrate species. Phylogenetic analyses indicate differing evolutionary dynamics between teleost fish and tetrapods. The coelacanth Latimeria chalumnae belongs to the lobe-finned fishes, which represent a phylogenetic link between these two groups. We searched the genome of L. chalumnae for GPCR-CRs and found that coelacanth taste receptors are more similar to those in tetrapods than in teleost fish: two coelacanth T1R2s co-segregate with the tetrapod T1R2s that recognize sweet substances, and our phylogenetic analyses indicate that the teleost T1R2s are closer related to T1R1s (umami taste receptors) than to tetrapod T1R2s. Furthermore, coelacanths are the first fish with a large repertoire of bitter taste receptors (58 T2Rs). Considering current knowledge on feeding habits of coelacanths the question arises if perception of bitter taste is the only function of these receptors. Similar to teleost fish, coelacanths have a variety of olfactory receptors (ORs) necessary for perception of water-soluble substances. However, they also have seven genes in the two tetrapod OR subfamilies predicted to recognize airborne molecules. The two coelacanth vomeronasal receptor families are larger than those in teleost fish, and similar to tetrapods, form V1R and V2R monophyletic clades. This may point to an advanced development of the vomeronasal organ as reported for lungfish. Our results show that the intermediate position of Latimeria in the phylogeny is reflected in its GPCR-CR repertoire.The South African Research Chairs Initiative of the Department of Science and Technology and the National Research Foundation: Grant 64751.Web of Scienc
The African Coelecanth genome provides insights into tetrapod evolution
The discovery of a living coelacanth specimen in 1938 was remarkable, as this lineage of lobe-finned fish was thought to have become extinct 70 million years ago. The modern coelacanth looks remarkably similar to many of its ancient relatives, and its evolutionary proximity to our own fish ancestors provides a glimpse of the fish that first walked on land. Here we report the genome sequence of the African coelacanth, Latimeria chalumnae. Through a phylogenomic analysis, we conclude that the lungfish, and not the coelacanth, is the closest living relative of tetrapods. Coelacanth protein-coding genes are significantly more slowly evolving than those of tetrapods, unlike other genomic features. Analyses of changes in genes and regulatory elements during the vertebrate adaptation to land highlight genes involved in immunity, nitrogen excretion and the development of fins, tail, ear, eye, brain and olfaction. Functional assays of enhancers involved in the fin-to-limb transition and in the emergence of extra-embryonic tissues show the importance of the coelacanth genome as a blueprint for understanding tetrapod evolution.Web of Scienc
COMBAT-TB-NeoDB: Fostering tuberculosis research through integrative analysis using graph database technologies
Tuberculosis (TB) is a significant global health threat, with onethird of the population infected with its causative agent
Mycobacterium tuberculosis (M.tb). Globally, researchers have been
responding with a plethora of heterogeneous TB databases with
each focusing on different subsets of TB data and present limited
options for data integration thus impeding the chances of integrative
analysis. Although each database can provide answers to certain
questions in its scope, it falls short in answering questions that require federated queries across multiple domains of biological
knowledge
The contribution of exon-skipping events on chromosome 22 to protein coding diversity
Completion of the human genome sequence provides evidence for a gene count with lower bound 30,000–40,000. Significant protein complexity may derive in part from multiple transcript isoforms. Recent EST based studies have revealed that alternate transcription, including alternative splicing, polyadenylation and transcription start sites, occurs within at least 30–40% of human genes. Transcript form surveys have yet to integrate the genomic context, expression, frequency, and contribution to protein diversity of isoform variation. We determine here the degree to which protein coding diversity may be influenced by alternate expression of transcripts by exhaustive manual confirmation of genome sequence annotation, and comparison to available transcript data to accurately associate skipped exon isoforms with genomic sequence. Relative expression levels of transcripts are estimated from EST database representation. The rigorous in silico method accurately identifies exon skipping using verified genome sequence. 545 genes have been studied in this first hand-curated assessment of exon skipping on chromosome 22
- …