419 research outputs found

    A Non-coding RNA of Insect HzNV-1 Virus Establishes Latent Viral Infection through MicroRNA

    Get PDF
    Heliothis zea nudivirus-1 (HzNV-1) is an insect virus previously known as Hz-1 baculovirus. One of its major early genes, hhi1, is responsible for the establishment of productive viral infection; another gene, pag1, which expresses a non-coding RNA, is the only viral transcript detectable during viral latency. Here we showed that this non-coding RNA was further processed into at least two distinct miRNAs, which targeted and degraded hhi1 transcript. This is a result strikingly similar to a recent report that herpes simplex virus produces tightly-regulated latent specific miRNAs to silence its own key early transcripts. Nevertheless, proof for the establishment of viral latency by miRNA is still lacking. We further showed that HzNV-1 latency could be directly induced by pag1-derived miRNAs in cells infected with a pag1-deleted, latency-deficient virus. This result suggests the existence of a novel mechanism, where miRNAs can be functional for the establishment of viral latency

    Methods to study splicing from high-throughput RNA Sequencing data

    Full text link
    The development of novel high-throughput sequencing (HTS) methods for RNA (RNA-Seq) has provided a very powerful mean to study splicing under multiple conditions at unprecedented depth. However, the complexity of the information to be analyzed has turned this into a challenging task. In the last few years, a plethora of tools have been developed, allowing researchers to process RNA-Seq data to study the expression of isoforms and splicing events, and their relative changes under different conditions. We provide an overview of the methods available to study splicing from short RNA-Seq data. We group the methods according to the different questions they address: 1) Assignment of the sequencing reads to their likely gene of origin. This is addressed by methods that map reads to the genome and/or to the available gene annotations. 2) Recovering the sequence of splicing events and isoforms. This is addressed by transcript reconstruction and de novo assembly methods. 3) Quantification of events and isoforms. Either after reconstructing transcripts or using an annotation, many methods estimate the expression level or the relative usage of isoforms and/or events. 4) Providing an isoform or event view of differential splicing or expression. These include methods that compare relative event/isoform abundance or isoform expression across two or more conditions. 5) Visualizing splicing regulation. Various tools facilitate the visualization of the RNA-Seq data in the context of alternative splicing. In this review, we do not describe the specific mathematical models behind each method. Our aim is rather to provide an overview that could serve as an entry point for users who need to decide on a suitable tool for a specific analysis. We also attempt to propose a classification of the tools according to the operations they do, to facilitate the comparison and choice of methods.Comment: 31 pages, 1 figure, 9 tables. Small corrections adde

    Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Get PDF
    BACKGROUND: Gene expression is regulated mainly by transcription factors (TFs) that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS) using position weight matrices (PWMs) that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. Also unclear is the identification of TFs that have TFBS concentrated in promoters and to what level this occurs. This study hopes to answer some of these questions. RESULTS: We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group occupied 11 percent. After reducing the effect of CpG islands (CGI) against the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster), we identified two PWM groups including those strongly correlated with CGI and those not correlated with CGI. CONCLUSION: Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1) those that show TFBS clustered in promoters associated with CGI, and (2) those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in regulatory regions

    Linkage mapping bovine EST-based SNP

    Get PDF
    BACKGROUND: Existing linkage maps of the bovine genome primarily contain anonymous microsatellite markers. These maps have proved valuable for mapping quantitative trait loci (QTL) to broad regions of the genome, but more closely spaced markers are needed to fine-map QTL, and markers associated with genes and annotated sequence are needed to identify genes and sequence variation that may explain QTL. RESULTS: Bovine expressed sequence tag (EST) and bacterial artificial chromosome (BAC)sequence data were used to develop 918 single nucleotide polymorphism (SNP) markers to map genes on the bovine linkage map. DNA of sires from the MARC reference population was used to detect SNPs, and progeny and mates of heterozygous sires were genotyped. Chromosome assignments for 861 SNPs were determined by twopoint analysis, and positions for 735 SNPs were established by multipoint analyses. Linkage maps of bovine autosomes with these SNPs represent 4585 markers in 2475 positions spanning 3058 cM . Markers include 3612 microsatellites, 913 SNPs and 60 other markers. Mean separation between marker positions is 1.2 cM. New SNP markers appear in 511 positions, with mean separation of 4.7 cM. Multi-allelic markers, mostly microsatellites, had a mean (maximum) of 216 (366) informative meioses, and a mean 3-lod confidence interval of 3.6 cM Bi-allelic markers, including SNP and other marker types, had a mean (maximum) of 55 (191) informative meioses, and were placed within a mean 8.5 cM 3-lod confidence interval. Homologous human sequences were identified for 1159 markers, including 582 newly developed and mapped SNP. CONCLUSION: Addition of these EST- and BAC-based SNPs to the bovine linkage map not only increases marker density, but provides connections to gene-rich physical maps, including annotated human sequence. The map provides a resource for fine-mapping quantitative trait loci and identification of positional candidate genes, and can be integrated with other data to guide and refine assembly of bovine genome sequence. Even after the bovine genome is completely sequenced, the map will continue to be a useful tool to link observable phenotypes and animal genotypes to underlying genes and molecular mechanisms influencing economically important beef and dairy traits

    Comparison of Influenza and SIV Specific CD8 T Cell Responses in Macaques

    Get PDF
    Macaques are a potentially useful non-human primate model to compare memory T-cell immunity to acute virus pathogens such as influenza virus and effector T-cell responses to chronic viral pathogens such as SIV. However, immunological reagents to study influenza CD8+ T-cell responses in the macaque model are limited. We recently developed an influenza-SIV vaccination model of pigtail macaques (Macaca nemestrina) and used this to study both influenza-specific and SIV-specific CD8+ T-cells in 39 pigtail macaques expressing the common Mane-A*10+ (Mane-A01*084) MHC-I allele. To perform comparative studies between influenza and SIV responses a common influenza nucleoprotein-specific CD8+ T-cell response was mapped to a minimal epitope (termed RA9), MHC-restricted to Mane-A*10 and an MHC tetramer developed to study this response. Influenza-specific memory CD8+ T-cell response maintained a highly functional profile in terms of multitude of effector molecule expression (CD107a, IFN-γ, TNF-α, MIP-1β and IL-2) and showed high avidity even in the setting of SIV infection. In contrast, within weeks following active SIV infection, SIV-specific CD8+ effector T-cells expressed fewer cytokines/degranulation markers and had a lower avidity compared to influenza specific CD8+ T-cells. Further, the influenza specific memory CD8 T-cell response retained stable expression of the exhaustion marker programmed death-marker-1 (PD-1) and co-stimulatory molecule CD28 following infection with SIV. This contrasted with the effector SIV-specific CD8+ T-cells following SIV infection which expressed significantly higher amounts of PD-1 and lower amounts of CD28. Our results suggest that strategies to maintain a more functional CD8+ T-cell response, profile may assist in controlling HIV disease

    MICA: desktop software for comprehensive searching of DNA databases

    Get PDF
    BACKGROUND: Molecular biologists work with DNA databases that often include entire genomes. A common requirement is to search a DNA database to find exact matches for a nondegenerate or partially degenerate query. The software programs available for such purposes are normally designed to run on remote servers, but an appealing alternative is to work with DNA databases stored on local computers. We describe a desktop software program termed MICA (K-Mer Indexing with Compact Arrays) that allows large DNA databases to be searched efficiently using very little memory. RESULTS: MICA rapidly indexes a DNA database. On a Macintosh G5 computer, the complete human genome could be indexed in about 5 minutes. The indexing algorithm recognizes all 15 characters of the DNA alphabet and fully captures the information in any DNA sequence, yet for a typical sequence of length L, the index occupies only about 2L bytes. The index can be searched to return a complete list of exact matches for a nondegenerate or partially degenerate query of any length. A typical search of a long DNA sequence involves reading only a small fraction of the index into memory. As a result, searches are fast even when the available RAM is limited. CONCLUSION: MICA is suitable as a search engine for desktop DNA analysis software

    The reference human nuclear mitochondrial sequences compilation validated and implemented on the UCSC genome browser

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Eukaryotic nuclear genomes contain fragments of mitochondrial DNA called NumtS (Nuclear mitochondrial Sequences), whose mode and time of insertion, as well as their functional/structural role within the genome are debated issues. Insertion sites match with chromosomal breaks, revealing that micro-deletions usually occurring at non-homologous end joining <it>loci </it>become reduced in presence of NumtS. Some NumtS are involved in recombination events leading to fragment duplication. Moreover, NumtS are polymorphic, a feature that renders them candidates as population markers. Finally, they are a cause of contamination during human mtDNA sequencing, leading to the generation of false heteroplasmies.</p> <p>Results</p> <p>Here we present RHNumtS.2, the most exhaustive human NumtSome catalogue annotating 585 NumtS, 97% of which were here validated in a European individual and in HapMap samples. The NumtS complete dataset and related features have been made available at the UCSC Genome Browser. The produced sequences have been submitted to INSDC databases. The implementation of the RHNumtS.2 tracks within the UCSC Genome Browser has been carried out with the aim to facilitate browsing of the NumtS tracks to be exploited in a wide range of research applications.</p> <p>Conclusions</p> <p>We aimed at providing the scientific community with the most exhaustive overview on the human NumtSome, a resource whose aim is to support several research applications, such as studies concerning human structural variation, diversity, and disease, as well as the detection of false heteroplasmic mtDNA variants. Upon implementation of the NumtS tracks, the application of the BLAT program on the UCSC Genome Browser has now become an additional tool to check for heteroplasmic artefacts, supported by data available through the NumtS tracks.</p

    Genome Assembly Has a Major Impact on Gene Content: A Comparison of Annotation in Two Bos Taurus Assemblies

    Get PDF
    Gene and SNP annotation are among the first and most important steps in analyzing a genome. As the number of sequenced genomes continues to grow, a key question is: how does the quality of the assembled sequence affect the annotations? We compared the gene and SNP annotations for two different Bos taurus genome assemblies built from the same data but with significant improvements in the later assembly. The same annotation software was used for annotating both sequences. While some annotation differences are expected even between high-quality assemblies such as these, we found that a staggering 40% of the genes (>9,500) varied significantly between assemblies, due in part to the availability of new gene evidence but primarily to genome mis-assembly events and local sequence variations. For instance, although the later assembly is generally superior, 660 protein coding genes in the earlier assembly are entirely missing from the later genome's annotation, and approximately 3,600 (15%) of the genes have complex structural differences between the two assemblies. In addition, 12–20% of the predicted proteins in both assemblies have relatively large sequence differences when compared to their RefSeq models, and 6–15% of bovine dbSNP records are unrecoverable in the two assemblies. Our findings highlight the consequences of genome assembly quality on gene and SNP annotation and argue for continued improvements in any draft genome sequence. We also found that tracking a gene between different assemblies of the same genome is surprisingly difficult, due to the numerous changes, both small and large, that occur in some genes. As a side benefit, our analyses helped us identify many specific loci for improvement in the Bos taurus genome assembly

    Rice-Map: a new-generation rice genome browser

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The concurrent release of rice genome sequences for two subspecies (<it>Oryza sativa </it>L. ssp. <it>japonica </it>and <it>Oryza sativa </it>L. ssp. <it>indica</it>) facilitates rice studies at the whole genome level. Since the advent of high-throughput analysis, huge amounts of functional genomics data have been delivered rapidly, making an integrated online genome browser indispensable for scientists to visualize and analyze these data. Based on next-generation web technologies and high-throughput experimental data, we have developed Rice-Map, a novel genome browser for researchers to navigate, analyze and annotate rice genome interactively.</p> <p>Description</p> <p>More than one hundred annotation tracks (81 for <it>japonica </it>and 82 for <it>indica</it>) have been compiled and loaded into Rice-Map. These pre-computed annotations cover gene models, transcript evidences, expression profiling, epigenetic modifications, inter-species and intra-species homologies, genetic markers and other genomic features. In addition to these pre-computed tracks, registered users can interactively add comments and research notes to Rice-Map as User-Defined Annotation entries. By smoothly scrolling, dragging and zooming, users can browse various genomic features simultaneously at multiple scales. On-the-fly analysis for selected entries could be performed through dedicated bioinformatic analysis platforms such as WebLab and Galaxy. Furthermore, a BioMart-powered data warehouse "Rice Mart" is offered for advanced users to fetch bulk datasets based on complex criteria.</p> <p>Conclusions</p> <p>Rice-Map delivers abundant up-to-date <it>japonica </it>and <it>indica </it>annotations, providing a valuable resource for both computational and bench biologists. Rice-Map is publicly accessible at <url>http://www.ricemap.org/</url>, with all data available for free downloading.</p

    A randomised control crossover trial of a theory based intervention to improve sun-safe and healthy behaviours in construction workers:Study protocol

    Get PDF
    Abstract Background Exposure to sunlight can have both positive and negative health impacts. Excessive exposure to ultra-violet (UV) radiation from the sun can cause skin cancer, however insufficient exposure to sunlight has a detrimental effect on production of Vitamin D. In the construction industry there are onsite proactive behaviours for safety, but sun-safety remains a low priority. There is limited research on understanding the barriers to adopting sun-safe behaviours and the association this may have with Vitamin D production. This paper reports a protocol for an intervention study, using text messaging in combination with a supportive smartphone App. The intervention aims to both reduce UV exposure during months with higher UV levels and promote appropriate dietary changes to boost Vitamin D levels during months with low UV levels. Method/design Approximately 60 construction workers will be recruited across the United Kingdom. A randomised control crossover trial (RCCT) will be used to test the intervention, with randomisation at site level – i.e. participants will receive both the control (no text messages or supportive App support) and intervention (daily text messages and supportive App). Using the Theory of Planned Behaviour (TPB) the intervention focuses on supporting sun-safety and healthy dietary decisions in relation to Vitamin D intake. The intervention emphasises cultivating the perception of normative support in the workplace, increasing awareness of control and self-efficacy in taking sun-protective behaviours, making healthier eating choices to boost Vitamin D, and tackling stigmas attached to image and group norms. Each study epoch will last 21 days with intervention text messages delivered on workdays only. The supportive App will provide supplementary information about sun protective behaviours and healthy dietary choices. The primary outcome measure is 25-hydroxy-Vitamin D [25(OH)D] level (obtained using blood spot sampling), which will be taken pre and post control and intervention periods. Secondary outcome measures are two-fold, (1) using the TPB to detect changes in behaviour, and (2) quantifying UV exposure during the UK peak radiation season (April–September) using body-mounted UV sensors. Discussion This study will provide important information about the effectiveness of a technology-based intervention to promote sun-safety and healthy behaviours in outdoor construction workers. Trial registration ISRCTN15888934 retrospectively registered 15.01.2018
    corecore