135 research outputs found
TransFind—predicting transcriptional regulators for gene sets
The analysis of putative transcription factor binding sites in promoter regions of coregulated genes allows to infer the transcription factors that underlie observed changes in gene expression. While such analyses constitute a central component of the in-silico characterization of transcriptional regulatory networks, there is still a lack of simple-to-use web servers able to combine state-of-the-art prediction methods with phylogenetic analysis and appropriate multiple testing corrected statistics, which returns the results within a short time. Having these aims in mind we developed TransFind, which is freely available at http://transfind.sys-bio.net/
Integrating omics datasets with the OmicsPLS package
Background: With the exponential growth in available biomedical data, there is a need for data integration methods that can extract information about relationships between the data sets. However, these data sets might have very different characteristics. For interpretable results, data-specific variation needs to be quantified. For this task, Two-way Orthogonal Partial Least Squares (O2PLS) has been proposed. To facilitate application and development of the methodology, free and open-source software is required. However, this is not the case with O2PLS. Results: We introduce OmicsPLS, an open-source implementation of the O2PLS method in R. It can handle both low- and high-dimensional datasets efficiently. Generic methods for inspecting and visualizing results are implemented. Both a standard and faster alternative cross-validation methods are available to determine the number of components. A simulation study shows good performance of OmicsPLS compared to alternatives, in terms of accuracy and CPU runtime. We demonstrate OmicsPLS by integrating genetic and glycomic data. Conclusions: We propose the OmicsPLS R package: a free and open-source implementation of O2PLS for statistical data integration. OmicsPLS is available at https://cran.r-project.org/package=OmicsPLSand can be installed in R via install.packages("OmicsPLS")
Targetfinder.org: a resource for systematic discovery of transcription factor target genes
Targetfinder.org (http://targetfinder.org/) provides a web-based resource for finding genes that show a similar expression pattern to a group of user-selected genes. It is based on a large-scale gene expression compendium (>1200 experiments, >13 000 genes). The primary application of Targetfinder.org is to expand a list of known transcription factor targets by new candidate target genes. The user submits a group of genes (the ‘seed’), and as a result the web site provides a list of other genes ranked by similarity of their expression to the expression of the seed genes. Additionally, the web site provides information on a recovery/cross-validation test to check for consistency of the provided seed and the quality of the ranking. Furthermore, the web site allows to analyse affinities of a selected transcription factor to the promoter regions of the top-ranked genes in order to select the best new candidate target genes for further experimental analysis
Gentle Masking of Low-Complexity Sequences Improves Homology Search
Detection of sequences that are homologous, i.e. descended from a common ancestor, is a fundamental task in computational biology. This task is confounded by low-complexity tracts (such as atatatatatat), which arise frequently and independently, causing strong similarities that are not homologies. There has been much research on identifying low-complexity tracts, but little research on how to treat them during homology search. We propose to find homologies by aligning sequences with “gentle” masking of low-complexity tracts. Gentle masking means that the match score involving a masked letter is , where is the unmasked score. Gentle masking slightly but noticeably improves the sensitivity of homology search (compared to “harsh” masking), without harming specificity. We show examples in three useful homology search problems: detection of NUMTs (nuclear copies of mitochondrial DNA), recruitment of metagenomic DNA reads to reference genomes, and pseudogene detection. Gentle masking is currently the best way to treat low-complexity tracts during homology search
Regulation of Clock-Controlled Genes in Mammals
The complexity of tissue- and day time-specific regulation of thousands of clock-controlled genes (CCGs) suggests that many regulatory mechanisms contribute to the transcriptional output of the circadian clock. We aim to predict these mechanisms using a large scale promoter analysis of CCGs
Age-Associated Salivary MicroRNA Biomarkers for Oculopharyngeal Muscular Dystrophy
Small non-coding microRNAs (miRNAs) are involved in the regulation of mRNA stability. Their features, including high stability and secretion to biofluids, make them attractive as potential biomarkers for diverse pathologies. This is the first study reporting miRNA as potential biomarkers for oculopharyngeal muscular dystrophy (OPMD), an adult-onset myopathy. We hypothesized that miRNA that is differentially expressed in affected muscles from OPMD patients is secreted to biofluids and those miRNAs could be used as biomarkers for OPMD. We first identified candidate miRNAs from OPMD-affected muscles and from muscles from an OPMD mouse model using RNA sequencing. We then compared the OPMD-deregulated miRNAs to the literature and, subsequently, we selected a few candidates for expression studies in serum and saliva biofluids using qRT-PCR. We identified 126 miRNAs OPMD-deregulated in human muscles, but 36 deregulated miRNAs in mice only (pFDR < 0.05). Only 15 OPMD-deregulated miRNAs overlapped between the in humans and mouse studies. The majority of the OPMD-deregulated miRNAs showed opposite deregulation direction compared with known muscular dystrophies miRNAs (myoMirs), which are associated. In contrast, similar dysregulation direction was found for 13 miRNAs that are common between OPMD and aging muscles. A significant age-association (p < 0.05) was found for 17 OPMD-deregulated miRNAs (13.4%), whereas in controls, only six miRNAs (1.4%) showed a significant age-association, suggesting that miRNA expression in OPMD is highly age-associated. miRNA expression in biofluids revealed that OPMD-associated deregulation in saliva was similar to that in muscles, but not in serum. The same as in muscle, miRNA expression levels in saliva were also found to be associated with age (p < 0.05). Moreover, the majority of OPMD-miRNAs were found to be associated with dysphagia as an initial symptom. We suggest that levels of specific miRNAs in saliva can mark muscle degeneration in general and dysphagia in OPMDFrench Muscular Dystrophy Association (AFM-Téléthon). Research grant to the eOPMD (European OPMD consortium, V.R. and B.G.M.v.E.)
Age-associated salivary microRNA biomarkers for oculopharyngeal muscular dystrophy
Small non-coding microRNAs (miRNAs) are involved in the regulation of mRNA stability. Their features, including high stability and secretion to biofluids, make them attractive as potential biomarkers for diverse pathologies. This is the first study reporting miRNA as potential biomarkers for oculopharyngeal muscular dystrophy (OPMD), an adult-onset myopathy. We hypothesized that miRNA that is differentially expressed in affected muscles from OPMD patients is secreted to biofluids and those miRNAs could be used as biomarkers for OPMD. We first identified candidate miRNAs from OPMD-affected muscles and from muscles from an OPMD mouse model using RNA sequencing. We then compared the OPMD-deregulated miRNAs to the literature and, subsequently, we selected a few candidates for expression studies in serum and saliva biofluids using qRT-PCR. We identified 126 miRNAs OPMD-deregulated in human muscles, but 36 deregulated miRNAs in mice only (pFDR < 0.05). Only 15 OPMD-deregulated miRNAs overlapped between the in humans and mouse studies. The majority of the OPMD-deregulated miRNAs showed opposite deregulation direction compared with known muscular dystrophies miRNAs (myoMirs), which are associated. In contrast, similar dysregulation direction was found for 13 miRNAs that are common between OPMD and aging muscles. A significant age-association (p< 0.05) was found for 17 OPMD-deregulated miRNAs (13.4%), whereas in controls, only six miRNAs (1.4%) showed a significant age-association, suggesting that miRNA expression in OPMD is highly age-associated. miRNA expression in biofluids revealed that OPMD-associated deregulation in saliva was similar to that in muscles, but not in serum. The same as in muscle, miRNA expression levels in saliva were also found to be associated with age (p< 0.05). Moreover, the majority of OPMD-miRNAs were found to be associated with dysphagia as an initial symptom. We suggest that levels of specific miRNAs in saliva can mark muscle degeneration in general and dysphagia in OPMD.Molecular Epidemiolog
New function of the myostatin/activin type I receptor (ALK4) as a mediator of muscle atrophy and muscle regeneration
Skeletal muscle fibrosis and impaired muscle regeneration are major contributors to muscle wasting in Duchenne muscular dystrophy (DMD). Muscle growth is negatively regulated by myostatin (MSTN) and activins. Blockage of these pathways may improve muscle quality and function in DMD. Antisense oligonucleotides (AONs) were designed specifically to block the function of ALK4, a key receptor for the MSTN/activin pathway in skeletal muscle. AON-induced exon skipping resulted in specific Alk4 down-regulation, inhibition of MSTN activity, and increased myoblast differentiation in vitro Unexpectedly, a marked decrease in muscle mass (10%) was found after Alk4 AON treatment in mdx mice. In line with in vitro results, muscle regeneration was stimulated, and muscle fiber size decreased markedly. Notably, when Alk4 was down-regulated in adult wild-type mice, muscle mass decreased even more. RNAseq analysis revealed dysregulated metabolic functions and signs of muscle atrophy. We conclude that ALK4 inhibition increases myogenesis but also regulates the tight balance of protein synthesis and degradation. Therefore, caution must be used when developing therapies that interfere with MSTN/activin pathways
Chromosomal-level assembly of the Asian Seabass genome using long sequence reads and multi-layered scaffolding
We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics
Discourse Semantics for the Analysis of Change in Language
This paper purports to elaborate and address several issues which lie at the intersection of computational linguistics and psychology. The first issue addressed is that of the interaction between discourse and semantics by virtue of empirical linguistic and psychotherapeutic evidence. This paper then gives a formal account of the knowledge representation and reasoning processes involved in the construction of an XML knowledge base for use in the sematic analysis of psychotherapeutic transcripts. Computational methods for the automatic mark-up and inference of the psychotherapeutic phenomena under investigation are detailed in order to further develop intuitions behind a particular pragmatic theory of language known as the Metamodel. The work presented here ultimately aims to produce a sustainable system for the evaluation of the effectiveness of any given psychotherapeutic technique. The possibility exists for such a system to recognise successful therapeutic mechanisms and further still, to infer new ones, or suggest improvements, or offer novel explanations as to the success or failure of the therapy itself. The work discussed here stems from research in computational linguistics, psychotherapy, and philosophy. The corpus used is a culmination of client transcripts taken before, during, and after therapy. The particular therapeutic technique used here is known as the Metamodel (Bandler and Grinder, 1975). The Metamodel was originally proffered as a method of language analysis suitable for use by practitioners of any psychotherapeutic technique. It theorises that speech utterances are related to a clients deep structure through three primary mechanisms, namely generalisation, deletion, and distortion. Previous hand tagging of our data has proven support for such claims. It is our aim to automate the identification and reasoning process. The issues and processes involved in the automation of such tagging are discussed here. Architectural and philosophical issues relating syntax (or grammar), semantics (Larson and Segal, 1995), and pragmatics (Grice, 1989; Searle, 1969) are raised. Discourse Representation Theory (Kamp, 1981; Asher and Lascarides, 1995) is discussed and used here in order to infer discourse relations.Hosted by the Scholarly Text and Imaging Service (SETIS), the University of Sydney Library, and the Research Institute for Humanities and Social Sciences (RIHSS), the University of Sydney
- …