6,289 research outputs found
Bitopic binding mode of an M1 muscarinic acetylcholine receptor agonist associated with adverse clinical trial outcomes
The realisation of the therapeutic potential of targeting the M1 muscarinic acetylcholine receptor (M1 mAChR) for the treatment of cognitive decline in Alzheimer's disease has prompted the discovery of M1 mAChR ligands showing efficacy in alleviating cognitive dysfunction in both rodents and humans. Among these is GSK1034702, described previously as a potent M1 receptor allosteric agonist, which showed pro-cognitive effects in rodents and improved immediate memory in a clinical nicotine withdrawal test but induced significant side-effects. Here we provide evidence using ligand binding, chemical biology and functional assays to establish that rather than the allosteric mechanism claimed, GSK1034702 interacts in a bitopic manner at the M1 mAChR such that it can concomitantly span both the orthosteric and an allosteric binding site. The bitopic nature of GSK1034702 together with the intrinsic agonist activity and a lack of muscarinic receptor subtype selectivity reported here, all likely contribute to the adverse effects of this molecule in clinical trials. We conclude that these properties, whilst imparting beneficial effects on learning and memory, are undesirable in a clinical candidate due to the likelihood of adverse side effects. Rather, our data supports the notion that "pure" positive allosteric modulators showing selectivity for the M1 mAChR with low levels of intrinsic activity would be preferable to provide clinical efficacy with low adverse responses
QuasR: quantification and annotation of short reads in R
Summary: QuasR is a package for the integrated analysis of high-throughput sequencing data in R, covering all steps from read preprocessing, alignment and quality control to quantification. QuasR supports different experiment types (including RNA-seq, ChIP-seq and Bis-seq) and analysis variants (e.g. paired-end, stranded, spliced and allele-specific), and is integrated in Bioconductor so that its output can be directly processed for statistical analysis and visualization. Availability and implementation: QuasR is implemented in R and C/C++. Source code and binaries for major platforms (Linux, OS X and MS Windows) are available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/QuasR.html). The package includes a ‘vignette' with step-by-step examples for typical work flows. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin
A robust SNP barcode for typing Mycobacterium tuberculosis complex strains
Strain-specific genomic diversity in the Mycobacterium tuberculosis complex (MTBC) is an important factor in pathogenesis that may affect virulence, transmissibility, host response and emergence of drug resistance. Several systems have been proposed to classify MTBC strains into distinct lineages and families. Here, we investigate single-nucleotide polymorphisms (SNPs) as robust (stable) markers of genetic variation for phylogenetic analysis. We identify ~92k SNP across a global collection of 1,601 genomes. The SNP-based phylogeny is consistent with the gold-standard regions of difference (RD) classification system. Of the ~7k strain-specific SNPs identified, 62 markers are proposed to discriminate known circulating strains. This SNP-based barcode is the first to cover all main lineages, and classifies a greater number of sublineages than current alternatives. It may be used to classify clinical isolates to evaluate tools to control the disease, including therapeutics and vaccines whose effectiveness may vary by strain type
Approximate String Matching Using a Bidirectional Index
International audienceWe study strategies of approximate pattern matching that exploit bidirectional text indexes, extending and generalizing ideas of [5]. We introduce a formalism, called search schemes, to specify search strate-gies of this type, then develop a probabilistic measure for the efficiency of a search scheme, prove several combinatorial results on efficient search schemes, and finally, provide experimental computations supporting the superiority of our strategies
Plasmodium knowlesi Genome Sequences from Clinical Isolates Reveal Extensive Genomic Dimorphism.
Plasmodium knowlesi is a newly described zoonosis that causes malaria in the human population that can be severe and fatal. The study of P. knowlesi parasites from human clinical isolates is relatively new and, in order to obtain maximum information from patient sample collections, we explored the possibility of generating P. knowlesi genome sequences from archived clinical isolates. Our patient sample collection consisted of frozen whole blood samples that contained excessive human DNA contamination and, in that form, were not suitable for parasite genome sequencing. We developed a method to reduce the amount of human DNA in the thawed blood samples in preparation for high throughput parasite genome sequencing using Illumina HiSeq and MiSeq sequencing platforms. Seven of fifteen samples processed had sufficiently pure P. knowlesi DNA for whole genome sequencing. The reads were mapped to the P. knowlesi H strain reference genome and an average mapping of 90% was obtained. Genes with low coverage were removed leaving 4623 genes for subsequent analyses. Previously we identified a DNA sequence dimorphism on a small fragment of the P. knowlesi normocyte binding protein xa gene on chromosome 14. We used the genome data to assemble full-length Pknbpxa sequences and discovered that the dimorphism extended along the gene. An in-house algorithm was developed to detect SNP sites co-associating with the dimorphism. More than half of the P. knowlesi genome was dimorphic, involving genes on all chromosomes and suggesting that two distinct types of P. knowlesi infect the human population in Sarawak, Malaysian Borneo. We use P. knowlesi clinical samples to demonstrate that Plasmodium DNA from archived patient samples can produce high quality genome data. We show that analyses, of even small numbers of difficult clinical malaria isolates, can generate comprehensive genomic information that will improve our understanding of malaria parasite diversity and pathobiology
Transkingdom Networks: A Systems Biology Approach to Identify Causal Members of Host-Microbiota Interactions
Improvements in sequencing technologies and reduced experimental costs have
resulted in a vast number of studies generating high-throughput data. Although
the number of methods to analyze these "omics" data has also increased,
computational complexity and lack of documentation hinder researchers from
analyzing their high-throughput data to its true potential. In this chapter we
detail our data-driven, transkingdom network (TransNet) analysis protocol to
integrate and interrogate multi-omics data. This systems biology approach has
allowed us to successfully identify important causal relationships between
different taxonomic kingdoms (e.g. mammals and microbes) using diverse types of
data
The Escherichia coli transcriptome mostly consists of independently regulated modules
Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome
Low-Bandwidth and Non-Compute Intensive Remote Identification of Microbes from Raw Sequencing Reads
Cheap high-throughput DNA sequencing may soon become routine not only for
human genomes but also for practically anything requiring the identification of
living organisms from their DNA: tracking of infectious agents, control of food
products, bioreactors, or environmental samples.
We propose a novel general approach to the analysis of sequencing data in
which the reference genome does not have to be specified. Using a distributed
architecture we are able to query a remote server for hints about what the
reference might be, transferring a relatively small amount of data, and the
hints can be used for more computationally-demanding work.
Our system consists of a server with known reference DNA indexed, and a
client with raw sequencing reads. The client sends a sample of unidentified
reads, and in return receives a list of matching references known to the
server. Sequences for the references can be retrieved and used for exhaustive
computation on the reads, such as alignment.
To demonstrate this approach we have implemented a web server, indexing tens
of thousands of publicly available genomes and genomic regions from various
organisms and returning lists of matching hits from query sequencing reads. We
have also implemented two clients, one of them running in a web browser, in
order to demonstrate that gigabytes of raw sequencing reads of unknown origin
could be identified without the need to transfer a very large volume of data,
and on modestly powered computing devices.
A web access is available at http://tapir.cbs.dtu.dk. The source code for a
python command-line client, a server, and supplementary data is available at
http://bit.ly/1aURxkc
Profiling allele-specific gene expression in brains from individuals with autism spectrum disorder reveals preferential minor allele usage.
One fundamental but understudied mechanism of gene regulation in disease is allele-specific expression (ASE), the preferential expression of one allele. We leveraged RNA-sequencing data from human brain to assess ASE in autism spectrum disorder (ASD). When ASE is observed in ASD, the allele with lower population frequency (minor allele) is preferentially more highly expressed than the major allele, opposite to the canonical pattern. Importantly, genes showing ASE in ASD are enriched in those downregulated in ASD postmortem brains and in genes harboring de novo mutations in ASD. Two regions, 14q32 and 15q11, containing all known orphan C/D box small nucleolar RNAs (snoRNAs), are particularly enriched in shifts to higher minor allele expression. We demonstrate that this allele shifting enhances snoRNA-targeted splicing changes in ASD-related target genes in idiopathic ASD and 15q11-q13 duplication syndrome. Together, these results implicate allelic imbalance and dysregulation of orphan C/D box snoRNAs in ASD pathogenesis
The genome and transcriptome of Trichormus sp NMC-1: insights into adaptation to extreme environments on the Qinghai-Tibet Plateau
The Qinghai-Tibet Plateau (QTP) has the highest biodiversity for an extreme environment worldwide, and provides an ideal natural laboratory to study adaptive evolution. In this study, we generated a draft genome sequence of cyanobacteria Trichormus sp. NMC-1 in the QTP and performed whole transcriptome sequencing under low temperature to investigate the genetic mechanism by which T. sp. NMC-1 adapted to the specific environment. Its genome sequence was 5.9 Mb with a G+C content of 39.2% and encompassed a total of 5362 CDS. A phylogenomic tree indicated that this strain belongs to the Trichormus and Anabaena cluster. Genome comparison between T. sp. NMC-1 and six relatives showed that functionally unknown genes occupied a much higher proportion (28.12%) of the T. sp. NMC-1 genome. In addition, functions of specific, significant positively selected, expanded orthogroups, and differentially expressed genes involved in signal transduction, cell wall/membrane biogenesis, secondary metabolite biosynthesis, and energy production and conversion were analyzed to elucidate specific adaptation traits. Further analyses showed that the CheY-like genes, extracellular polysaccharide and mycosporine-like amino acids might play major roles in adaptation to harsh environments. Our findings indicate that sophisticated genetic mechanisms are involved in cyanobacterial adaptation to the extreme environment of the QTP
- …
