94 research outputs found

    The breast cancer somatic 'muta-ome': tackling the complexity

    Get PDF
    Acquired somatic mutations are responsible for approximately 90% of breast tumours. However, only one somatic aberration, amplification of the HER2 locus, is currently used to define a clinical subtype, one that accounts for approximately 10% to 15% of breast tumours. In recent years, a number of mutational profiling studies have attempted to further identify clinically relevant mutations. While these studies have confirmed the oncogenic or tumour suppressor role of many known suspects, they have exposed complexity as a main feature of the breast cancer mutational landscape (the 'muta-ome'). The two defining features of this complexity are (a) a surprising richness of low-frequency mutants contrasting with the relative rarity of high-frequency events and (b) the relatively large number of somatic genomic aberrations (approximately 20 to 50) driving an average tumour. Structural features of this complex landscape have begun to emerge from follow-up studies that have tackled the complexity by integrating the spectrum of genomic mutations with a variety of complementary biological knowledge databases. Among these structural features are the growing links between somatic gene disruptions and those conferring breast cancer risk, mutually exclusive coexistence and synergistic mutational patterns, and a clearly non-random distribution of mutations implicating specific molecular pathways in breast tumour initiation and progression. Recognising that a shift from a gene-centric to a pathway-centric approach is necessary, we envisage that further progress in identifying clinically relevant genomic aberration patterns and associated breast cancer subtypes will require not only multi-dimensional integrative analyses that combine mutational and functional profiles, but also larger profiling studies that use second- and third-generation sequencing technologies in order to fill out the important gaps in the current mutational landscape

    Mutations in SLC39A14 disrupt manganese homeostasis and cause childhood-onset parkinsonism-dystonia

    Get PDF
    Although manganese is an essential trace metal, little is known about its transport and homeostatic regulation. Here we have identified a cohort of patients with a novel autosomal recessive manganese transporter defect caused by mutations in SLC39A14. Excessive accumulation of manganese in these patients results in rapidly progressive childhood-onset parkinsonism-dystonia with distinctive brain magnetic resonance imaging appearances and neurodegenerative features on post-mortem examination. We show that mutations in SLC39A14 impair manganese transport in vitro and lead to manganese dyshomeostasis and altered locomotor activity in zebrafish with CRISPR-induced slc39a14 null mutations. Chelation with disodium calcium edetate lowers blood manganese levels in patients and can lead to striking clinical improvement. Our results demonstrate that SLC39A14 functions as a pivotal manganese transporter in vertebrates

    Next-generation sequencing

    Get PDF
    Next-generation sequencing (also known as massively parallel sequencing) technologies are revolutionising our ability to characterise cancers at the genomic, transcriptomic and epigenetic levels. Cataloguing all mutations, copy number aberrations and somatic rearrangements in an entire cancer genome at base pair resolution can now be performed in a matter of weeks. Furthermore, massively parallel sequencing can be used as a means for unbiased transcriptomic analysis of mRNAs, small RNAs and noncoding RNAs, genome-wide methylation assays and high-throughput chromatin immunoprecipitation assays. Here, I discuss the potential impact of this technology on breast cancer research and the challenges that come with this technological breakthrough

    Deep RNA sequencing analysis of readthrough gene fusions in human prostate adenocarcinoma and reference samples

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Readthrough fusions across adjacent genes in the genome, or transcription-induced chimeras (TICs), have been estimated using expressed sequence tag (EST) libraries to involve 4-6% of all genes. Deep transcriptional sequencing (RNA-Seq) now makes it possible to study the occurrence and expression levels of TICs in individual samples across the genome.</p> <p>Methods</p> <p>We performed single-end RNA-Seq on three human prostate adenocarcinoma samples and their corresponding normal tissues, as well as brain and universal reference samples. We developed two bioinformatics methods to specifically identify TIC events: a targeted alignment method using artificial exon-exon junctions within 200,000 bp from adjacent genes, and genomic alignment allowing splicing within individual reads. We performed further experimental verification and characterization of selected TIC and fusion events using quantitative RT-PCR and comparative genomic hybridization microarrays.</p> <p>Results</p> <p>Targeted alignment against artificial exon-exon junctions yielded 339 distinct TIC events, including 32 gene pairs with multiple isoforms. The false discovery rate was estimated to be 1.5%. Spliced alignment to the genome was less sensitive, finding only 18% of those found by targeted alignment in 33-nt reads and 59% of those in 50-nt reads. However, spliced alignment revealed 30 cases of TICs with intervening exons, in addition to distant inversions, scrambled genes, and translocations. Our findings increase the catalog of observed TIC gene pairs by 66%.</p> <p>We verified 6 of 6 predicted TICs in all prostate samples, and 2 of 5 predicted novel distant gene fusions, both private events among 54 prostate tumor samples tested. Expression of TICs correlates with that of the upstream gene, which can explain the prostate-specific pattern of some TIC events and the restriction of the <it>SLC45A3-ELK4 </it>e4-e2 TIC to <it>ERG</it>-negative prostate samples, as confirmed in 20 matched prostate tumor and normal samples and 9 lung cancer cell lines.</p> <p>Conclusions</p> <p>Deep transcriptional sequencing and analysis with targeted and spliced alignment methods can effectively identify TIC events across the genome in individual tissues. Prostate and reference samples exhibit a wide range of TIC events, involving more genes than estimated previously using ESTs. Tissue specificity of TIC events is correlated with expression patterns of the upstream gene. Some TIC events, such as <it>MSMB-NCOA4</it>, may play functional roles in cancer.</p

    RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm

    Get PDF
    High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions–particularly those expressed with low abundance– is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample

    HIV and Hepatitis B and C incidence rates in US correctional populations and high risk groups: a systematic review and meta-analysis

    Get PDF

    Variability in Working Memory Performance Explained by Epistasis vs Polygenic Scores in the ZNF804A Pathway

    Get PDF
    Importance: We investigated the variation in neuropsychological function explained by risk alleles at the psychosis susceptibility gene ZNF804A and its interacting partners using single nucleotide polymorphisms (SNPs), polygenic scores, and epistatic analyses. Of particular importance was the relative contribution of the polygenic score vs epistasis in variation explained. Objectives To (1) assess the association between SNPs in ZNF804A and the ZNF804A polygenic score with measures of cognition in cases with psychosis and (2) assess whether epistasis within the ZNF804A pathway could explain additional variation above and beyond that explained by the polygenic score. Design, Setting, and Participants: Patients with psychosis (n = 424) were assessed in areas of cognitive ability impaired in schizophrenia including IQ, memory, attention, and social cognition. We used the Psychiatric GWAS Consortium 1 schizophrenia genome-wide association study to calculate a polygenic score based on identified risk variants within this genetic pathway. Cognitive measures significantly associated with the polygenic score were tested for an epistatic component using a training set (n = 170), which was used to develop linear regression models containing the polygenic score and 2-SNP interactions. The best-fitting models were tested for replication in 2 independent test sets of cases: (1) 170 individuals with schizophrenia or schizoaffective disorder and (2) 84 patients with broad psychosis (including bipolar disorder, major depressive disorder, and other psychosis). Main Outcomes and Measures: Participants completed a neuropsychological assessment battery designed to target the cognitive deficits of schizophrenia including general cognitive function, episodic memory, working memory, attentional control, and social cognition. Results: Higher polygenic scores were associated with poorer performance among patients on IQ, memory, and social cognition, explaining 1% to 3% of variation on these scores (range, P = .01 to .03). Using a narrow psychosis training set and independent test sets of narrow phenotype psychosis (schizophrenia and schizoaffective disorder), broad psychosis, and control participants (n = 89), the addition of 2 interaction terms containing 2 SNPs each increased the R2 for spatial working memory strategy in the independent psychosis test sets from 1.2% using the polygenic score only to 4.8% (P = .11 and .001, respectively) but did not explain additional variation in control participants. Conclusions and Relevance: These data support a role for the ZNF804A pathway in IQ, memory, and social cognition in cases. Furthermore, we showed that epistasis increases the variation explained above the contribution of the polygenic score

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency–Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research
    corecore