    c-Myc Regulates Self-Renewal in Bronchoalveolar Stem Cells

    BACKGROUND: Bronchoalveolar stem cells (BASCs) located in the bronchoalveolar duct junction are thought to regenerate both bronchiolar and alveolar epithelium during homeostatic turnover and in response to injury. The mechanisms directing self-renewal in BASCs are poorly understood. METHODS: BASCs (Sca-1(+), CD34(+), CD31(-), and CD45(-)) were isolated from adult mouse lung using FACS, and their capacity for self-renewal and differentiation was demonstrated by immunostaining. A transcription factor network of 53 genes required for pluripotency in embryonic stem cells was assessed in BASCs, Kras-initiated lung tumor tissue, and lung organogenesis by real-time PCR. c-Myc was knocked down in BASCs by infection with c-Myc shRNA lentivirus. Comprehensive miRNA and mRNA profiling for BASCs was performed, and significant miRNAs and mRNAs potentially regulated by c-Myc were identified. We explored a c-Myc regulatory network in BASCs using a number of statistical and computational approaches through two different strategies: 1) c-Myc/Max binding sites within individual gene promoters, and 2) miRNA-regulated target genes. RESULTS: c-Myc expression was upregulated in BASCs and downregulated over the time course of lung organogenesis in vivo. The depletion of c-Myc in BASCs resulted in decreased proliferation and cell death. Multiple mRNAs and miRNAs were dynamically regulated in c-Myc-depleted BASCs. Among a total of 250 dynamically regulated genes in c-Myc-depleted BASCs, 57 genes were identified as potential targets of miRNAs through miRBase and TargetScan-based computational mapping. A further 88 genes were identified as potential downstream targets through their c-Myc binding motif. CONCLUSION: c-Myc plays a critical role in maintaining the self-renewal capacity of lung bronchoalveolar stem cells through a combination of miRNA and transcription factor regulatory networks.
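
The first strategy above, looking for c-Myc/Max binding sites within gene promoters, can be illustrated with a minimal sketch. The abstract does not say which motif definition or promoter window was used; the code below assumes the canonical E-box hexamer CACGTG as the only motif, and the gene names and sequences are invented for illustration.

```python
import re

# Canonical E-box hexamer bound by c-Myc/Max heterodimers. Using it as the
# sole motif is an assumption; the abstract does not state the motif used.
# The lookahead lets overlapping matches be reported. CACGTG is its own
# reverse complement, so scanning one strand is enough.
EBOX = re.compile(r"(?=CACGTG)")

def find_ebox_sites(promoter: str) -> list[int]:
    """Return 0-based start positions of every CACGTG occurrence."""
    return [m.start() for m in EBOX.finditer(promoter.upper())]

# Hypothetical promoter sequences, not taken from the study.
promoters = {
    "geneA": "ttgaCACGTGccatCACGTGaa",
    "geneB": "ggcatcatcatgg",
}

hits = {gene: find_ebox_sites(seq) for gene, seq in promoters.items()}
# Genes with at least one site are candidate direct targets.
candidates = [gene for gene, positions in hits.items() if positions]
```

A real analysis would typically use a position weight matrix and a defined window around the transcription start site rather than an exact hexamer match.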

    Digital Gene Expression Analysis Based on Integrated De Novo Transcriptome Assembly of Sweet Potato [Ipomoea batatas (L.) Lam.]

    Background: Sweet potato (Ipomoea batatas (L.) Lam.) ranks among the top six most important food crops in the world. It is widely grown throughout the world with high and stable yield, strong adaptability, rich nutrient content, and multiple uses. However, little is known about the molecular biology of this important non-model organism due to the lack of genomic resources. Hence, studies based on high-throughput sequencing technologies are needed to obtain a comprehensive and integrated genomic resource and a better understanding of gene expression patterns in different tissues and at various developmental stages. Methodology/Principal Findings: Illumina paired-end (PE) RNA-Sequencing was performed, generating 48.7 million 75 bp PE reads. These reads were de novo assembled into 128,052 transcripts (≥100 bp), which correspond to 41.1 million base pairs, by using a combined assembly strategy. Transcripts were annotated by Blast2GO: 51,763 transcripts had BLASTX hits, of which 39,677 have GO terms and 14,117 have EC numbers that are associated with 147 KEGG pathways. Furthermore, transcriptome differences among seven tissues were analyzed by using Illumina digital gene expression (DGE) tag profiling, and numerous differentially and specifically expressed transcripts were identified. Moreover, the expression characteristics of genes involved in viral genomes, starch metabolism, and potential stress tolerance and insect resistance were also identified.

    miRNA Expression in Colon Polyps Provides Evidence for a Multihit Model of Colon Cancer

    Changes in miRNA expression are a common feature in colon cancer. Those changes occurring in the transition from normal to adenoma and from adenoma to carcinoma, however, have not been well defined. Additionally, miRNA changes among tumor subgroups of colon cancer have also not been adequately evaluated. In this study, we examined the global miRNA expression in 315 samples that included 52 normal colonic mucosa, 41 tubulovillous adenomas, 158 adenocarcinomas with proficient DNA mismatch repair (pMMR) selected for stage and age of onset, and 64 adenocarcinomas with defective DNA mismatch repair (dMMR) selected for sporadic (n = 53) and inherited colon cancer (n = 11). Sporadic dMMR tumors all had MLH1 inactivation due to promoter hypermethylation. Unsupervised PCA and cluster analysis demonstrated that normal colon tissue, adenomas, pMMR carcinomas and dMMR carcinomas were all clearly discernable. The majority of miRNAs that were differentially expressed between normal and polyp were also differentially expressed with a similar magnitude in the comparison of normal to both the pMMR and dMMR tumor groups, suggesting a stepwise progression for transformation from normal colon to carcinoma. Among the miRNAs demonstrating the largest fold up- or down-regulated changes (≥4), four novel (miR-31, miR-1, miR-9 and miR-99a) and two previously reported (miR-137 and miR-135b) miRNAs were identified in the normal/adenoma comparison. All but one of these (miR-99a) demonstrated similar expression differences in the two normal/carcinoma comparisons, suggesting that these early tumor changes are important in both the pMMR- and dMMR-derived cancers. The comparison between pMMR and dMMR tumors identified four miRNAs (miR-31, miR-552, miR-592 and miR-224) with statistically significant expression differences (≥2-fold change)
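
The fold-change cut-offs quoted above (≥4 for the normal/adenoma comparison, ≥2 for pMMR versus dMMR) amount to a symmetric threshold on the log2 ratio of group means. The following is a minimal sketch of that filter only; the miRNA names and expression values are invented, and the statistical testing that the study also applied is not reproduced here.

```python
import math

def fold_change_hits(group_a, group_b, min_fold=4.0):
    """Return {miRNA: log2 fold change of group_b over group_a} for miRNAs
    whose mean expression differs by at least min_fold in either direction."""
    threshold = math.log2(min_fold)
    hits = {}
    for name in group_a:
        log2fc = math.log2(group_b[name] / group_a[name])
        if abs(log2fc) >= threshold:
            hits[name] = log2fc
    return hits

# Invented mean expression values (arbitrary units), for illustration only.
normal = {"miR-A": 100.0, "miR-B": 80.0, "miR-C": 50.0}
adenoma = {"miR-A": 800.0, "miR-B": 10.0, "miR-C": 75.0}

hits = fold_change_hits(normal, adenoma, min_fold=4.0)
```

Here miR-A (8-fold up) and miR-B (8-fold down) pass the 4-fold cut-off, while miR-C (1.5-fold up) does not.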

    A Transposon-Based Genetic Screen in Mice Identifies Genes Altered in Colorectal Cancer

    Human colorectal cancers (CRCs) display a large number of genetic and epigenetic alterations, some of which are causally involved in tumorigenesis (drivers) and others that have little functional impact (passengers). To help distinguish between these two classes of alterations, we used a transposon-based genetic screen in mice to identify candidate genes for CRC. Mice harboring mutagenic Sleeping Beauty (SB) transposons were crossed with mice expressing SB transposase in gastrointestinal tract epithelium. Most of the offspring developed intestinal lesions, including intraepithelial neoplasia, adenomas, and adenocarcinomas. Analysis of over 16,000 transposon insertions identified 77 candidate CRC genes, 60 of which are mutated and/or dysregulated in human CRC and thus are most likely to drive tumorigenesis. These genes include APC, PTEN, and SMAD4. The screen also identified 17 candidate genes that had not previously been implicated in CRC, including POLI, PTPRK, and RSPO2

    Genome-wide transcriptional profiling of appressorium development by the rice blast fungus Magnaporthe oryzae

    Addresses: College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom. Notes: PMCID: PMC3276559.

    The rice blast fungus Magnaporthe oryzae is one of the most significant pathogens affecting global food security. To cause rice blast disease the fungus elaborates a specialised infection structure called an appressorium. Here, we report genome-wide transcriptional profile analysis of appressorium development using next generation sequencing (NGS). We performed both RNA-Seq and High-Throughput SuperSAGE analysis to compare the utility of these procedures for identifying differential gene expression in M. oryzae. We then analysed global patterns of gene expression during appressorium development. We show evidence for large-scale gene expression changes, highlighting the role of autophagy, lipid metabolism and melanin biosynthesis in appressorium differentiation. We reveal the role of the Pmk1 MAP kinase as a key global regulator of appressorium-associated gene expression. We also provide evidence for differential expression of transporter-encoding gene families and specific high level expression of genes involved in quinate uptake and utilization, consistent with pathogen-mediated perturbation of host metabolism during plant infection. When considered together, these data provide a comprehensive high-resolution analysis of gene expression changes associated with cellular differentiation that will provide a key resource for understanding the biology of rice blast disease.

    Tumor Transcriptome Sequencing Reveals Allelic Expression Imbalances Associated with Copy Number Alterations

    Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq) should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately measure allele-specific expression. To investigate these advantages we have applied a novel, strand-specific RNA-Seq method to tumors and matched normal tissue from three patients with oral squamous cell carcinomas. Additionally, to better understand the genomic determinants of the gene expression changes observed, we have sequenced the tumor and normal genomes of one of these patients. We demonstrate here that our RNA-Seq method accurately measures allelic imbalance and that measurement on the genome-wide scale yields novel insights into cancer etiology. As expected, the set of genes differentially expressed in the tumors is enriched for cell adhesion and differentiation functions, but, unexpectedly, the set of allelically imbalanced genes is also enriched for these same cancer-related functions. By comparing the transcriptomic perturbations observed in one patient to his underlying normal and tumor genomes, we find that allelic imbalance in the tumor is associated with copy number mutations and that copy number mutations are, in turn, strongly associated with changes in transcript abundance. These results support a model in which allele-specific deletions and duplications drive allele-specific changes in gene expression in the developing tumor
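
Allelic imbalance at a heterozygous site is commonly assessed by asking whether RNA-Seq reads split evenly between the two alleles. The abstract does not describe the statistical test the authors used; the sketch below assumes a simple two-sided exact binomial test against a 50:50 expectation, with invented read counts.

```python
from math import comb

def allelic_imbalance_pvalue(ref_reads: int, alt_reads: int) -> float:
    """Two-sided exact binomial p-value for the null hypothesis that both
    alleles are expressed equally (p = 0.5)."""
    n = ref_reads + alt_reads
    if n == 0:
        return 1.0
    k = min(ref_reads, alt_reads)
    # With p = 0.5 the distribution is symmetric, so the two-sided p-value
    # is twice the lower tail, capped at 1.
    lower_tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * lower_tail)

# Invented read counts at two hypothetical heterozygous sites.
balanced = allelic_imbalance_pvalue(10, 10)
skewed = allelic_imbalance_pvalue(18, 2)
```

In practice such p-values would be corrected for multiple testing across sites, and reference-mapping bias would need to be controlled before a 50:50 null is reasonable.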

    Nematode and Arthropod Genomes Provide New Insights into the Evolution of Class 2 B1 GPCRs

    Nematodes and arthropods are the most speciose animal groups and possess Class 2 B1 G-protein coupled receptors (GPCRs). Existing models of invertebrate Class 2 B1 GPCR evolution are mainly centered on Caenorhabditis elegans and Drosophila melanogaster and a few other nematode and arthropod representatives. The present study reevaluates the evolution of metazoan Class 2 B1 GPCRs and orthologues by exploring the receptors in several nematode and arthropod genomes and comparing them to the human receptors. Three novel receptor phylogenetic clusters were identified and designated cluster A, cluster B and PDF-R-related cluster. Clusters A and B were identified in several nematode and arthropod genomes but were absent from D. melanogaster and Culicidae genomes, whereas the majority of the members of the PDF-R-related cluster were from nematodes. Cluster A receptors were nematode and arthropod-specific but shared a conserved gene environment with human receptor loci. Cluster B members were orthologous to human GCGR, PTHR and Secretin members with which they probably shared a common origin. PDF-R and PDF-R related clusters were present in representatives of both nematodes and arthropods. The results of comparative analysis of GPCR evolution and diversity in protostomes confirm previous notions that C. elegans and D. melanogaster genomes are not good representatives of nematode and arthropod phyla. 
    We hypothesize that at least four ancestral Class 2 B1 genes emerged early in the metazoan radiation, which after the protostome-deuterostome split underwent distinct selective pressures that resulted in duplication and deletion events that originated the current Class 2 B1 GPCRs in nematode and arthropod genomes.

    This work was supported by the Portuguese Foundation for Science and Technology (FCT) project PTDC/BIA-BCM/114395/2009, by the European Regional Development Fund through COMPETE and FCT under the project "PEst-C/MAR/LA0015/2011". RCF is in receipt of an FCT grant (SFRH/BPD/89811/2012) and JCRC is supported by auxiliary research contract FCT Pluriannual funds attributed to CCMAR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

    RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm

    High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions, particularly those expressed at low abundance, is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving RPS6KB1 (ribosomal protein S6 kinase beta-1), were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample.
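
The core idea behind a suffix array aligner is that every occurrence of a read in the reference forms one contiguous block of the sorted suffix list, which binary search can locate. The sketch below shows only that exact-match lookup on a toy sequence. It is not the SASR aligner, whose spliced-read and fusion logic the abstract does not describe, and the function names are hypothetical.

```python
def build_suffix_array(text: str) -> list[int]:
    """Start positions of all suffixes of text, in lexicographic order.
    Naive construction, suitable for a toy example only."""
    return sorted(range(len(text)), key=lambda i: text[i:])

def find_occurrences(text: str, sa: list[int], pattern: str) -> list[int]:
    """Return sorted start positions where pattern occurs in text."""
    m = len(pattern)
    # Lower bound: first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # Upper bound: first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])

# Toy reference and read, invented for illustration.
reference = "ACGTACGTTACG"
sa = build_suffix_array(reference)
positions = find_occurrences(reference, sa, "ACG")
missing = find_occurrences(reference, sa, "GGG")
```

A fusion-aware aligner would go further, for example by mapping the two ends of a read separately and reporting cases where they land in different genes.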

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency–Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research
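
Of the three baseline methods named above, Term Frequency-Inverse Document Frequency is the simplest to sketch: each document becomes a weighted term vector, and candidates are ranked by cosine similarity to the seed article. The code below is a minimal illustration under stated assumptions (whitespace tokenisation, an unsmoothed logarithmic IDF, invented titles); it is not the consortium's implementation.

```python
import math
from collections import Counter

def tfidf_vectors(docs: list[str]) -> list[dict[str, float]]:
    """Build one sparse TF-IDF vector (term -> weight) per document."""
    tokenised = [doc.lower().split() for doc in docs]
    n_docs = len(tokenised)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return vectors

def cosine(u: dict[str, float], v: dict[str, float]) -> float:
    """Cosine similarity of two sparse vectors; 0.0 if either is all zero."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * \
        math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

# Invented titles: one seed article and two candidate articles.
docs = [
    "mirna expression colon cancer",
    "mirna expression colon polyps",
    "rice blast fungus appressorium",
]
seed, related, unrelated = tfidf_vectors(docs)
score_related = cosine(seed, related)
score_unrelated = cosine(seed, unrelated)
```

The related title shares three terms with the seed and scores above zero, while the unrelated title shares none and scores exactly zero.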

