186 research outputs found

    Simultaneous non-negative matrix factorization for multiple large scale gene expression datasets in toxicology

    Get PDF
    Non-negative matrix factorization is a useful tool for reducing the dimension of large datasets. This work considers simultaneous non-negative matrix factorization of multiple sources of data. In particular, we perform the first study that involves more than two datasets. We discuss the algorithmic issues required to convert the approach into a practical computational tool and apply the technique to new gene expression data quantifying the molecular changes in four tissue types due to different dosages of an experimental panPPAR agonist in mouse. This study is of interest in toxicology because, whilst PPARs form potential therapeutic targets for diabetes, it is known that they can induce serious side-effects. Our results show that the practical simultaneous non-negative matrix factorization developed here can add value to the data analysis. In particular, we find that factorizing the data as a single object allows us to distinguish between the four tissue types, but does not correctly reproduce the known dosage level groups. Applying our new approach, which treats the four tissue types as providing distinct, but related, datasets, we find that the dosage level groups are respected. The new algorithm then provides separate gene list orderings that can be studied for each tissue type, and compared with the ordering arising from the single factorization. We find that many of our conclusions can be corroborated with known biological behaviour, and others offer new insights into the toxicological effects. Overall, the algorithm shows promise for early detection of toxicity in the drug discovery process

    Recent acquisition of Helicobacter pylori by Baka Pygmies

    Get PDF
    Both anatomically modern humans and the gastric pathogen Helicobacter pylori originated in Africa, and both species have been associated for at least 100,000 years. Seven geographically distinct H. pylori populations exist, three of which are indigenous to Africa: hpAfrica1, hpAfrica2, and hpNEAfrica. The oldest and most divergent population, hpAfrica2, evolved within San hunter-gatherers, who represent one of the deepest branches of the human population tree. Anticipating the presence of ancient H. pylori lineages within all hunter-gatherer populations, we investigated the prevalence and population structure of H. pylori within Baka Pygmies in Cameroon. Gastric biopsies were obtained by esophagogastroduodenoscopy from 77 Baka from two geographically separated populations, and from 101 non-Baka individuals from neighboring agriculturalist populations, and subsequently cultured for H. pylori. Unexpectedly, Baka Pygmies showed a significantly lower H. pylori infection rate (20.8%) than non-Baka (80.2%). We generated multilocus haplotypes for each H. pylori isolate by DNA sequencing, but were not able to identify Baka-specific lineages, and most isolates in our sample were assigned to hpNEAfrica or hpAfrica1. The population hpNEAfrica, a marker for the expansion of the Nilo-Saharan language family, was divided into East African and Central West African subpopulations. Similarly, a new hpAfrica1 subpopulation, identified mainly among Cameroonians, supports eastern and western expansions of Bantu languages. An age-structured transmission model shows that the low H. pylori prevalence among Baka Pygmies is achievable within the timeframe of a few hundred years and suggests that demographic factors such as small population size and unusually low life expectancy can lead to the eradication of H. pylori from individual human populations. The Baka were thus either H. pylori-free or lost their ancient lineages during past demographic fluctuations. Using coalescent simulations and phylogenetic inference, we show that Baka almost certainly acquired their extant H. pylori through secondary contact with their agriculturalist neighbors

    Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets.</p> <p>Methods</p> <p>Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA) and nonnegative matrix factorization (NMF) are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles.</p> <p>Results</p> <p>In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and the biological analysis of the identified significant genes and their related pathways demonstrated that these genes play a prominent role in AD and relate the activation patterns to AD phenotypes. It is validated that the combination of these two methods is efficient.</p> <p>Conclusions</p> <p>Unsupervised matrix factorization methods provide efficient tools to analyze high-throughput microarray dataset. According to the facts that different unsupervised approaches explore correlations in the high-dimensional data space and identify relevant subspace base on different hypotheses, integrating these methods to explore the underlying biological information from microarray dataset is an efficient approach. By combining the significant genes identified by both ICA and NMF, the biological analysis shows great efficient for elucidating the molecular taxonomy of Alzheimer’s disease and enable better experimental design to further identify potential pathways and therapeutic targets of AD.</p

    Control of Stochastic Gene Expression by Host Factors at the HIV Promoter

    Get PDF
    The HIV promoter within the viral long terminal repeat (LTR) orchestrates many aspects of the viral life cycle, from the dynamics of viral gene expression and replication to the establishment of a latent state. In particular, after viral integration into the host genome, stochastic fluctuations in viral gene expression amplified by the Tat positive feedback loop can contribute to the formation of either a productive, transactivated state or an inactive state. In a significant fraction of cells harboring an integrated copy of the HIV-1 model provirus (LTR-GFP-IRES-Tat), this bimodal gene expression profile is dynamic, as cells spontaneously and continuously flip between active (Bright) and inactive (Off) expression modes. Furthermore, these switching dynamics may contribute to the establishment and maintenance of proviral latency, because after viral integration long delays in gene expression can occur before viral transactivation. The HIV-1 promoter contains cis-acting Sp1 and NF-κB elements that regulate gene expression via the recruitment of both activating and repressing complexes. We hypothesized that interplay in the recruitment of such positive and negative factors could modulate the stability of the Bright and Off modes and thereby alter the sensitivity of viral gene expression to stochastic fluctuations in the Tat feedback loop. Using model lentivirus variants with mutations introduced in the Sp1 and NF-κB elements, we employed flow cytometry, mRNA quantification, pharmacological perturbations, and chromatin immunoprecipitation to reveal significant functional differences in contributions of each site to viral gene regulation. Specifically, the Sp1 sites apparently stabilize both the Bright and the Off states, such that their mutation promotes noisy gene expression and reduction in the regulation of histone acetylation and deacetylation. Furthermore, the NF-κB sites exhibit distinct properties, with κB site I serving a stronger activating role than κB site II. Moreover, Sp1 site III plays a particularly important role in the recruitment of both p300 and RelA to the promoter. Finally, analysis of 362 clonal cell populations infected with the viral variants revealed that mutations in any of the Sp1 sites yield a 6-fold higher frequency of clonal bifurcation compared to that of the wild-type promoter. Thus, each Sp1 and NF-κB site differentially contributes to the regulation of viral gene expression, and Sp1 sites functionally “dampen” transcriptional noise and thereby modulate the frequency and maintenance of this model of viral latency. These results may have biomedical implications for the treatment of HIV latency

    Estrogen receptor transcription and transactivation: Estrogen receptor alpha and estrogen receptor beta - regulation by selective estrogen receptor modulators and importance in breast cancer

    Get PDF
    Estrogens display intriguing tissue-selective action that is of great biomedical importance in the development of optimal therapeutics for the prevention and treatment of breast cancer, for menopausal hormone replacement, and for fertility regulation. Certain compounds that act through the estrogen receptor (ER), now referred to as selective estrogen receptor modulators (SERMs), can demonstrate remarkable differences in activity in the various estrogen target tissues, functioning as agonists in some tissues but as antagonists in others. Recent advances elucidating the tripartite nature of the biochemical and molecular actions of estrogens provide a good basis for understanding these tissue-selective actions. As discussed in this thematic review, the development of optimal SERMs should now be viewed in the context of two estrogen receptor subtypes, ERα and ERβ, that have differing affinities and responsiveness to various SERMs, and differing tissue distribution and effectiveness at various gene regulatory sites. Cellular, biochemical, and structural approaches have also shown that the nature of the ligand affects the conformation assumed by the ER-ligand complex, thereby regulating its state of phosphorylation and the recruitment of different coregulator proteins. Growth factors and protein kinases that control the phosphorylation state of the complex also regulate the bioactivity of the ER. These interactions and changes determine the magnitude of the transcriptional response and the potency of different SERMs. As these critical components are becoming increasingly well defined, they provide a sound basis for the development of novel SERMs with optimal profiles of tissue selectivity as medical therapeutic agents

    Quantification of miRNA-mRNA Interactions

    Get PDF
    miRNAs are small RNA molecules (′ 22nt) that interact with their corresponding target mRNAs inhibiting the translation of the mRNA into proteins and cleaving the target mRNA. This second effect diminishes the overall expression of the target mRNA. Several miRNA-mRNA relationship databases have been deployed, most of them based on sequence complementarities. However, the number of false positives in these databases is large and they do not overlap completely. Recently, it has been proposed to combine expression measurement from both miRNA and mRNA and sequence based predictions to achieve more accurate relationships. In our work, we use LASSO regression with non-positive constraints to integrate both sources of information. LASSO enforces the sparseness of the solution and the non-positive constraints restrict the search of miRNA targets to those with down-regulation effects on the mRNA expression. We named this method TaLasso (miRNA-Target LASSO)

    Low is large: spatial location and pitch interact in voice-based body size estimation

    Get PDF
    The binding of incongruent cues poses a challenge for multimodal perception. Indeed, although taller objects emit sounds from higher elevations, low-pitched sounds are perceptually mapped both to large size and to low elevation. In the present study, we examined how these incongruent vertical spatial cues (up is more) and pitch cues (low is large) to size interact, and whether similar biases influence size perception along the horizontal axis. In Experiment 1, we measured listeners’ voice-based judgments of human body size using pitch-manipulated voices projected from a high versus a low, and a right versus a left, spatial location. Listeners associated low spatial locations with largeness for lowered-pitch but not for raised-pitch voices, demonstrating that pitch overrode vertical-elevation cues. Listeners associated rightward spatial locations with largeness, regardless of voice pitch. In Experiment 2, listeners performed the task while sitting or standing, allowing us to examine self-referential cues to elevation in size estimation. Listeners associated vertically low and rightward spatial cues with largeness more for lowered- than for raised-pitch voices. These correspondences were robust to sex (of both the voice and the listener) and head elevation (standing or sitting); however, horizontal correspondences were amplified when participants stood. Moreover, when participants were standing, their judgments of how much larger men’s voices sounded than women’s increased when the voices were projected from the low speaker. Our results provide novel evidence for a multidimensional spatial mapping of pitch that is generalizable to human voices and that affects performance in an indirect, ecologically relevant spatial task (body size estimation). These findings suggest that crossmodal pitch correspondences evoke both low-level and higher-level cognitive processes

    The Regulation of MS-KIF18A Expression and Cross Talk with Estrogen Receptor

    Get PDF
    This study provides a novel view on the interactions between the MS-KIF18A, a kinesin protein, and estrogen receptor alpha (ERα) which were studied in vivo and in vitro. Additionally, the regulation of MS-KIF18A expression by estrogen was investigated at the gene and protein levels. An association between recombinant proteins; ERα and MS-KIF18A was demonstrated in vitro in a pull down assay. Such interactions were proven also for endogenous proteins in MBA-15 cells were detected prominently in the cytoplasm and are up-regulated by estrogen. Additionally, an association between these proteins and the transcription factor NF-κB was identified. MS-KIF18A mRNA expression was measured in vivo in relation to age and estrogen level in mice and rats models. A decrease in MS-KIF18A mRNA level was measured in old and in OVX-estrogen depleted rats as compared to young animals. The low MS-KIF18A mRNA expression in OVX rats was restored by estrogen treatment. We studied the regulation of MS-KIF18A transcription by estrogen using the luciferase reporter gene and chromatin immuno-percipitation (ChIP) assays. The luciferase reporter gene assay demonstrated an increase in MS-KIF18A promoter activity in response to 10−8 M estrogen and 10−7M ICI-182,780. Complimentary, the ChIP assay quantified the binding of ERα and pcJun to the MS-KIF18A promoter that was enhanced in cells treated by estrogen and ICI-182,780. In addition, cells treated by estrogen expressed higher levels of MS-KIF18A mRNA and protein and the protein turnover in MBA-15 cells was accelerated. Presented data demonstrated that ERα is a defined cargo of MS-KIF18A and added novel insight on the role of estrogen in regulation of MS-KIF18A expression both in vivo and in vitro

    Partial Inhibition of Estrogen-Induced Mammary Carcinogenesis in Rats by Tamoxifen: Balance between Oxidant Stress and Estrogen Responsiveness

    Get PDF
    Epidemiological and experimental evidences strongly support the role of estrogens in breast tumor development. Both estrogen receptor (ER)-dependent and ER-independent mechanisms are implicated in estrogen-induced breast carcinogenesis. Tamoxifen, a selective estrogen receptor modulator is widely used as chemoprotectant in human breast cancer. It binds to ERs and interferes with normal binding of estrogen to ERs. In the present study, we examined the effect of long-term tamoxifen treatment in the prevention of estrogen-induced breast cancer. Female ACI rats were treated with 17β-estradiol (E2), tamoxifen or with a combination of E2 and tamoxifen for eight months. Tissue levels of oxidative stress markers 8-iso-Prostane F2α (8-isoPGF2α), superoxide dismutase (SOD), glutathione peroxidase (GPx), catalase, and oxidative DNA damage marker 8-hydroxydeoxyguanosine (8-OHdG) were quantified in the mammary tissues of all the treatment groups and compared with age-matched controls. Levels of tamoxifen metabolizing enzymes cytochrome P450s as well as estrogen responsive genes were also quantified. At necropsy, breast tumors were detected in 44% of rats co-treated with tamoxifen+E2. No tumors were detected in the sham or tamoxifen only treatment groups whereas in the E2 only treatment group, the tumor incidence was 82%. Co-treatment with tamoxifen decreased GPx and catalase levels; did not completely inhibit E2-mediated oxidative DNA damage and estrogen-responsive genes monoamine oxygenase B1 (MaoB1) and cell death inducing DFF45 like effector C (Cidec) but differentially affected the levels of tamoxifen metabolizing enzymes. In summary, our studies suggest that although tamoxifen treatment inhibits estrogen-induced breast tumor development and increases the latency of tumor development, it does not completely abrogate breast tumor development in a rat model of estrogen-induced breast cancer. The inability of tamoxifen to completely inhibit E2-induced breast carcinogenesis may be because of increased estrogen-mediated oxidant burden
    corecore