112 research outputs found
Mammalian Genes Preferentially Co-Retained in Radiation Hybrid Panels Tend to Avoid Coexpression
Coexpression has been frequently used to explore modules of functionally related genes in eukaryotic genomes. However, we found that genetically interacting mammalian genes identified through radiation hybrid (RH) genotypes tend not to be coexpressed across tissues. This pattern remained unchanged after controlling for potential confounding factors, including chromosomal linkage, chromosomal distance, and gene duplication. Because >99.9% of the genetically interacting genes were identified according to the higher co-retention frequencies, our observation implies that coexpression is not necessarily an indication of the need for the co-presence of two genes in the genome, which is a prerequisite for cofunctionality of their coding proteins in the cell. Therefore, coexpression information must be applied cautiously to the exploration of the functional relatedness of genes in a genome
Classification of heterogeneous microarray data by maximum entropy kernel
<p>Abstract</p> <p>Background</p> <p>There is a large amount of microarray data accumulating in public databases, providing various data waiting to be analyzed jointly. Powerful kernel-based methods are commonly used in microarray analyses with support vector machines (SVMs) to approach a wide range of classification problems. However, the standard vectorial data kernel family (linear, RBF, etc.) that takes vectorial data as input, often fails in prediction if the data come from different platforms or laboratories, due to the low gene overlaps or consistencies between the different datasets.</p> <p>Results</p> <p>We introduce a new type of kernel called maximum entropy (ME) kernel, which has no pre-defined function but is generated by kernel entropy maximization with sample distance matrices as constraints, into the field of SVM classification of microarray data. We assessed the performance of the ME kernel with three different data: heterogeneous kidney carcinoma, noise-introduced leukemia, and heterogeneous oral cavity carcinoma metastasis data. The results clearly show that the ME kernel is very robust for heterogeneous data containing missing values and high-noise, and gives higher prediction accuracies than the standard kernels, namely, linear, polynomial and RBF.</p> <p>Conclusion</p> <p>The results demonstrate its utility in effectively analyzing promiscuous microarray data of rare specimens, e.g., minor diseases or species, that present difficulty in compiling homogeneous data in a single laboratory.</p
Toll-like receptor signaling adapter proteins govern spread of neuropathic pain and recovery following nerve injury in male mice.
BackgroundSpinal Toll-like receptors (TLRs) and signaling intermediaries have been implicated in persistent pain states. We examined the roles of two major TLR signaling pathways and selected TLRs in a mononeuropathic allodynia.MethodsL5 spinal nerve ligation (SNL) was performed in wild type (WT, C57BL/6) male and female mice and in male Tlr2-/-Tlr3-/-, Tlr4-/-, Tlr5-/-, Myd88-/-, Triflps2, Myd88/Triflps2, Tnf-/-, and Ifnar1-/- mice. We also examined L5 ligation in Tlr4-/- female mice. We examined tactile allodynia using von Frey hairs. Iba-1 (microglia) and GFAP (astrocytes) were assessed in spinal cords by immunostaining. Tactile thresholds were analyzed by 1- and 2-way ANOVA and the Bonferroni post hoc test was used.ResultsIn WT male and female mice, SNL lesions resulted in a persistent and robust ipsilateral, tactile allodynia. In males with TLR2, 3, 4, or 5 deficiencies, tactile allodynia was significantly, but incompletely, reversed (approximately 50%) as compared to WT. This effect was not seen in female Tlr4-/- mice. Increases in ipsilateral lumbar Iba-1 and GFAP were seen in mutant and WT mice. Mice deficient in MyD88, or MyD88 and TRIF, showed an approximately 50% reduction in withdrawal thresholds and reduced ipsilateral Iba-1. In contrast, TRIF and interferon receptor null mice developed a profound ipsilateral and contralateral tactile allodynia. In lumbar sections of the spinal cords, we observed a greater increase in Iba-1 immunoreactivity in the TRIF-signaling deficient mice as compared to WT, but no significant increase in GFAP. Removing MyD88 abrogated the contralateral allodynia in the TRIF signaling-deficient mice. Conversely, IFNβ, released downstream to TRIF signaling, administered intrathecally, temporarily reversed the tactile allodynia.ConclusionsThese observations suggest a critical role for the MyD88 pathway in initiating neuropathic pain, but a distinct role for the TRIF pathway and interferon in regulating neuropathic pain phenotypes in male mice
Functional Diversification of Paralogous Transcription Factors via Divergence in DNA Binding Site Motif and in Expression
BACKGROUND: Gene duplication is a major driver of evolutionary innovation as it allows for an organism to elaborate its existing biological functions via specialization or diversification of initially redundant gene paralogs. Gene function can diversify in several ways. Transcription factor gene paralogs in particular, can diversify either by changes in their tissue-specific expression pattern or by changes in the DNA binding site motif recognized by their protein product, which in turn alters their gene targets. The relationship between these two modes of functional diversification of transcription factor paralogs has not been previously investigated, and is essential for understanding adaptive evolution of transcription factor gene families. FINDINGS: Based on a large set of human paralogous transcription factor pairs, we show that when the DNA binding site motifs of transcription factor paralogs are similar, the expressions of the genes that encode the paralogs have diverged, so in general, at most one of the paralogs is highly expressed in a tissue. Moreover, paralogs with diverged DNA binding site motifs tend to be diverged in their function. Conversely, two paralogs that are highly expressed in a tissue tend to have dissimilar DNA binding site motifs. We have also found that in general, within a paralogous family, tissue-specific decrease in gene expression is more frequent than what is expected by chance. CONCLUSIONS: While previous investigations of paralogous gene diversification have only considered coding sequence divergence, by explicitly quantifying divergence in DNA binding site motif, our work presents a new paradigm for investigating functional diversification. Consistent with evolutionary expectation, our quantitative analysis suggests that paralogous transcription factors have survived extinction in part, either through diversification of their DNA binding site motifs or through alterations in their tissue-specific expression levels
Viral pathogens associated with acute respiratory infections in central vietnamese children.
Hospitalized Vietnamese children with acute respiratory infection were investigated for 13 viral pathogens using multiplex-polymerase chain reaction. We enrolled 958 children of whom 659 (69%) had documented viral infection: rhinovirus (28%), respiratory syncytial virus (23%), influenza virus (15%), adenovirus (5%), human metapneumo virus (4.5%), parainfluenza virus (5%), and bocavirus (2%). These Vietnamese children had a range of respiratory viruses which underscores the need for enhanced acute respiratory infection surveillance in tropical developing countries
Large-scale integration of cancer microarray data identifies a robust common cancer signature
<p>Abstract</p> <p>Background</p> <p>There is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer.</p> <p>Results</p> <p>In this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the <it>T</it>op-<it>S</it>coring <it>P</it>air of <it>G</it>roups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data.</p> <p>Conclusion</p> <p>By combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer.</p
Nutritional status of pre-school children from low income families
<p>Abstract</p> <p>Background</p> <p>We evaluated growth and nutritional status of preschool children between 2 and 6 years old from low income families from 14 daycare centers.</p> <p>Methods</p> <p>Cross-sectional study with 1544 children from daycare centers of Santo Andre, Brazil. Body weight (W), height (H) and body mass index (BMI) were classified according to the 2000 National Center for Health Statistics (CDC/NCHS). Cutoff points for nutritional disorders: -2 z scores and 2.5 and 10 percentiles for malnutrition risk, 85 to 95 percentile for overweight and above BMI 95 percentile for obesity. Stepwise Forward Regression method was used including age, gender, birth weight, breastfeeding duration, age of mother at birth and period of time they attended the daycare center.</p> <p>Results</p> <p>Children presented mean z scores of H, W and BMI above the median of the CDC/NCHS reference. Girls were taller and heavier than boys, while we observed similar BMI between both genders. The z scores tended to rise with age. A Pearson Coefficient of Correlation of 0.89 for W, 0.93 for H and 0.95 for BMI was documented indicating positive association of age with weight, height and BMI. The frequency of children below -2 z scores was lower than expected: 1.5% for W, 1.75% for H and 0% for BMI, which suggests that there were no malnourished children. The other extremity of the distribution evidenced prevalence of overweight and obesity of 16.8% and 10.8%, respectively.</p> <p>Conclusion</p> <p>Low income preschool children are in an advanced stage of nutritional transition with a high prevalence of overweight.</p
Clustering of Codons with Rare Cognate tRNAs in Human Genes Suggests an Extra Level of Expression Regulation
In species with large effective population sizes, highly expressed genes tend to be encoded by codons with highly abundant cognate tRNAs to maximize translation rate. However, there has been little evidence for a similar bias of synonymous codons in highly expressed human genes. Here, we ask instead whether there is evidence for the selection for codons associated with low abundance tRNAs. Rather than averaging the codon usage of complete genes, we scan the genes for windows with deviating codon usage. We show that there is a significant over representation of human genes that contain clusters of codons with low abundance cognate tRNAs. We name these regions, which on average have a 50% reduction in the amount of cognate tRNA available compared to the remainder of the gene, RTS (rare tRNA score) clusters. We observed a significant reduction in the substitution rate between the human RTS clusters and their orthologous chimp sequence, when compared to non–RTS cluster sequences. Overall, the genes with an RTS cluster have higher tissue specificity than the non–RTS cluster genes. Furthermore, these genes are functionally enriched for transcription regulation. As genes that regulate transcription in lower eukaryotes are known to be involved in translation on demand, this suggests that the mechanism of translation level expression regulation also exists within the human genome
Preservation of Ranking Order in the Expression of Human Housekeeping Genes
Housekeeping (HK) genes fulfill the basic needs for a cell to survive and function properly. Their ubiquitous expression, originally thought to be constant, can vary from tissue to tissue, but this variation remains largely uncharacterized and it could not be explained by previously identified properties of HK genes such as short gene length and high GC content. By analyzing microarray expression data for human genes, we uncovered a previously unnoted characteristic of HK gene expression, namely that the ranking order of their expression levels tends to be preserved from one tissue to another. Further analysis by tensor product decomposition and pathway stratification identified three main factors of the observed ranking preservation, namely that, compared to those of non-HK (NHK) genes, the expression levels of HK genes show a greater degree of dispersion (less overlap), stableness (a smaller variation in expression between tissues), and correlation of expression. Our results shed light on regulatory mechanisms of HK gene expression that are probably different for different HK genes or pathways, but are consistent and coordinated in different tissues
- …