32 research outputs found
Evolutionary Signatures amongst Disease Genes Permit Novel Methods for Gene Prioritization and Construction of Informative Gene-Based Networks
<div><p>Genes involved in the same function tend to have similar evolutionary histories, in that their rates of evolution covary over time. This coevolutionary signature, termed Evolutionary Rate Covariation (ERC), is calculated using only gene sequences from a set of closely related species and has demonstrated potential as a computational tool for inferring functional relationships between genes. To further define applications of ERC, we first established that roughly 55% of genetic diseases posses an ERC signature between their contributing genes. At a false discovery rate of 5% we report 40 such diseases including cancers, developmental disorders and mitochondrial diseases. Given these coevolutionary signatures between disease genes, we then assessed ERC's ability to prioritize known disease genes out of a list of unrelated candidates. We found that in the presence of an ERC signature, the true disease gene is effectively prioritized to the top 6% of candidates on average. We then apply this strategy to a melanoma-associated region on chromosome 1 and identify <i>MCL1</i> as a potential causative gene. Furthermore, to gain global insight into disease mechanisms, we used ERC to predict molecular connections between 310 nominally distinct diseases. The resulting “disease map” network associates several diseases with related pathogenic mechanisms and unveils many novel relationships between clinically distinct diseases, such as between Hirschsprung's disease and melanoma. Taken together, these results demonstrate the utility of molecular evolution as a gene discovery platform and show that evolutionary signatures can be used to build informative gene-based networks.</p></div
ERC values between complement deficiency genes.
<p>A) Complement genes <i>C1S</i> and <i>CFI</i> show variation in their evolutionary rates between branches of the mammalian phylogeny. Branches are color-coded according to rate. (Red is for rapid evolution, blue for slow, and intermediate shades for rates in between.) Tree topology and distances between species are the same for each gene. B) The same evolutionary rates for <i>C1S</i> and <i>CFI</i> are plotted against each other. Their correlation is apparent here in the best-fit line and correlation coefficient of 0.806. C) This matrix contains all pairwise ERC values between the OMIM genes for complement deficiency. Cells are shaded red according to the intensity of their departure from the null expectation. Blue arrows indicate the genes <i>C1S</i> and <i>CFI</i>. It is notable that most values are positive, whereas a random collection of genes would contain equal proportions of positive and negative values. There are also many clusters of functionally related complement proteins that contain very strong signals of ERC. The C1-related proteins in the upper left corner are a prime example of such an ERC hotspot.</p
ERC gene prioritization compared to other methods.
<p>ERC gene prioritization compared to other methods.</p
Diseases with significant ERC at a 5% false discovery rate.
<p>Diseases with significant ERC at a 5% false discovery rate.</p
ERC disease gene prioritization.
<p>The prioritization of the true disease gene relative to its chromosomal neighbors improves with a stronger ERC signal within the training set. A low p-value (x-axis) indicates strong ERC within a training set. Prioritization (y-axis) is presented as the proportion of candidate genes scoring lower than the true disease gene, i.e. higher represents better prioritization. The blue series is for diseases with training sets with 20 or fewer genes, representing the majority (70%) of OMIM diseases interrogated. The dotted green line is for those diseases with larger training sets.</p
Disease gene groupings P-value distribution.
<p>P-values represent the significance of elevated mean ERC within a particular disease. There is a notable excess of low p-values, indicating a large number of diseases with an ERC signature between their genes. False discovery rate analyses show that approximately 55% of disease states interrogated have significantly elevated ERC values.</p
Overexpression of <i>RECQL4</i> results in increased RAD51 foci and decreased tail moment.
(A) Overexpression of RECQL4 results in increased RAD51 foci, which is dependent on its helicase activity. U2OS cells were transfected with an empty plasmid or a plasmid expressing RECQL4 or RECQL4-K508A under a CMV promoter. The cells were either mock or cisplatin treated for one hour and after a two-hour recovery, imaged for RAD51 foci or DAPI by immunofluorescence. RAD51 foci was quantified from 200 cells per condition for each experiment. The experiment was performed three to five times and the median was graphed (Unprocessed foci count in S9 Data). Representative images are shown. (B) Overexpression of RECQL4 results decreased tail moment following cisplatin exposure, which is dependent on its helicase activity. U2OS cells were treated similarly to the immunofluorescence experiment, before being harvested for neutral comet assay. At least 40 comets were counted per condition for each experiment. The experiment was performed four times and the mean and standard deviation was graphed (Unprocessed tail moments in S10 Data).</p