28 research outputs found
Secure searching of biomarkers through hybrid homomorphic encryption scheme
Background: As genome sequencing technology develops rapidly, there is an increasing need to keep genomic data secure even when it is stored in the cloud and still used for research. We are interested in designing a protocol for the secure outsourcing of the matching problem on encrypted data. Method: We propose an efficient method to securely search for the position matching the query data and to extract some information at that position. After decryption, only a small number of comparisons with the query information need to be performed in the plaintext state. We apply this method to find a set of biomarkers in encrypted genomes. The important feature of our method is that it encodes a genomic database as a single element of a polynomial ring. Result: Since our method requires a single homomorphic multiplication of the hybrid scheme for query computation, it has an advantage over previous methods in parameter size, computational complexity, and communication cost. In particular, the extraction procedure not only prevents leakage of database information that has not been queried by the user but also reduces the communication cost by half. We evaluate the performance of our method and verify that computation on large-scale personal data can be securely and practically outsourced to a cloud environment during data analysis. It takes about 3.9 s to search-and-extract the reference and alternate sequences at the queried position in a database of size 4M. Conclusion: Our solution for finding a set of biomarkers in DNA sequences shows the progress of cryptographic techniques in terms of their capability to support real-world genome data analysis in a cloud environment
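The idea of packing a database into one polynomial-ring element and retrieving an entry with a single multiplication can be illustrated in plaintext. The sketch below is not the scheme from the abstract: it contains no encryption, and the ring Z_t[x]/(x^n + 1), the toy database, and the function names (`negacyclic_mul`, `selector`, `lookup`) are all assumptions chosen for illustration. It only shows that multiplying by the monomial x^(-j) moves the j-th coefficient to the constant term.

```python
def negacyclic_mul(a, b, t):
    """Multiply two polynomials in Z_t[x]/(x^n + 1), given as coefficient lists."""
    n = len(a)
    out = [0] * n
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            k = i + j
            if k < n:
                out[k] = (out[k] + ai * bj) % t
            else:
                # x^n == -1 in this ring, so wrapped terms change sign
                out[k - n] = (out[k - n] - ai * bj) % t
    return out


def selector(j, n, t):
    """Return the monomial x^(-j), which equals -x^(n-j) for j > 0."""
    q = [0] * n
    if j == 0:
        q[0] = 1
    else:
        q[n - j] = t - 1  # coefficient -1 modulo t
    return q


def lookup(db, j, t):
    """Read entry j of the encoded database with one ring multiplication."""
    product = negacyclic_mul(db, selector(j, len(db), t), t)
    return product[0]  # the constant term now holds db[j]


# Hypothetical toy database; every entry must be smaller than the modulus t.
db = [3, 1, 4, 1, 5, 9, 2, 6]
t = 17
print(lookup(db, 5, t))
```

In a homomorphic setting the same multiplication would be carried out on ciphertexts, which is why a single multiplication per query keeps parameters and cost small.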
A Phenome-Based Functional Analysis of Transcription Factors in the Cereal Head Blight Fungus, Fusarium graminearum
Fusarium graminearum is an important plant pathogen that causes head blight of major cereal crops. The fungus produces mycotoxins that are harmful to animals and humans. In this study, a systematic analysis of 17 phenotypes of the mutants in 657 Fusarium graminearum genes encoding putative transcription factors (TFs) resulted in a database of over 11,000 phenotypes (phenome). This database provides comprehensive insights into how this cereal pathogen of global significance regulates traits important for growth, development, stress response, pathogenesis, and toxin production and how transcriptional regulations of these traits are interconnected. In-depth analysis of TFs involved in sexual development revealed that mutations causing defects in perithecia development frequently affect multiple other phenotypes, and the TFs associated with sexual development tend to be highly conserved in the fungal kingdom. Besides providing many new insights into understanding the function of F. graminearum TFs, this mutant library and phenome will be a valuable resource for characterizing the gene expression network in this fungus and serve as a reference for studying how different fungi have evolved to control various cellular processes at the transcriptional level
Characterizing steroid hormone receptor chromatin binding landscapes in male and female breast cancer
Male breast cancer (MBC) is rare and largely hormonally driven. Here, the authors examine the action of steroid hormone receptors in male and female breast cancers and find gender selective hormone receptor action that associates with the survival of MBC patients
A CRISPR-Cas9 screen identifies essential CTCF anchor sites for estrogen receptor-driven breast cancer cell proliferation
Estrogen receptor α (ERα) is an enhancer-activating transcription factor, a key driver of breast cancer and a main target for cancer therapy. ERα-mediated gene regulation requires proper chromatin conformation to facilitate interactions between ERα-bound enhancers and their target promoters. A major determinant of chromatin structure is the CCCTC-binding factor (CTCF), which dimerizes and together with cohesin stabilizes chromatin loops and forms the boundaries of topologically associated domains. However, whether CTCF-binding elements (CBEs) are essential for ERα-driven cell proliferation is unknown. To address this question in a global manner, we implemented a CRISPR-based functional genetic screen targeting CBEs located in the vicinity of ERα-bound enhancers. We identified four functional CBEs and demonstrated the role of one of them in inducing chromatin conformation changes in favor of activation of PREX1, a key ERα target gene in breast cancer. Indeed, high PREX1 expression is a bona fide marker of ERα-dependency in cell lines, and is associated with good outcome after anti-hormonal treatment. Altogether, our data show that distinct CTCF-mediated chromatin structures are required for ERα-driven breast cancer cell proliferation
CUEDC1 is a primary target of ERα essential for the growth of breast cancer cells
Breast cancer is the most prevalent type of malignancy in women with ∼1.7 million new cases diagnosed annually, of which the majority express ERα (ESR1), a ligand-dependent transcription factor. Genome-wide chromatin binding maps suggest that ERα may control the expression of thousands of genes, posing a great challenge in identifying functional targets. Recently, we developed a CRISPR-Cas9 functional genetic screening approach to identify enhancers required for ERα-positive breast cancer cell proliferation. We validated several candidates, including CUTE, a putative ERα-responsive enhancer located in the first intron of CUEDC1 (CUE-domain containing protein). Here, we show that CUTE controls CUEDC1 expression, and that this interaction is essential for ERα-mediated cell proliferation. Moreover, ectopic expression of CUEDC1, but not a CUE-domain mutant, rescues the defects in CUTE activity. Finally, CUEDC1 expression correlates positively with ERα in breast cancer. Thus, CUEDC1 is a functional target gene of ERα and is required for breast cancer cell proliferation
Myotis rufoniger genome sequence and analyses: M. rufoniger's genomic feature and the decreasing effective population size of Myotis bats
Myotis rufoniger is a vesper bat in the genus Myotis. Here we report the whole genome sequence and analyses of M. rufoniger. We generated 124 Gb of short-read DNA sequences with an estimated genome size of 1.88 Gb at a sequencing depth of 66x. The sequences were aligned to the M. brandtii bat reference genome at a mapping rate of 96.50%, covering 95.71% of the coding sequence region at 10x coverage. The divergence time of the Myotis bat family is estimated to be 11.5 million years, and the divergence time between M. rufoniger and its closest species M. davidii is estimated to be 10.4 million years. We found 1,239 function-altering M. rufoniger specific amino acid sequences from 929 genes compared to other Myotis bat and mammalian genomes. The functional enrichment test of the 929 genes detected amino acid changes in the melanin-associated DCT, SLC45A2, TYRP1, and OCA2 genes, possibly responsible for M. rufoniger's red fur color and a general coloration in Myotis. The N6AMT1 gene, associated with arsenic resistance, showed a high degree of function alteration in M. rufoniger. We further confirmed that M. rufoniger also has bat-specific sequences within the FSHB, GHR, IGF1R, TP53, MDM2, SLC45A2, RGS7BP, RHO, OPN1SW, and CNGB3 genes that have already been published to be related to bats' reproduction, lifespan, flight, low vision, and echolocation. Additionally, our demographic history analysis found that the effective population size of the Myotis clade has been consistently decreasing since ~30k years ago. M. rufoniger's effective population size was the lowest in Myotis bats, confirming its relatively low genetic diversity
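The sequencing depth quoted in this abstract follows from the other two figures it gives, since average depth is total sequenced bases divided by genome size. A minimal check of that arithmetic, where the function name `sequencing_depth` is hypothetical:

```python
def sequencing_depth(total_bases, genome_size):
    """Average depth = total sequenced bases / estimated genome size."""
    return total_bases / genome_size


# Figures taken from the abstract: 124 Gb of reads, 1.88 Gb genome.
depth = sequencing_depth(124e9, 1.88e9)
print(f"{depth:.1f}x")  # about 66x
```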
Cardiovascular disease, chronic kidney disease, and diabetes mortality burden of cardiometabolic risk factors from 1980 to 2010: A comparative risk assessment
Background: High blood pressure, blood glucose, serum cholesterol, and BMI are risk factors for cardiovascular diseases and some of these factors also increase the risk of chronic kidney disease and diabetes. We estimated mortality from cardiovascular diseases, chronic kidney disease, and diabetes that was attributable to these four cardiometabolic risk factors for all countries and regions from 1980 to 2010. Methods: We used data for exposure to risk factors by country, age group, and sex from pooled analyses of population-based health surveys. We obtained relative risks for the effects of risk factors on cause-specific mortality from meta-analyses of large prospective studies. We calculated the population attributable fractions for each risk factor alone, and for the combination of all risk factors, accounting for multicausality and for mediation of the effects of BMI by the other three risks. We calculated attributable deaths by multiplying the cause-specific population attributable fractions by the number of disease-specific deaths. We obtained cause-specific mortality from the Global Burden of Diseases, Injuries, and Risk Factors 2010 Study. We propagated the uncertainties of all the inputs to the final estimates. Findings: In 2010, high blood pressure was the leading risk factor for deaths due to cardiovascular diseases, chronic kidney disease, and diabetes in every region, causing more than 40% of worldwide deaths from these diseases; high BMI and glucose were each responsible for about 15% of deaths, and high cholesterol for more than 10%. After accounting for multicausality, 63% (10·8 million deaths, 95% CI 10·1-11·5) of deaths from these diseases in 2010 were attributable to the combined effect of these four metabolic risk factors, compared with 67% (7·1 million deaths, 6·6-7·6) in 1980. The mortality burden of high BMI and glucose nearly doubled from 1980 to 2010. 
At the country level, age-standardised death rates from these diseases attributable to the combined effects of these four risk factors surpassed 925 deaths per 100 000 for men in Belarus, Kazakhstan, and Mongolia, but were less than 130 deaths per 100 000 for women and less than 200 for men in some high-income countries including Australia, Canada, France, Japan, the Netherlands, Singapore, South Korea, and Spain. Interpretation: The salient features of the cardiometabolic disease and risk factor epidemic at the beginning of the 21st century are high blood pressure and an increasing effect of obesity and diabetes. The mortality burden of cardiometabolic risk factors has shifted from high-income to low-income and middle-income countries. Lowering cardiometabolic risks through dietary, behavioural, and pharmacological interventions should be a part of the global response to non-communicable diseases. Funding: UK Medical Research Council, US National Institutes of Health. © 2014 Elsevier Ltd
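The calculation chain described in the Methods (population attributable fractions per risk factor, a combined fraction, then attributable deaths) can be sketched as follows. This is a simplified illustration under stated assumptions, not the study's implementation: it uses the standard categorical-exposure formula for the attributable fraction, combines risks as 1 - Π(1 - PAF_i) on the assumption that they act independently, and omits both the mediation adjustment for BMI and uncertainty propagation. All function names and numbers are hypothetical and are not taken from the study.

```python
from math import prod


def paf(prevalence, relative_risk):
    """Population attributable fraction for a categorical exposure.

    prevalence[i] is the share of the population in exposure category i and
    relative_risk[i] its risk relative to the minimum-risk category (RR = 1).
    """
    excess = sum(p * (rr - 1.0) for p, rr in zip(prevalence, relative_risk))
    return excess / (excess + 1.0)


def combined_paf(pafs):
    """Joint fraction for several risks, assuming they act independently."""
    return 1.0 - prod(1.0 - f for f in pafs)


def attributable_deaths(fraction, deaths):
    """Attributable deaths = attributable fraction x cause-specific deaths."""
    return fraction * deaths


# Hypothetical inputs for illustration only.
single = paf([0.7, 0.2, 0.1], [1.0, 1.5, 2.0])
joint = combined_paf([0.40, 0.15])
print(round(single, 3), round(joint, 2), attributable_deaths(joint, 1000.0))
```

Combining the fractions multiplicatively rather than adding them keeps the joint fraction below 100% when several risk factors contribute to the same deaths.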
Principal network analysis: identification of subnetworks representing major dynamics using gene expression data.
Motivation: Systems biology attempts to describe complex systems behaviors in terms of dynamic operations of biological networks. However, there is a lack of tools that can effectively decode complex network dynamics over multiple conditions. Results: We present principal network analysis (PNA) that can automatically capture major dynamic activation patterns over multiple conditions and then generate protein and metabolic subnetworks for the captured patterns. We first demonstrated the utility of this method by applying it to a synthetic dataset. The results showed that PNA correctly captured the subnetworks representing dynamics in the data. We further applied PNA to two time-course gene expression profiles collected from (i) MCF7 cells after treatments of HRG at multiple doses and (ii) brain samples of four strains of mice infected with two prion strains. The resulting subnetworks and their interactions revealed network dynamics associated with HRG dose-dependent regulation of cell proliferation and differentiation and early PrPSc accumulation during prion infection.