144 research outputs found
Towards comprehensive syntactic and semantic annotations of the clinical narrative
Objective: To create annotated clinical narratives with layers of syntactic and semantic labels to facilitate advances in clinical natural language processing (NLP). To develop NLP algorithms and open source components. Methods: Manual annotation of a clinical narrative corpus of 127 606 tokens following the Treebank schema for syntactic information, PropBank schema for predicate-argument structures, and the Unified Medical Language System (UMLS) schema for semantic information. NLP components were developed. Results: The final corpus consists of 13 091 sentences containing 1772 distinct predicate lemmas. Of the 766 newly created PropBank frames, 74 are verbs. There are 28 539 named entity (NE) annotations spread over 15 UMLS semantic groups, one UMLS semantic type, and the Person semantic category. The most frequent annotations belong to the UMLS semantic groups of Procedures (15.71%), Disorders (14.74%), Concepts and Ideas (15.10%), Anatomy (12.80%), Chemicals and Drugs (7.49%), and the UMLS semantic type of Sign or Symptom (12.46%). Inter-annotator agreement results: Treebank (0.926), PropBank (0.891–0.931), NE (0.697–0.750). The part-of-speech tagger, constituency parser, dependency parser, and semantic role labeler are built from the corpus and released open source. A significant limitation uncovered by this project is the need for the NLP community to develop a widely agreed-upon schema for the annotation of clinical concepts and their relations. Conclusions: This project takes a foundational step towards bringing the field of clinical NLP up to par with NLP in the general domain. The corpus creation and NLP components provide a resource for research and application development that would have been previously impossible
Non SUSY Unification in Left-Right Models
We explore in a model independent way the possibility of achieving the non
supersymmetric gauge coupling unification within left-right symmetric models,
with the minimal particle content at the left-right mass scale which could be
as low as 1 TeV in a variety of models, and with a unification scale M in the
range GeV GeV.Comment: 18 pages, Latex file, uses epsf style, four figures. Submitted for
publication to Phys. Rev. D on Oct. 13, 199
Gene and pathway level analyses of germline DNA-repair gene variants and prostate cancer susceptibility using the iCOGS-genotyping array.
BACKGROUND: Germline mutations within DNA-repair genes are implicated in susceptibility to multiple forms of cancer. For prostate cancer (PrCa), rare mutations in BRCA2 and BRCA1 give rise to moderately elevated risk, whereas two of B100 common, low-penetrance PrCa susceptibility variants identified so far by genome-wide association studies implicate RAD51B and RAD23B. METHODS: Genotype data from the iCOGS array were imputed to the 1000 genomes phase 3 reference panel for 21 780 PrCa cases and 21 727 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. We subsequently performed single variant, gene and pathway-level analyses using 81 303 SNPs within 20 Kb of a panel of 179 DNA-repair genes. RESULTS: Single SNP analyses identified only the previously reported association with RAD51B. Gene-level analyses using the SKAT-C test from the SNP-set (Sequence) Kernel Association Test (SKAT) identified a significant association with PrCa for MSH5. Pathway-level analyses suggested a possible role for the translesion synthesis pathway in PrCa risk and Homologous recombination/Fanconi Anaemia pathway for PrCa aggressiveness, even though after adjustment for multiple testing these did not remain significant. CONCLUSIONS: MSH5 is a novel candidate gene warranting additional follow-up as a prospective PrCa-risk locus. MSH5 has previously been reported as a pleiotropic susceptibility locus for lung, colorectal and serous ovarian cancers.This is the final version of the article. It first appeared from Nature Publishing Group via http://dx.doi.org/10.1038/bjc.2016.5
Genetic randomization reveals functional relationships among morphologic and tissue-quality traits that contribute to bone strength and fragility
We examined femora from adult AXB/BXA recombinant inbred (RI) mouse strains to identify skeletal traits that are functionally related and to determine how functional interactions among these traits contribute to genetic variability in whole-bone stiffness, strength, and toughness. Randomization of A/J and C57BL/6J genomic regions resulted in each adult male and female RI strain building mechanically functional femora by assembling unique sets of morphologic and tissue-quality traits. A correlation analysis was conducted using the mean trait values for each RI strain. A third of the 66 correlations examined were significant, indicating that many bone traits covaried or were functionally related. Path analysis revealed important functional interactions among bone slenderness, cortical thickness, and tissue mineral density. The path coefficients describing these functional relations were similar for both sexes. The causal relationship among these three traits suggested that cellular processes during growth simultaneously regulate bone slenderness, cortical thickness, and tissue mineral density so that the combination of traits is sufficiently stiff and strong to satisfy daily loading demands. A disadvantage of these functional interactions was that increases in tissue mineral density also deleteriously affected tissue ductility. Consequently, slender bones with high mineral density may be stiff and strong but they are also brittle. Thus, genetically randomized mouse strains revealed a basic biological paradigm that allows for flexibility in building bones that are functional for daily activities but that creates preferred sets of traits under extreme loading conditions. Genetic or environmental perturbations that alter these functional interactions during growth would be expected to lead to loss of function and suboptimal adult bone quality
Polyunsaturated fatty acids and prostate cancer risk: a Mendelian randomisation analysis from the PRACTICAL consortium.
BACKGROUND: Prostate cancer is a common cancer worldwide with no established modifiable lifestyle factors to guide prevention. The associations between polyunsaturated fatty acids (PUFAs) and prostate cancer risk have been inconsistent. Using Mendelian randomisation, we evaluated associations between PUFAs and prostate cancer risk. METHODS: We used individual-level data from a consortium of 22 721 cases and 23 034 controls of European ancestry. Externally-weighted PUFA-specific polygenic risk scores (wPRSs), with explanatory variation ranging from 0.65 to 33.07%, were constructed and used to evaluate associations with prostate cancer risk per one standard deviation (s.d.) increase in genetically-predicted plasma PUFA levels using multivariable-adjusted unconditional logistic regression. RESULTS: No overall association was observed between the genetically-predicted PUFAs evaluated in this study and prostate cancer risk. However, risk reductions were observed for short-chain PUFAs, linoleic (ORLA=0.95, 95%CI=0.92, 0.98) and α-linolenic acids (ORALA=0.96, 95%CI=0.93, 0.98), among men <62 years; whereas increased risk was found among men ⩾62 years for LA (ORLA=1.04, 95%CI=1.01, 1.07). For long-chain PUFAs (i.e., arachidonic, eicosapentaenoic, and docosapentaenoic acids), increased risks were observed among men <62 years (ORAA=1.05, 95%CI=1.02, 1.08; OREPA=1.04, 95%CI=1.01, 1.06; ORDPA=1.05, 95%CI=1.02, 1.08). CONCLUSION: Results from this study suggest that circulating ω-3 and ω-6 PUFAs may have a different role in the aetiology of early- and late-onset prostate cancer
Evaluating Approaches for Constructing Polygenic Risk Scores for Prostate Cancer in Men of African and European Ancestry
Genome-wide polygenic risk scores (GW-PRSs) have been reported to have better predictive ability than PRSs based on genome-wide significance thresholds across numerous traits. We compared the predictive ability of several GW-PRS approaches to a recently developed PRS of 269 established prostate cancer-risk variants from multi-ancestry GWASs and fine-mapping studies (PRS269). GW-PRS models were trained with a large and diverse prostate cancer GWAS of 107,247 cases and 127,006 controls that we previously used to develop the multi-ancestry PRS269. Resulting models were independently tested in 1,586 cases and 1,047 controls of African ancestry from the California Uganda Study and 8,046 cases and 191,825 controls of European ancestry from the UK Biobank and further validated in 13,643 cases and 210,214 controls of European ancestry and 6,353 cases and 53,362 controls of African ancestry from the Million Veteran Program. In the testing data, the best performing GW-PRS approach had AUCs of 0.656 (95% CI = 0.635-0.677) in African and 0.844 (95% CI = 0.840-0.848) in European ancestry men and corresponding prostate cancer ORs of 1.83 (95% CI = 1.67-2.00) and 2.19 (95% CI = 2.14-2.25), respectively, for each SD unit increase in the GW-PRS. Compared to the GW-PRS, in African and European ancestry men, the PRS269 had larger or similar AUCs (AUC = 0.679, 95% CI = 0.659-0.700 and AUC = 0.845, 95% CI = 0.841-0.849, respectively) and comparable prostate cancer ORs (OR = 2.05, 95% CI = 1.87-2.26 and OR = 2.21, 95% CI = 2.16-2.26, respectively). Findings were similar in the validation studies. This investigation suggests that current GW-PRS approaches may not improve the ability to predict prostate cancer risk compared to the PRS269 developed from multi-ancestry GWASs and fine-mapping
Risk Analysis of Prostate Cancer in PRACTICAL, a Multinational Consortium, Using 25 Known Prostate Cancer Susceptibility Loci.
BACKGROUND: Genome-wide association studies have identified multiple genetic variants associated with prostate cancer risk which explain a substantial proportion of familial relative risk. These variants can be used to stratify individuals by their risk of prostate cancer. METHODS: We genotyped 25 prostate cancer susceptibility loci in 40,414 individuals and derived a polygenic risk score (PRS). We estimated empirical odds ratios (OR) for prostate cancer associated with different risk strata defined by PRS and derived age-specific absolute risks of developing prostate cancer by PRS stratum and family history. RESULTS: The prostate cancer risk for men in the top 1% of the PRS distribution was 30.6 (95% CI, 16.4-57.3) fold compared with men in the bottom 1%, and 4.2 (95% CI, 3.2-5.5) fold compared with the median risk. The absolute risk of prostate cancer by age of 85 years was 65.8% for a man with family history in the top 1% of the PRS distribution, compared with 3.7% for a man in the bottom 1%. The PRS was only weakly correlated with serum PSA level (correlation = 0.09). CONCLUSIONS: Risk profiling can identify men at substantially increased or reduced risk of prostate cancer. The effect size, measured by OR per unit PRS, was higher in men at younger ages and in men with family history of prostate cancer. Incorporating additional newly identified loci into a PRS should improve the predictive value of risk profiles. IMPACT: We demonstrate that the risk profiling based on SNPs can identify men at substantially increased or reduced risk that could have useful implications for targeted prevention and screening programs.D F. Easton was recipient of the CR-UK grant C1287/A10118. R
A. Eeles was recipient of the CR-UK grant C5047/A10692 and B E.
Henderson was recipient of the NIH grant 1U19CA148537-01This is the author accepted manuscript. The final version is available via AACR at http://cebp.aacrjournals.org/content/early/2015/04/02/1055-9965.EPI-14-0317.long
A Meta-analysis of Multiple Myeloma Risk Regions in African and European Ancestry Populations Identifies Putatively Functional Loci
Genome-wide association studies (GWAS) in European populations have identified genetic risk variants associated with multiple myeloma (MM)
An integrative multi-omics analysis to identify candidate DNA methylation biomarkers related to prostate cancer risk
Abstract: It remains elusive whether some of the associations identified in genome-wide association studies of prostate cancer (PrCa) may be due to regulatory effects of genetic variants on CpG sites, which may further influence expression of PrCa target genes. To search for CpG sites associated with PrCa risk, here we establish genetic models to predict methylation (N = 1,595) and conduct association analyses with PrCa risk (79,194 cases and 61,112 controls). We identify 759 CpG sites showing an association, including 15 located at novel loci. Among those 759 CpG sites, methylation of 42 is associated with expression of 28 adjacent genes. Among 22 genes, 18 show an association with PrCa risk. Overall, 25 CpG sites show consistent association directions for the methylation-gene expression-PrCa pathway. We identify DNA methylation biomarkers associated with PrCa, and our findings suggest that specific CpG sites may influence PrCa via regulating expression of candidate PrCa target genes
- …