210 research outputs found

    Discerning Tumor Status from Unstructured MRI Reports—Completeness of Information in Existing Reports and Utility of Automated Natural Language Processing

    Get PDF
    Information in electronic medical records is often in an unstructured free-text format. This format presents challenges for expedient data retrieval and may fail to convey important findings. Natural language processing (NLP) is an emerging technique for rapid and efficient clinical data retrieval. While proven in disease detection, the utility of NLP in discerning disease progression from free-text reports is untested. We aimed to (1) assess whether unstructured radiology reports contained sufficient information for tumor status classification; (2) develop an NLP-based data extraction tool to determine tumor status from unstructured reports; and (3) compare NLP and human tumor status classification outcomes. Consecutive follow-up brain tumor magnetic resonance imaging reports (2000–­2007) from a tertiary center were manually annotated using consensus guidelines on tumor status. Reports were randomized to NLP training (70%) or testing (30%) groups. The NLP tool utilized a support vector machines model with statistical and rule-based outcomes. Most reports had sufficient information for tumor status classification, although 0.8% did not describe status despite reference to prior examinations. Tumor size was unreported in 68.7% of documents, while 50.3% lacked data on change magnitude when there was detectable progression or regression. Using retrospective human classification as the gold standard, NLP achieved 80.6% sensitivity and 91.6% specificity for tumor status determination (mean positive predictive value, 82.4%; negative predictive value, 92.0%). In conclusion, most reports contained sufficient information for tumor status determination, though variable features were used to describe status. NLP demonstrated good accuracy for tumor status classification and may have novel application for automated disease status classification from electronic databases

    Position and sequence conservation in Amniota of polymorphic enhancer HS1.2 within the palindrome of IgH 3'Regulatory Region

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Immunoglobulin heavy chain (IgH) 3' Regulatory Region (3'RR), located at the 3' of the constant alpha gene, plays a crucial role in immunoglobulin production. In humans, there are 2 copies of the 3'RR, each composed of 4 main elements: 3 enhancers and a 20 bp tandem repeat. The single mouse 3'RR differs from the two human ones for the presence of 4 more regulative elements with the double copy of one enhancer at the border of a palindromic region.</p> <p>Results</p> <p>We compared the 3'RR organization in genomes of vertebrates to depict the evolutionary history of the region and highlight its shared features. We found that in the 8 species in which the whole region was included in a fully assembled contig (mouse, rat, dog, rabbit, panda, orangutan, chimpanzee, and human), the shared elements showed synteny and a highly conserved sequence, thus suggesting a strong evolutionary constraint. In these species, the wide 3'RR (~30 kb in human) bears a large palindromic sequence, consisting in two ~3 kb complementary branches spaced by a ~3 kb sequence always including the HS1.2 enhancer. In mouse and rat, HS3 is involved by the palindrome so that one copy of the enhancer is present on each side. A second relevant feature of our present work concerns human polymorphism of the HS1.2 enhancer, associated to immune diseases in our species. We detected a similar polymorphism in all the studied Catarrhini (a primate parvorder). The polymorphism consists of multiple copies of a 40 bp element up to 12 in chimpanzees, 8 in baboons, 6 in macaque, 5 in gibbons, 4 in humans and orangutan, separated by stretches of Cytosine. We show specific binding of this element to nuclear factors.</p> <p>Conclusions</p> <p>The nucleotide sequence of the palindrome is not conserved among evolutionary distant species, suggesting pressures for the maintenance of two self-matching regions driving a three-dimensional structure despite of the inter-specific divergence at sequence level. The information about the conservation of the palindromic structure and the settling in primates of the polymorphic feature of HS1.2 show the relevance of these structures in the control and modulation of the Ig production through the formation of possible three-dimensional structures.</p

    The Role of Cadmium and Nickel in Estrogen Receptor Signaling and Breast Cancer: Metalloestrogens or Not?

    Get PDF
    During the last half-century, incidences of breast cancer have increased globally. Various factors —genetic and environmental— have been implicated in the initiation and progression of this disease. One potential environmental risk factor that has not received a lot of attention is the exposure to heavy metals. While several mechanisms have been put forth describing how high concentrations of heavy metals play a role in carcinogenesis, it is unclear whether chronic, lowlevel exposure to certain heavy metals (i.e. cadmium and nickel), can directly result in the development and progression of cancer. Cadmium and nickel have been hypothesized to play a role in breast cancer development by acting as metalloestrogens— metals that bind to estrogen receptors and mimic the actions of estrogen. Since the lifetime exposure to estrogen is a wellestablished risk factor for breast cancer, anything that mimics its activity will likely contribute to the etiology of the disease. However, heavy metals, depending on their concentration, are capable of binding to a variety of proteins and may exert their toxicities by disrupting multiple cellular functions, complicating the analysis of whether heavy metal-induced carcinogenesis is mediated by the estrogen receptor. The purpose of this review is to discuss the various epidemiological, in vivo, and in vitro studies that show a link between the heavy metals, cadmium and nickel, and breast cancer development. We will particularly focus on the studies that test whether or not these two metals act as metalloestrogens in order to assess the strength of the data supporting this hypothesis

    The DSIF Subunits Spt4 and Spt5 Have Distinct Roles at Various Phases of Immunoglobulin Class Switch Recombination

    Get PDF
    Class-switch recombination (CSR), induced by activation-induced cytidine deaminase (AID), can be divided into two phases: DNA cleavage of the switch (S) regions and the joining of the cleaved ends of the different S regions. Here, we show that the DSIF complex (Spt4 and Spt5), a transcription elongation factor, is required for CSR in a switch-proficient B cell line CH12F3-2A cells, and Spt4 and Spt5 carry out independent functions in CSR. While neither Spt4 nor Spt5 is required for transcription of S regions and AID, expression array analysis suggests that Spt4 and Spt5 regulate a distinct subset of transcripts in CH12F3-2A cells. Curiously, Spt4 is critically important in suppressing cryptic transcription initiating from the intronic Sμ region. Depletion of Spt5 reduced the H3K4me3 level and DNA cleavage at the Sα region, whereas Spt4 knockdown did not perturb the H3K4me3 status and S region cleavage. H3K4me3 modification level thus correlated well with the DNA breakage efficiency. Therefore we conclude that Spt5 plays a role similar to the histone chaperone FACT complex that regulates H3K4me3 modification and DNA cleavage in CSR. Since Spt4 is not involved in the DNA cleavage step, we suspected that Spt4 might be required for DNA repair in CSR. We examined whether Spt4 or Spt5 is essential in non-homologous end joining (NHEJ) and homologous recombination (HR) as CSR utilizes general repair pathways. Both Spt4 and Spt5 are required for NHEJ and HR as determined by assay systems using synthetic repair substrates that are actively transcribed even in the absence of Spt4 and Spt5. Taken together, Spt4 and Spt5 can function independently in multiple transcription-coupled steps of CSR

    Negative Supercoiling Creates Single-Stranded Patches of DNA That Are Substrates for AID–Mediated Mutagenesis

    Get PDF
    Antibody diversification necessitates targeted mutation of regions within the immunoglobulin locus by activation-induced cytidine deaminase (AID). While AID is known to act on single-stranded DNA (ssDNA), the source, structure, and distribution of these substrates in vivo remain unclear. Using the technique of in situ bisulfite treatment, we characterized these substrates—which we found to be unique to actively transcribed genes—as short ssDNA regions, that are equally distributed on both DNA strands. We found that the frequencies of these ssDNA patches act as accurate predictors of AID activity at reporter genes in hypermutating and class switching B cells as well as in Escherichia coli. Importantly, these ssDNA patches rely on transcription, and we report that transcription-induced negative supercoiling enhances both ssDNA tract formation and AID mutagenesis. In addition, RNaseH1 expression does not impact the formation of these ssDNA tracts indicating that these structures are distinct from R-loops. These data emphasize the notion that these transcription-generated ssDNA tracts are one of many in vivo substrates for AID

    Genome-Wide Analyses of Recombination Prone Regions Predict Role of DNA Structural Motif in Recombination

    Get PDF
    HapMap findings reveal surprisingly asymmetric distribution of recombinogenic regions. Short recombinogenic regions (hotspots) are interspersed between large relatively non-recombinogenic regions. This raises the interesting possibility of DNA sequence and/or other cis- elements as determinants of recombination. We hypothesized the involvement of non-canonical sequences that can result in local non-B DNA structures and tested this using the G-quadruplex DNA as a model. G-quadruplex or G4 DNA is a unique form of four-stranded non-B DNA structure that engages certain G-rich sequences, presence of such motifs has been noted within telomeres. In support of this hypothesis, genome-wide computational analyses presented here reveal enrichment of potential G4 (PG4) DNA forming sequences within 25618 human hotspots relative to 9290 coldspots (p<0.0001). Furthermore, co-occurrence of PG4 DNA within several short sequence elements that are associated with recombinogenic regions was found to be significantly more than randomly expected. Interestingly, analyses of more than 50 DNA binding factors revealed that co-occurrence of PG4 DNA with target DNA binding sites of transcription factors c-Rel, NF-kappa B (p50 and p65) and Evi-1 was significantly enriched in recombination-prone regions. These observations support involvement of G4 DNA in recombination, predicting a functional model that is consistent with duplex-strand separation induced by formation of G4 motifs in supercoiled DNA and/or when assisted by other cellular factors

    Toxicological aspects of the use of phenolic compounds in disease prevention

    Get PDF
    The consumption of a diet low in fat and enhanced by fruits and vegetables, especially rich in phenolic compounds, may reduce risks of many civilization diseases. The use of traditional medicines, mainly derived from plant sources, has become an attractive segment in the management of many lifestyle diseases. Concerning the application of dietary supplements (based on phenolic compounds) in common practice, the ongoing debate over possible adverse effects of certain nutrients and dosage levels is of great importance. Since dietary supplements are not classified as drugs, their potential toxicities and interactions have not been thoroughly evaluated. First, this review will introduce phenolic compounds as natural substances beneficial for human health. Second, the potential dual mode of action of flavonoids will be outlined. Third, potential deleterious impacts of phenolic compounds utilization will be discussed: pro-oxidant and estrogenic activities, cancerogenic potential, cytotoxic effects, apoptosis induction and flavonoid-drug interaction. Finally, future trends within the research field will be indicated
    • …
    corecore