140 research outputs found

    Investigating heterogeneous protein annotations toward cross-corpora utilization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The number of corpora, collections of structured texts, has been increasing, as a result of the growing interest in the application of natural language processing methods to biological texts. Many named entity recognition (NER) systems have been developed based on these corpora. However, in the biomedical community, there is yet no general consensus regarding named entity annotation; thus, the resources are largely incompatible, and it is difficult to compare the performance of systems developed on resources that were divergently annotated. On the other hand, from a practical application perspective, it is desirable to utilize as many existing annotated resources as possible, because annotation is costly. Thus, it becomes a task of interest to integrate the heterogeneous annotations in these resources.</p> <p>Results</p> <p>We explore the potential sources of incompatibility among gene and protein annotations that were made for three common corpora: GENIA, GENETAG and AIMed. To show the inconsistency in the corpora annotations, we first tackle the incompatibility problem caused by corpus integration, and we quantitatively measure the effect of this incompatibility on protein mention recognition. We find that the F-score performance declines tremendously when training with integrated data, instead of training with pure data; in some cases, the performance drops nearly 12%. This degradation may be caused by the newly added heterogeneous annotations, and cannot be fixed without an understanding of the heterogeneities that exist among the corpora. Motivated by the result of this preliminary experiment, we further qualitatively analyze a number of possible sources for these differences, and investigate the factors that would explain the inconsistencies, by performing a series of well-designed experiments. Our analyses indicate that incompatibilities in the gene/protein annotations exist mainly in the following four areas: the boundary annotation conventions, the scope of the entities of interest, the distribution of annotated entities, and the ratio of overlap between annotated entities. We further suggest that almost all of the incompatibilities can be prevented by properly considering the four aspects aforementioned.</p> <p>Conclusion</p> <p>Our analysis covers the key similarities and dissimilarities that exist among the diverse gene/protein corpora. This paper serves to improve our understanding of the differences in the three studied corpora, which can then lead to a better understanding of the performance of protein recognizers that are based on the corpora.</p

    Lack of robustness of textural measures obtained from 3D brain tumor MRIs impose a need for standardization

    Get PDF
    Purpose Textural measures have been widely explored as imaging biomarkers in cancer. However, their robustness under dynamic range and spatial resolution changes in brain 3D magnetic resonance images (MRI) has not been assessed. The aim of this work was to study potential variations of textural measures due to changes in MRI protocols. Materials and methods Twenty patients harboring glioblastoma with pretreatment 3D T1-weighted MRIs were included in the study. Four different spatial resolution combinations and three dynamic ranges were studied for each patient. Sixteen three-dimensional textural heterogeneity measures were computed for each patient and configuration including co-occurrence matrices (CM) features and run-length matrices (RLM) features. The coefficient of variation was used to assess the robustness of the measures in two series of experiments corresponding to (i) changing the dynamic range and (ii) changing the matrix size. Results No textural measures were robust under dynamic range changes. Entropy was the only textural feature robust under spatial resolution changes (coefficient of variation under 10% in all cases). Conclusion Textural measures of three-dimensional brain tumor images are not robust neither under dynamic range nor under matrix size changes. Standards should be harmonized to use textural features as imaging biomarkers in radiomic-based studies. The implications of this work go beyond the specific tumor type studied here and pose the need for standardization in textural feature calculation of oncological images

    Noncanonical GLI1 signaling promotes stemness features and in vivo growth in lung adenocarcinoma

    Get PDF
    Aberrant Hedgehog/GLI signaling has been implicated in a diverse spectrum of human cancers, but its role in lung adenocarcinoma (LAC) is still under debate. We show that the downstream effector of the Hedgehog pathway, GLI1, is expressed in 76% of LACs, but in roughly half of these tumors, the canonical pathway activator, Smoothened, is expressed at low levels, possibly owing to epigenetic silencing. In LAC cells including the cancer stem cell compartment, we show that GLI1 is activated noncanonically by MAPK/ERK signaling. Different mechanisms can trigger the MAPK/ERK/GLI1 cascade including KRAS mutation and stimulation of NRP2 by VEGF produced by the cancer cells themselves in an autocrine loop or by stromal cells as paracrine cross talk. Suppression of GLI1, by silencing or drug-mediated, inhibits LAC cells proliferation, attenuates their stemness and increases their susceptibility to apoptosis in vitro and in vivo. These findings provide insight into the growth of LACs and point to GLI1 as a downstream effector for oncogenic pathways. Thus, strategies involving direct inhibition of GLI1 may be useful in the treatment of LACs

    p53 and TAp63 promote keratinocyte proliferation and differentiation in breeding tubercles of the zebrafish

    Get PDF
    p63 is a multi-isoform member of the p53 family of transcription factors. There is compelling genetic evidence that ΔNp63 isoforms are needed for keratinocyte proliferation and stemness in the developing vertebrate epidermis. However, the role of TAp63 isoforms is not fully understood, and TAp63 knockout mice display normal epidermal development. Here, we show that zebrafish mutants specifically lacking TAp63 isoforms, or p53, display compromised development of breeding tubercles, epidermal appendages which according to our analyses display more advanced stratification and keratinization than regular epidermis, including continuous desquamation and renewal of superficial cells by derivatives of basal keratinocytes. Defects are further enhanced in TAp63/p53 double mutants, pointing to partially redundant roles of the two related factors. Molecular analyses, treatments with chemical inhibitors and epistasis studies further reveal the existence of a linear TAp63/p53->Notch->caspase 3 pathway required both for enhanced proliferation of keratinocytes at the base of the tubercles and their subsequent differentiation in upper layers. Together, these studies identify the zebrafish breeding tubercles as specific epidermal structures sharing crucial features with the cornified mammalian epidermis. In addition, they unravel essential roles of TAp63 and p53 to promote both keratinocyte proliferation and their terminal differentiation by promoting Notch signalling and caspase 3 activity, ensuring formation and proper homeostasis of this self-renewing stratified epithelium

    Middle East - North Africa and the millennium development goals : implications for German development cooperation

    Get PDF
              Closed-loop controlled combustion is a promising technique to improve the overall performance of internal combustion engines and Diesel engines in particular. In order for this technique to be implemented some form of feedback from the combustion process is required. The feedback signal is processed and from it combustionrelated parameters are computed. These parameters are then fed to a control process which drives a series of outputs (e.g. injection timing in Diesel engines) to control their values. This paper’s focus lies on the processing and computation that is needed on the feedback signal before this is ready to be fed to the control process as well as on the electronics necessary to support it. A number of feedback alternatives are briefly discussed and for one of them, the in-cylinder pressure sensor, the CA50 (crank angle in which the integrated heat release curve reaches its 50% value) and the IMEP (Indicated Mean Effective Pressure) are identified as two potential control variables. The hardware architecture of a system capable of calculating both of them on-line is proposed and necessary feasibility size and speed considerations are made by implementing critical blocks in VHDL targeting a flash-based Actel ProASIC3 automotive-grade FPGA

    �ber die Anwendung der Fehlerrechnung auf die Untersuchung morphologischer Unregelm��igkeiten

    No full text
    corecore