57 research outputs found

    CORENup: a combination of convolutional and recurrent deep neural networks for nucleosome positioning identification

    Get PDF
    Background: Nucleosomes wrap the DNA into the nucleus of the Eukaryote cell and regulate its transcription phase. Several studies indicate that nucleosomes are determined by the combined effects of several factors, including DNA sequence organization. Interestingly, the identification of nucleosomes on a genomic scale has been successfully performed by computational methods using DNA sequence as input data. Results: In this work, we propose CORENup, a deep learning model for nucleosome identification. CORENup processes a DNA sequence as input using one-hot representation and combines in a parallel fashion a fully convolutional neural network and a recurrent layer. These two parallel levels are devoted to catching both non-periodic and periodic DNA string features. A dense layer is devoted to their combination to give a final classification. Conclusions: Results computed on public data sets of different organisms show that CORENup is a state of the art methodology for nucleosome positioning identification based on a Deep Neural Network architecture. The comparisons have been carried out using two groups of datasets, currently adopted by the best performing methods, and CORENup has shown top performance both in terms of classification metrics and elapsed computation time

    Fuzzy Clustering of Histopathological Images Using Deep Learning Embeddings

    Get PDF
    Metric learning is a machine learning approach that aims to learn a new distance metric by increasing (reducing) the similarity of examples belonging to the same (different) classes. The output of these approaches are embeddings, where the input data are mapped to improve a crisp or fuzzy classification process. The deep metric learning approaches regard metric learning, implemented by using deep neural networks. Such models have the advantage to discover very representative nonlinear embeddings. In this work, we propose a triplet network deep metric learning approach, based on ResNet50, to find a representative embedding for the unsupervised fuzzy classification of benign and malignant histopathological images of breast cancer tissues. Experiments computed on the BreakHis benchmark dataset, using Fuzzy C-Means Clustering, show the benefit of using very low dimensional embeddings found by the deep metric learning approach

    An Online Multilingual Medical Vocabulary/Thesaurus/Dictionary (MED-VTD) for Facilitating Understanding of Medical Texts

    Get PDF
    Medical texts (e.g., reports and medicine leaflets) are usually written by professionals (physicians, medical researchers, etc.) who use their own language and communication style. On the other hand, they are often read by health consumers or other medical professionals who do not have the same vocabularies and can have difficulties in text comprehension. Thus, to help a generic user in understanding a medical text, it would be desirable to have an online medical vocabulary/thesaurus/dictionary that he/she can easily look for finding the plain equivalent of any medical (technical) term and a definition of the term with the same kind of language. In this work, we present an online multilingual medical vocabulary/thesaurus/dictionary (MED-VTD) that helps the generic user in understanding any medical text written either in English or in Italian. To this end, we have built an integrated system that uses medical vocabularies for creating a list of medical (technical) terms, consumer health vocabularies (CHVs) for translating the technical terms into their consumer equivalents and consumer dictionaries for finding supplementary information on the terms. MED-VTD contains both English and Italian resources so that a translation of terms between the two languages can also be automatically performed. Moreover, other languages can easily be added providing that the related vocabularies, thesauri and dictionaries are available

    A Multi-Layer Method to Study Genome-Scale Positions of Nucleosomes

    Get PDF
    The basic unit of eukaryotic chromatin is the nucleosome, consisting of about 150 by of DNA wrapped around a protein core made of histone proteins. Nucleosomes position is modulated in vivo to regulate fundamental nuclear processes. To measure nucleosome positions on a genomic scale both theoretical and experimental approaches have been recently reported. We have developed a new method, Multi-Layer Model (MLM), for the analysis of nucleosome position data obtained with microarray-based approach. The MLM is a feature extraction method in which the input data is processed by a classifier to distinguish between several kinds of patterns. We applied our method to simulated-synthetic and experimental nucleosome position data and found that besides a high nucleosome recognition and a strong agreement with standard statistical methods, the MLM can identify distinct classes of nucleosomes, making it an important tool for the genome wide analysis of nucleosome position and function. In conclusion, the MLM allows a better representation of nucleosome position data and a significant reduction in computational time

    Associations between Notch-2, Akt-1 and HER2/neu expression in invasive human breast cancer: a tissue microarray immunophenotypic analysis on 98 patients.

    Get PDF
    Objective: We aimed to investigate the existence of associations between well-established and newly recognized biological and phenotypic features of breast cancer involved in tumor progression and prognosis. Methods: Ninety-eight cases of invasive breast cancer were assessed for the immunohistochemical expression of estrogen and progesterone receptors, Ki-67, HER2, Akt-1, and Notch-2, using the tissue microarray technique. Data regarding tumor histotype, histological grade, tumor size and lymph node status were collected for each patient and included in the analysis. Results: Several significant associations between histological and/or immunophenotypic features came from the analysis of our data. Positive associations were observed between estrogen and progesterone receptors, tumor grade and proliferation index, tumor grade and HER2, Akt-1 and estrogen receptors, and Notch-2 and HER2. Inverse associations were noted between hormone receptors and tumor grade, hormone receptors and HER2, Akt-1 and tumor grade, and Akt-1 and nodal invasion. Conclusions: Our results, showing the existence of a number of estrogen receptor-positive tumors with Akt-1 expression, better degree of differentiation, and no lymph node involvement, along with the presence of HER2- positive tumors with strong Notch-2 expression, support the role of Notch and Akt in breast cancer progression and suggest that they may also represent new appealing therapeutic targets

    Alignment Free Dissimilarities for Nucleosome Classification

    No full text
    Epigenetic mechanisms such as nucleosome positioning, histone modications and DNA methylation play an important role in the regulation of cell type-specic gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have shown a role of DNA sequences in recruitment of epigenetic regulators. For this reason, the use of more suitable similarities or dissimilarity between DNA sequences could help in the context of epigenetic studies. In particular, alignment-free dissimilarities have already been successfully applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic proles. In this work, we focalize the study on the problem of nucleosome classification, providing a benchmark study of 6 alignment free dissimilarity measures between sequences, belonging to the categories of geometricbased, correlation-based, information-based and compression based. Their comparisons have been done versus an alignment based dissimilarity, by measuring the performance of several nearest neighbour classiers that incorporate each one the considered dissimilarities. Results computed on three dataset of nucleosome forming and inhibiting sequences, shows that among the alignment free dissimilarities, the geometric and correlation are the more suitable for the purpose of nucleosome classication, making them a more ecient alternative to the alignment-based similarity measures, which nevertheless are yet the preferred choice when dealing with sequence similarity measurement

    A genetic algorithm for image segmentation

    No full text
    The paper describes a new algorithm for image segmentation. It is based on a genetic approach that allow us to consider the segmentation problem as a global optimization problem (GOP). For this purpose a fitness function, based on the similarity between images, has been defined. The similarity is function of both the intensity and spatial position of pixels. Preliminary result, obtained using real images, show a good performance of the segmentation algorithm

    An integrated fuzzy cells-classifier

    No full text
    This paper introduces a genetic algorithm able to combine different classifiers based on different distance functions. The use of a genetic algorithm is motivated by the fact that the combination phase is based on the optimization of a vote strategy. The method has been applied to the classification of four types of biological cells, results show an improvement of the recognition rate using the genetic algorithm combination strategy compared with the recognition rate of each single classifier

    A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language

    Get PDF
    We present a system aimed at the automatic classification of the sentiment orientation expressed into book reviews written in Italian language. The system we have developed is found on a lexicon-based approach and uses NLP techniques in order to take into account the linguistic relation between terms in the analyzed texts. The classification of a review is based on the average sentiment strenght of its sentences, while the classification of each sentence is obtained through a parsing process inspecting, for each term, a window of previous items to detect particular combinations of elements giving inversions or variations of polarity. The score of a single word depends on all the associated meanings considering also semantically related concepts as synonyms and hyperonims. Concepts associated to words are extracted from a proper stratification of linguistic resources that we adopt to solve the problems of lack of an opinion lexicon specifically tailored on the Italian language. The system has been prototyped by using Python language and it has been tested on a dataset of reviews crawled from Amazon.it, the Italian Amazon website. Experiments show that the proposed system is able to automatically classify both positive and negative reviews, with an average accuracy of above 82%
    • …
    corecore