5,080 research outputs found
A novel kernel based approach to arbitrary length symbolic data with application to type 2 diabetes risk
Predictive modeling of clinical data is fraught with challenges arising from the manner in which events are recorded. Patients typically fall ill at irregular intervals and experience dissimilar intervention trajectories. This results in irregularly sampled and uneven length data which poses a problem for standard multivariate tools. The alternative of feature extraction into equal-length vectors via methods like Bag-of-Words (BoW) potentially discards useful information. We propose an approach based on a kernel framework in which data is maintained in its native form: discrete sequences of symbols. Kernel functions derived from the edit distance between pairs of sequences may then be utilized in conjunction with support vector machines to classify the data. Our method is evaluated in the context of the prediction task of determining patients likely to develop type 2 diabetes following an earlier episode of elevated blood pressure of 130/80 mmHg. Kernels combined via multi kernel learning achieved an F1-score of 0.96, outperforming classification with SVM 0.63, logistic regression 0.63, Long Short Term Memory 0.61 and Multi-Layer Perceptron 0.54 applied to a BoW representation of the data. We achieved an F1-score of 0.97 on MKL on external dataset. The proposed approach is consequently able to overcome limitations associated with feature-based classification in the context of clinical data
Automated screening of MRI brain scanning using grey level statistics
This paper describes the development of an algorithm for detecting and classifying MRI brain slices into normal and abnormal by relying on prior-knowledge, that the two hemispheres of a healthy brain have approximately a bilateral symmetry. We use the modified grey level co-occurrence matrix method to analyze and measure asymmetry between the two brain hemispheres. 21 co-occurrence statistics are used to discriminate the images. The experimental results demonstrate the efficacy of our proposed algorithm in detecting brain abnormality with high accuracy and low computational time. The dataset used in the experiment comprises 165 patients with 88 patients having different brain abnormalities whilst the remainder do not exhibit any detectable pathology. The algorithm was tested using a ten-fold cross-validation technique with 100 repetitions to avoid the result depending on the sample order. The maximum accuracy achieved for the brain tumours detection was 97.8% using a Multi-Layer Perceptron Neural Network
Design and evaluation of an antenna applicator for a microwave colonoscopy system
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.This paper presents a design of a compact antenna applicator for a microwave colonoscopy system. Although colonoscopy is the most effective method for colorectal cancer detection, it suffers from important visualization restrictions that limit its performance. We recently reported that the contrast between healthy mucosa and cancer was 30%-100% for the relative permittivity and conductivity, respectively, at 8 GHz, and the complex permittivity increased proportionally to the degeneration rate of polyps (cancer precursors). The applicator is designed as a compact cylindrical array of eight antennas attached at the tip of a conventional colonoscope. The design presented here is a proof-of-concept applicator composed by one transmitting and one receiving cavity-backed U-shaped slot antenna elements fed by an L-shaped microstrip line. The antennas are low profile and present a high isolation at 8 GHz. The antenna performance is assessed with simulations and experimentally with a phantom composed by different liquids.Peer ReviewedPostprint (author's final draft
Structural Pattern Recognition for Chemical-Compound Virtual Screening
Les molècules es configuren de manera natural com a xarxes, de manera que són ideals per estudiar utilitzant les seves representacions grà fiques, on els nodes representen à toms i les vores representen els enllaços quÃmics. Una alternativa per a aquesta representació directa és el grà fic reduït ampliat, que resumeix les estructures quÃmiques mitjançant descripcions de nodes de tipus farmacòfor per codificar les propietats moleculars rellevants. Un cop tenim una manera adequada de representar les molècules com a grà fics, hem de triar l’eina adequada per comparar-les i analitzar-les. La distà ncia d'edició de grà fics s'utilitza per resoldre la concordança de grà fics tolerant als errors; aquesta metodologia calcula la distà ncia entre dos grà fics determinant el nombre mÃnim de modificacions necessà ries per transformar un grà fic en l’altre. Aquestes modificacions (conegudes com a operacions d’edició) tenen associat un cost d’edició (també conegut com a cost de transformació), que s’ha de determinar en funció del problema. Aquest estudi investiga l’eficà cia d’una comparació molecular basada només en grà fics que utilitza grà fics reduïts ampliats i distà ncia d’edició de grà fics com a eina per a aplicacions de cribratge virtual basades en lligands. Aquestes aplicacions estimen la bioactivitat d'una substà ncia quÃmica que utilitza la bioactivitat de compostos similars. Una part essencial d’aquest estudi es centra en l’ús d’aprenentatge automà tic i tècniques de processament del llenguatge natural per optimitzar els costos de transformació utilitzats en les comparacions moleculars amb la distà ncia d’edició de grà fics.Las moléculas tienen la forma natural de redes, lo que las hace ideales para estudiar mediante el empleo de sus representaciones gráficas, donde los nodos representan los átomos y los bordes representan los enlaces quÃmicos. Una alternativa para esta representación sencilla es el gráfico reducido extendido, que resume las estructuras quÃmicas utilizando descripciones de nodos de tipo farmacóforo para codificar las propiedades moleculares relevantes. Una vez que tenemos una forma adecuada de representar moléculas como gráficos, debemos elegir la herramienta adecuada para compararlas y analizarlas. La distancia de edición de gráficos se utiliza para resolver la coincidencia de gráficos tolerante a errores; esta metodologÃa estima una distancia entre dos gráficos determinando el número mÃnimo de modificaciones necesarias para transformar un gráfico en el otro. Estas modificaciones (conocidas como operaciones de edición) tienen un costo de edición (también conocido como costo de transformación) asociado, que debe determinarse en función del problema. Este estudio investiga la efectividad de una comparación molecular basada solo en gráficos que emplea gráficos reducidos extendidos y distancia de edición de gráficos como una herramienta para aplicaciones de detección virtual
basadas en ligandos. Estas aplicaciones estiman la bioactividad de una sustancia quÃmica empleando la bioactividad de compuestos similares. Una parte esencial de este estudio se centra en el uso de técnicas de procesamiento de lenguaje natural y aprendizaje automático para optimizar los costos de transformación utilizados en las comparaciones moleculares con la distancia de edición de gráficos.Molecules are naturally shaped as networks, making them ideal for studying by employing their graph representations, where nodes represent atoms and edges represent the chemical bonds. An alternative for this straightforward representation is the extended reduced graph, which summarizes the chemical structures using pharmacophore-type node descriptions to encode the relevant molecular properties. Once we have a suitable way to represent molecules as graphs, we need to choose the right tool to compare and analyze them. Graph edit distance is used to solve the error-tolerant graph matching; this methodology estimates a distance between two graphs by determining the minimum number of modifications required to transform one graph into the other. These modifications (known as edit operations) have an edit cost (also known as transformation cost) associated, which must be determined depending on the problem.
This study investigates the effectiveness of a graph-only driven molecular comparison employing extended reduced graphs and graph edit distance as a tool for ligand-based virtual screening applications. Those applications estimate the bioactivity of a chemical employing the bioactivity of similar compounds. An essential part of this study focuses on using machine learning and natural language processing techniques to optimize the transformation costs used in the molecular comparisons with the graph edit distance.
Overall, this work shows a framework that combines graph reduction and comparison with optimization tools and natural language processing to identify bioactivity similarities in a structurally diverse group of molecules. We confirm the efficiency of this framework with several chemoinformatic tests applied to regression and classification problems over different publicly available datasets
Imaging-based representation and stratification of intra-tumor heterogeneity via tree-edit distance
Personalized medicine is the future of medical practice. In oncology, tumor heterogeneity assessment represents a pivotal step for effective treatment planning and prognosis prediction. Despite new procedures for DNA sequencing and analysis, non-invasive methods for tumor characterization are needed to impact on daily routine. On purpose, imaging texture analysis is rapidly scaling, holding the promise to surrogate histopathological assessment of tumor lesions. In this work, we propose a tree-based representation strategy for describing intra-tumor heterogeneity of patients affected by metastatic cancer. We leverage radiomics information extracted from PET/CT imaging and we provide an exhaustive and easily readable summary of the disease spreading. We exploit this novel patient representation to perform cancer subtyping according to hierarchical clustering technique. To this purpose, a new heterogeneity-based distance between trees is defined and applied to a case study of prostate cancer. Clusters interpretation is explored in terms of concordance with severity status, tumor burden and biological characteristics. Results are promising, as the proposed method outperforms current literature approaches. Ultimately, the proposed method draws a general analysis framework that would allow to extract knowledge from daily acquired imaging data of patients and provide insights for effective treatment planning
Imaging-based representation and stratification of intra-tumor heterogeneity via tree-edit distance
Personalized medicine is the future of medical practice. In oncology, tumor heterogeneity assessment represents a pivotal step for effective treatment planning and prognosis prediction. Despite new procedures for DNA sequencing and analysis, non-invasive methods for tumor characterization are needed to impact on daily routine. On purpose, imaging texture analysis is rapidly scaling, holding the promise to surrogate histopathological assessment of tumor lesions. In this work, we propose a tree-based representation strategy for describing intra-tumor heterogeneity of patients affected by metastatic cancer. We leverage radiomics information extracted from PET/CT imaging and we provide an exhaustive and easily readable summary of the disease spreading. We exploit this novel patient representation to perform cancer subtyping according to hierarchical clustering technique. To this purpose, a new heterogeneity-based distance between trees is defined and applied to a case study of prostate cancer. Clusters interpretation is explored in terms of concordance with severity status, tumor burden and biological characteristics. Results are promising, as the proposed method outperforms current literature approaches. Ultimately, the proposed method draws a general analysis framework that would allow to extract knowledge from daily acquired imaging data of patients and provide insights for effective treatment planning
- …