10,655 research outputs found

    Sparse Proteomics Analysis - A compressed sensing-based approach for feature selection and classification of high-dimensional proteomics mass spectrometry data

    Get PDF
    Background: High-throughput proteomics techniques, such as mass spectrometry (MS)-based approaches, produce very high-dimensional data-sets. In a clinical setting one is often interested in how mass spectra differ between patients of different classes, for example spectra from healthy patients vs. spectra from patients having a particular disease. Machine learning algorithms are needed to (a) identify these discriminating features and (b) classify unknown spectra based on this feature set. Since the acquired data is usually noisy, the algorithms should be robust against noise and outliers, while the identified feature set should be as small as possible. Results: We present a new algorithm, Sparse Proteomics Analysis (SPA), based on the theory of compressed sensing that allows us to identify a minimal discriminating set of features from mass spectrometry data-sets. We show (1) how our method performs on artificial and real-world data-sets, (2) that its performance is competitive with standard (and widely used) algorithms for analyzing proteomics data, and (3) that it is robust against random and systematic noise. We further demonstrate the applicability of our algorithm to two previously published clinical data-sets

    Development of a compact, IoT-enabled electronic nose for breath analysis

    Get PDF
    In this paper, we report on an in-house developed electronic nose (E-nose) for use with breath analysis. The unit consists of an array of 10 micro-electro-mechanical systems (MEMS) metal oxide (MOX) gas sensors produced by seven manufacturers. Breath sampling of end-tidal breath is achieved using a heated sample tube, capable of monitoring sampling-related parameters, such as carbon dioxide (CO2), humidity, and temperature. A simple mobile app was developed to receive real-time data from the device, using Wi-Fi communication. The system has been tested using chemical standards and exhaled breath samples from healthy volunteers, before and after taking a peppermint capsule. Results from chemical testing indicate that we can separate chemical standards (acetone, isopropanol and 1-propanol) and different concentrations of isobutylene. The analysis of exhaled breath samples demonstrate that we can distinguish between pre- and post-consumption of peppermint capsules; area under the curve (AUC): 0.81, sensitivity: 0.83 (0.59–0.96), specificity: 0.72 (0.47–0.90), p-value: <0.001. The functionality of the developed device has been demonstrated with the testing of chemical standards and a simplified breath study using peppermint capsules. It is our intention to deploy this system in a UK hospital in an upcoming breath research study

    Application of a novel tool for diagnosing bile acid diarrhoea

    Get PDF
    Bile acid diarrhoea (BAD) is a common disease that requires expensive imaging to diagnose. We have tested the efficacy of a new method to identify BAD, based on the detection of differences in volatile organic compounds (VOC) in urine headspace of BAD vs. ulcerative colitis and healthy controls. A total of 110 patients were recruited; 23 with BAD, 42 with ulcerative colitis (UC) and 45 controls. Patients with BAD also received standard imaging (Se75HCAT) for confirmation. Urine samples were collected and the headspace analysed using an AlphaMOS Fox 4000 electronic nose in combination with an Owlstone Lonestar Field Asymmetric Ion Mobility Spectrometer (FAIMS). A subset was also tested by gas chromatography, mass spectrometry (GCMS). Linear Discriminant Analysis (LDA) was used to explore both the electronic nose and FAIMS data. LDA showed statistical differences between the groups, with reclassification success rates (using an n-1 approach) at typically 83%. GCMS experiments confirmed these results and showed that patients with BAD had two chemical compounds, 2-propanol and acetamide, that were either not present or were in much reduced quantities in the ulcerative colitis and control samples. We believe that this work may lead to a new tool to diagnose BAD, which is cheaper, quicker and easier that current methods

    Breath analysis using electronic nose and gas chromatography-mass spectrometry: A pilot study on bronchial infections in bronchiectasis

    Get PDF
    Background and aims: In this work, breath samples from clinically stable bronchiectasis patients with and without bronchial infections by Pseudomonas Aeruginosa- PA) were collected and chemically analysed to determine if they have clinical value in the monitoring of these patients. Materials and methods: A cohort was recruited inviting bronchiectasis patients (25) and controls (9). Among the former group, 12 members were suffering PA infection. Breath samples were collected in Tedlar bags and analyzed by e-nose and Gas Chromatography-Mass Spectrometry (GC-MS). The obtained data were analyzed by chemometric methods to determine their discriminant power in regards to their health condition. Results were evaluated with blind samples. Results: Breath analysis by electronic nose successfully separated the three groups with an overall classification rate of 84% for the three-class classification problem. The best discrimination was obtained between control and bronchiectasis with PA infection samples 100% (CI95%: 84-100%) on external validation and the results were confirmed by permutation tests. The discrimination analysis by GC-MS provided good results but did not reach proper statistical significance after a permutation test. Conclusions: Breath sample analysis by electronic nose followed by proper predictive models successfully differentiated between control, Bronchiectasis and Bronchiectasis PA samples

    Epithelial ovarian carcinoma diagnosis by desorption electrospray ionization mass spectrometry imaging

    Get PDF
    Ovarian cancer is highly prevalent among European women, and is the leading cause of gynaecological cancer death. Current histopathological diagnoses of tumour severity are based on interpretation of, for example, immunohistochemical staining. Desorption electrospray mass spectrometry imaging (DESI-MSI) generates spatially resolved metabolic profiles of tissues and supports an objective investigation of tumour biology. In this study, various ovarian tissue types were analysed by DESI-MSI and co-registered with their corresponding haematoxylin and eosin (H&E) stained images. The mass spectral data reveal tissue type-dependent lipid profiles which are consistent across the n = 110 samples (n = 107 patients) used in this study. Multivariate statistical methods were used to classify samples and identify molecular features discriminating between tissue types. Three main groups of samples (epithelial ovarian carcinoma, borderline ovarian tumours, normal ovarian stroma) were compared as were the carcinoma histotypes (serous, endometrioid, clear cell). Classification rates >84% were achieved for all analyses, and variables differing statistically between groups were determined and putatively identified. The changes noted in various lipid types help to provide a context in terms of tumour biochemistry. The classification of unseen samples demonstrates the capability of DESI-MSI to characterise ovarian samples and to overcome existing limitations in classical histopathology

    Diagnóstico no invasivo de patologías humanas combinando análisis de aliento y modelización con redes neuronales

    Get PDF
    Tesis inédita de la Universidad Complutense de Madrid, Facultad de Ciencias Químicas, leída el 09-09-2016It is currently known that there is a direct relation between the moment a disease is detected or diagnosed and the consequences it will have on the patient, as an early detection is generally linked to a more favorable outcome. This concept is the basis of the present research, due to the fact that its main goal is the development of mathematical tools based on computational artificial intelligence to safely and non-invasively attain the detection of multiple diseases. To reach these devices, this research has focused on the breath analysis of patients with diverse diseases, using several analytical methodologies to extract the information contained in these samples, and multiple feature selection algorithms and neural networks for data analysis. In the past, it has been shown that there is a correlation between the molecular composition of breath and the clinical status of a human being, proving the existence of volatile biomarkers that can aid in disease detection depending on their presence or amount. During this research, two main types of analytical approaches have been employed to study the gaseous samples, and these were cross-reactive sensor arrays (based on organically functionalized silicon nanowire field-effect transistors (SiNW FETs) or gold nanoparticles (GNPs)) and proton transfer reaction-mass spectrometry (PTR-MS). The cross-reactive sensors analyze the bulk of the breath samples, offering global, fingerprint-like information, whereas PTR-MS quantifies the volatile molecules present in the samples. All of the analytical equipment employed leads to the generation of large amounts of data per sample, forcing the need of a meticulous mathematical analysis to adequately interpret the results. In this work, two fundamental types of mathematical tools were utilized. In first place, a set of five filter-based feature selection algorithms (χ2 (chi2) score, Fisher’s discriminant ratio, Kruskal-Wallis test, Relief-F algorithm, and information gain test) were employed to reduce the amount of independent in the large databases to the ones which contain the greatest discriminative power for a further modeling task. On the other hand, and in relation to mathematical modeling, artificial neural networks (ANNs), algorithms that are categorized as computational artificial intelligence, have been employed. These non-linear tools have been used to locate the relations between the independent variables of a system and the dependent ones to fulfill estimations or classifications. The type of ANN that has been used in this thesis coincides with the one that is more commonly employed in research, which is the supervised multilayer perceptron (MLP), due to its proven ability to create reliable models for many different applications...Actualmente es sabido que existe una relación directa entre el momento en el cual se detecta o diagnostica una enfermedad y las consecuencias que tendrá sobre el paciente, ya que una detección temprana va generalmente ligada a un desarrollo más favorable. Este concepto es el cimiento de la presente investigación, cuyo objetivo fundamental es el desarrollo de herramientas basadas en inteligencia artificial computacional que consigan, mediante medios seguros y no invasivos, la detección de diversas enfermedades. Para alcanzar dichos sistemas, los estudios han sido enfocados en el análisis de muestras de aliento de pacientes de diversas enfermedades, empleando varias técnicas para extraer información, y diversos algoritmos de selección de variables y redes neuronales para el procesamiento matemático. En el pasado, se ha comprobado que hay una correlación entre la composición molecular del aliento y el estado clínico de una persona, evidenciando la existencia de biomarcadores volátiles que pueden ayudar a detectar enfermedades, ya sea por su presencia o por su cantidad. Durante el transcurso de esta investigación, se han empleado esencialmente dos tipos de técnicas analíticas para estudiar las muestras gaseosas, y estas son conjuntos de sensores de reactividad cruzada (basados en transistores de efecto de campo con nanocables de silicio (SiNW FETs) o en nanopartículas de oro (GNPs), ambos funcionalizados con cadenas orgánicas) y equipos de reacción de transferencia de protones con espectrometría de masas (PTR-MS). Los sensores de reactividad cruzada analizan el aliento en su conjunto, extrayéndose información de la muestra global, mientras que usando PTR-MS, se cuantifican las moléculas volátiles presentes en las muestras analizadas. Todas las técnicas empleadas desembocan en la generación de grandes cantidades de datos por muestra, por lo que un análisis matemático exhaustivo es necesario para poder sacar el máximo rendimiento de los estudios. En este trabajo, se emplearon principalmente dos tipos de herramientas matemáticas. Las primeras son un grupo de cinco algoritmos de selección de variables, concretamente, filtros de variables (cálculos basados en estadística de χ2 (chi2), ratio discriminante de Fisher, análisis de Kruskal-Wallis, algoritmo relief-F y test de ganancia de información), que se han empleado en las bases de datos con grandes cantidades de variables independientes para localizar aquellas con mayor importancia o poder discriminativo para una tarea de modelización matemática posterior. Por otro lado, en cuando a dicha modelización, se ha empleado un tipo de algoritmo que se cataloga dentro del área de la inteligencia artificial computacional: las redes neuronales artificiales (ANNs). Estas herramientas matemáticas de naturaleza no lineal se han utilizado para localizar las relaciones existentes entre las variables independientes de un sistema y las variables dependientes o parámetros a estimar o clasificar. Se ha empleado el tipo de ANN supervisada más extensamente usado en investigación, que son los perceptrones multicapa (MLPs), debido a su habilidad contrastada para originar modelos fiables para numerosas aplicaciones...Fac. de Ciencias QuímicasTRUEunpu

    Improvement of the sensory and autonomous capability of robots through olfaction: the IRO Project

    Get PDF
    Proyecto de Excelencia Junta de Andalucía TEP2012-530Olfaction is a valuable source of information about the environment that has not been su ciently exploited in mobile robotics yet. Certainly, odor information can contribute to other sensing modalities, e.g. vision, to successfully accomplish high-level robot activities, such as task planning or execution in human environments. This paper describes the developments carried out in the scope of the IRO project, which aims at making progress in this direction by investigating mechanisms that exploit odor information (usually coming in the form of the type of volatile and its concentration) in problems like object recognition and scene-activity understanding. A distinctive aspect of this research is the special attention paid to the role of semantics within the robot perception and decisionmaking processes. The results of the IRO project have improved the robot capabilities in terms of efciency, autonomy and usefulness.Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tec
    • …
    corecore