26 research outputs found

    Implementation of an hybrid machine learning methodology for pharmacological modeling

    Get PDF
    Tese de mestrado, Bioinformática e Biologia Computacional (Bioinformática) Universidade de Lisboa, Faculdade de Ciências, 2017Hoje em dia, especialmente na area biomedica, os dados contem milhares de variaveis de fontes diferentes e com apenas algumas instancias ao mesmo tempo. Devido a este facto, as abordagens da aprendizagem automatica enfrentam dois problemas, nomeadamente a questao da integracao de dados heterogeneos e a selecao das caracteristicas. Este trabalho propoe uma solucao eficiente para esta questao e proporciona uma implementacao funcional da metodologia hibrida. A inspiracao para este trabalho veio do desafio proposto no ambito da competicao AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge em 2016, e da solucao vencedora desenvolvida por Yuanfang Guan. Relativamente a motivacao do concurso, e observado que os tratamentos combinatorios para o cancro sao mais eficientes do que as terapias habituais de agente unico, desde que tem potencial para superar as desvantagens dos outros (limitado espetro de acao e desenvolvimento de resistencia). No entanto, o efeito combinatorio de drogas nao e obvio, produzindo possivelmente o resultado aditivo, sinergico ou antagonico. Assim, o objetivo da competicao era prever in vitro a sinergia dos compostos, sem ter acesso aos dados experimentais da terapia combinatoria. No ambito da competicao foram fornecidos ficheiros de varias fontes, contendo o conhecimento farmacologico tanto experimental como obtido de ajustamento das equacoes, a informacao sobre propriedades quimicas e estruturais de drogas, e por fim, os perfis moleculares de celulas, incluindo expressao de RNA, copy variants, sequencia e metilacao de DNA. O trabalho referido envolveu uma abordagem muito bem sucedida de integração dos dados heterogeneos, estendendo o modelo com conhecimento disponivel dentro do projeto The Cancer Cell Line Encyclopedia, e tambem introduzindo o passo decisivo de simulacao que permite imitar o efeito de terapia combinatoria no cancro. Apesar das descricoes pouco claras e da documentacao da solucao vencedora ineficiente, a reproducao da abordagem de Guan foi concluida, tentando ser o mais fiel possivel. A implementacao funcional foi escrita nas linguagens R e Python, e o seu desempenho foi verificado usando como referencia a matriz submetida no concurso. Para melhorar a metodologia, o workflow de selecao dos caracteristicas foi estabelecido e executado usando o algoritmo Lasso. Alem disso, o desempenho de dois metodos alternativos de modelacao foi experimentado, incluindo Support Vector Machine and Multivariate Adaptive Regression Splines (MARS). Varias versoes da equacao de integracao foram consideradas permitindo a determinacao de coeficientes aparentemente otimos. Como resultado, a compreensao da melhor solucao de competição foi desenvolvida e a implementacao funcional foi construida com sucesso. As melhorias foram propostas e no efeito o algoritmo SVM foi verificado como capaz de superar os outros na resolução deste problema, a equacao de integracao com melhor desempenho foi estabelecida e finalmente a lista de 75 variaveis moleculares mais informativas foi fornecida. Entre estes genes, poderiam ser encontrados possiveis candidatos de biomarcadores de cancro.Nowadays, especially in the biomedical field, the data sets usually contain thousands of multi-source variables and with only few instances in the same time. Due to this fact, Machine Learning approaches face two problems, namely the issue of heterogenous data integration and the feature selection. This work proposes an efficient solution for this question and provides a functional implementation of the hybrid methodology. The inspiration originated from the AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge from 2016 and the winning solution by Yuanfang Guan. Regarding to the motivation of competition, the combinatory cancer treatments are believed to be more effective than standard single-agent therapies since they have a potential to overcome others weaknesses (narrow spectrum of action and development of the resistance). However, the combinatorial drug effect is not obvious bringing possibly additive, synergistic or antagonistic treatment result. Thus, the goal of the competition was to predict in vitro compound synergy, without the access to the experimental combinatory therapy data. Within the competition, the multi-source files were supplied, encompassing the pharmacological knowledge from experiments and equation-fitting, the information on chemical properties and structure of drugs, finally the molecular cell profiles including RNA expression, copy variants, DNA sequence and methylation. The referred work included very successful approach of heterogenous data integration, extending additionally the model with prior knowledge outsourced from The Cancer Cell Line Encyclopedia, as well as introduced a key step of simulation that allows to imitate effect of a combinatory therapy on cancer. Despite unexplicit descriptions and poor documentation of the winning solution, as accurate as possible, reproduction of Guan’s approach was accomplished. The functional implementation was written in R and Python languages, and its performance was verified using as a reference the submitted in challenge prediction matrix. In order to improve the methodology feature selection workflow was established and run using a Lasso algorithm. Moreover, the performance of two alternative modeling methods was experimented including Support Vector Machine and Multivariate Adaptive Regression Splines (MARS). Several versions of merging equation were considered allowing determination of apparently optimal coefficients. As the result, the understanding of the best challenge solution was developed and the functional implementation was successfully constructed. The improvements were proposed and in the effect the SVM algorithm was verified to surpass others in solving this problem, the best-performing merging equation was established, and finally the list of 75 most informative molecular variables was provided. Among those genes, potential cancer biomarker candidates could be found

    Epigenetic Landscape of Pain in Diabetic Neuropathy

    Get PDF
    The available treatments for neuropathic pain are still unsatisfactory – they cope with low efficiency and serious side effects. To our knowledge, this is a pioneering study evaluating whole-genome DNA methylation in unique populations of type 2 diabetes mellitus patients that were histopalogically diagnosed with diabetic neuropathy, with or without neuropathic pain. We provided an evidence on the significant differences in methylation patterns between painful and painless phenotypes. Epigenomes of patients from two independent cohorts were assessed in whole blood samples with Infinium Methylation EPIC BeadChip. We performed differential analysis and identified epigenetic signals that highlight the dissimilarities between painful and painless subjects. We estimated epigenetic age of the patients and evaluated eventual acceleration of biological age in painful and painless phenotypes using set of epigenetic predictors. With the differential analysis we identified 27 CpG sites that reached the level of statistical significance in both studied cohorts, presented the methylation change between painful and painless diabetic neuropathy > 1% in one of the populations and had the direction of methylation change concordant between the two cohorts. 19 of selected probes were genic and resulted in a list of 19 unique genes. Multidimensional scaling analysis confirmed the potential of generated set of CpG sites to separate painful and painless subjects and to highlight the dissimilarities between two phenotypes. Evaluation of biological age showed that there was no association between painful phenotype and acceleration of biological age expressed by any of the assessed epigenetic clocks. DNA methylation based prediction of telomere length was found to vary between painful and painless groups in both studied cohorts. Obtained results confirmed the presence of epigenetic differences between painful and painless diabetic neuropathy patients. Promising genes were identified that may be linked to neuropathic pain through DNA methylation mechanisms

    The Polish version of the Juvenile Arthritis Multidimensional Assessment Report (JAMAR)

    Get PDF
    The Juvenile Arthritis Multidimensional Assessment Report (JAMAR) is a new parent/patient-reported outcome measure that enables a thorough assessment of the disease status in children with juvenile idiopathic arthritis (JIA). We report the results of the cross-cultural adaptation and validation of the parent and patient versions of the JAMAR in the Polish language. The reading comprehension of the questionnaire was tested in 10 JIA parents and patients. Each participating centre was asked to collect demographic, clinical data and the JAMAR in 100 consecutive JIA patients or all consecutive patients seen in a 6-month period and to administer the JAMAR to 100 healthy children and their parents. The statistical validation phase explored descriptive statistics and the psychometric issues of the JAMAR: the 3 Likert assumptions, floor/ceiling effects, internal consistency, Cronbach\u2019s alpha, interscale correlations, test\u2013retest reliability, and construct validity (convergent and discriminant validity). A total of 154 JIA patients (10.4% systemic, 50.0% oligoarticular, 24.7% RF-negative polyarthritis, 14.9% other categories) and 91 healthy children, were enrolled in two centres. The JAMAR components discriminated well healthy subjects from JIA patients. All JAMAR components revealed good psychometric performances. In conclusion, the Polish version of the JAMAR is a valid tool for the assessment of children with JIA and is suitable for use both in routine clinical practice and clinical research

    TP53 polymorphism in plasma cell myeloma

    Get PDF
    Introduction. Significant and accessible predictive factors for bortezomib treatment in plasma cell myeloma (PCM) are still lacking. TP53 codon 72 polymorphism (P72R) results in proline (P) or arginine (R) at 72 amino acid position, which causes synthesis of proteins with distinct functions. The aims of our study were to: 1) analyze whether this polymorphism is associated with an increased risk of PCM; 2) study whether the P72R polymorphism affects overall survival (OS) among PCM patients; 3) assess the possible association of the P72R polymorphism with sensitivity to bortezomib in cell cultures derived from PCM patients. Material and methods. Genomic DNA from newly diagnosed 59 patients (without IgVH gene rearrangements and TP53 deletions) and 50 healthy blood donors were analyzed by RFLP-PCR to identify TP53 polymorphism. Chromosomal aberrations were detected by use of cIg-FISH. The lymphocyte cell cultures from a subgroup of 40 PCM patients were treated with bortezomib (1, 2 and 4 nM). Results. The P allele of the P72R polymorphism was more common than the R allele in PMC patients compared to controls (39% vs. 24%), and the difference was significant (p = 0.02). The PP and PR genotypes (in combina­tion) were more frequent among cases than in controls (65% vs. 42%, OR = 2.32, p = 0.04). At the cell culture level and 2 nM bortezomib concentration the PP genotype was associated with higher necrosis rates (10.5%) compared to the PR genotype (5.7%, p = 0.006) or the RR genotype (6.3%, p = 0.02); however, no effect of genotypes was observed at bortezomib concentrations of 1 and 4 nM. The shortest OS (12 months) was observed in patients with the PP genotype compared to patients with the PR or RR genotypes (20 months) (p = 0.04). Conclusions. The results suggest that P72R polymorphisms may be associated with an increased PCM risk and may affect OS of PCM patients. However, we saw no consistent results of the polymorphism effect on apoptosis and necrosis in cell cultures derived from PCM patients. Further studies are need in this regard

    Children and adolescents with pulmonary arterial hypertension : baseline and follow-up data from the polish registry of pulmonary hypertension (BNP-PL)

    Get PDF
    We present the results from the pediatric arm of the Polish Registry of Pulmonary Hypertension. We prospectively enrolled all pulmonary arterial hypertension (PAH) patients, between the ages of 3 months and 18 years, who had been under the care of each PAH center in Poland between 1 March 2018 and 30 September 2018. The mean prevalence of PAH was 11.6 per million, and the estimated incidence rate was 2.4 per million/year, but it was geographically heterogeneous. Among 80 enrolled children (females, n = 40; 50%), 54 (67.5%) had PAH associated with congenital heart disease (CHD-PAH), 25 (31.25%) had idiopathic PAH (IPAH), and 1 (1.25%) had portopulmonary PAH. At the time of enrolment, 31% of the patients had significant impairment of physical capacity (WHO-FC III). The most frequent comorbidities included shortage of growth (n = 20; 25%), mental retardation (n = 32; 40%), hypothyroidism (n = 19; 23.8%) and Down syndrome (n = 24; 30%). The majority of children were treated with PAH-specific medications, but only half of them with double combination therapy, which improved after changing the reimbursement policy. The underrepresentation of PAH classes other than IPAH and CHD-PAH, and the geographically heterogeneous distribution of PAH prevalence, indicate the need for building awareness of PAH among pediatricians, while a frequent coexistence of PAH with other comorbidities calls for a multidisciplinary approach to the management of PAH children

    Implementation of an hybrid machine learning methodology for pharmacological modeling

    No full text
    Tese de mestrado, Bioinformática e Biologia Computacional (Bioinformática) Universidade de Lisboa, Faculdade de Ciências, 2017Hoje em dia, especialmente na area biomedica, os dados contem milhares de variaveis de fontes diferentes e com apenas algumas instancias ao mesmo tempo. Devido a este facto, as abordagens da aprendizagem automatica enfrentam dois problemas, nomeadamente a questao da integracao de dados heterogeneos e a selecao das caracteristicas. Este trabalho propoe uma solucao eficiente para esta questao e proporciona uma implementacao funcional da metodologia hibrida. A inspiracao para este trabalho veio do desafio proposto no ambito da competicao AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge em 2016, e da solucao vencedora desenvolvida por Yuanfang Guan. Relativamente a motivacao do concurso, e observado que os tratamentos combinatorios para o cancro sao mais eficientes do que as terapias habituais de agente unico, desde que tem potencial para superar as desvantagens dos outros (limitado espetro de acao e desenvolvimento de resistencia). No entanto, o efeito combinatorio de drogas nao e obvio, produzindo possivelmente o resultado aditivo, sinergico ou antagonico. Assim, o objetivo da competicao era prever in vitro a sinergia dos compostos, sem ter acesso aos dados experimentais da terapia combinatoria. No ambito da competicao foram fornecidos ficheiros de varias fontes, contendo o conhecimento farmacologico tanto experimental como obtido de ajustamento das equacoes, a informacao sobre propriedades quimicas e estruturais de drogas, e por fim, os perfis moleculares de celulas, incluindo expressao de RNA, copy variants, sequencia e metilacao de DNA. O trabalho referido envolveu uma abordagem muito bem sucedida de integração dos dados heterogeneos, estendendo o modelo com conhecimento disponivel dentro do projeto The Cancer Cell Line Encyclopedia, e tambem introduzindo o passo decisivo de simulacao que permite imitar o efeito de terapia combinatoria no cancro. Apesar das descricoes pouco claras e da documentacao da solucao vencedora ineficiente, a reproducao da abordagem de Guan foi concluida, tentando ser o mais fiel possivel. A implementacao funcional foi escrita nas linguagens R e Python, e o seu desempenho foi verificado usando como referencia a matriz submetida no concurso. Para melhorar a metodologia, o workflow de selecao dos caracteristicas foi estabelecido e executado usando o algoritmo Lasso. Alem disso, o desempenho de dois metodos alternativos de modelacao foi experimentado, incluindo Support Vector Machine and Multivariate Adaptive Regression Splines (MARS). Varias versoes da equacao de integracao foram consideradas permitindo a determinacao de coeficientes aparentemente otimos. Como resultado, a compreensao da melhor solucao de competição foi desenvolvida e a implementacao funcional foi construida com sucesso. As melhorias foram propostas e no efeito o algoritmo SVM foi verificado como capaz de superar os outros na resolução deste problema, a equacao de integracao com melhor desempenho foi estabelecida e finalmente a lista de 75 variaveis moleculares mais informativas foi fornecida. Entre estes genes, poderiam ser encontrados possiveis candidatos de biomarcadores de cancro.Nowadays, especially in the biomedical field, the data sets usually contain thousands of multi-source variables and with only few instances in the same time. Due to this fact, Machine Learning approaches face two problems, namely the issue of heterogenous data integration and the feature selection. This work proposes an efficient solution for this question and provides a functional implementation of the hybrid methodology. The inspiration originated from the AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge from 2016 and the winning solution by Yuanfang Guan. Regarding to the motivation of competition, the combinatory cancer treatments are believed to be more effective than standard single-agent therapies since they have a potential to overcome others weaknesses (narrow spectrum of action and development of the resistance). However, the combinatorial drug effect is not obvious bringing possibly additive, synergistic or antagonistic treatment result. Thus, the goal of the competition was to predict in vitro compound synergy, without the access to the experimental combinatory therapy data. Within the competition, the multi-source files were supplied, encompassing the pharmacological knowledge from experiments and equation-fitting, the information on chemical properties and structure of drugs, finally the molecular cell profiles including RNA expression, copy variants, DNA sequence and methylation. The referred work included very successful approach of heterogenous data integration, extending additionally the model with prior knowledge outsourced from The Cancer Cell Line Encyclopedia, as well as introduced a key step of simulation that allows to imitate effect of a combinatory therapy on cancer. Despite unexplicit descriptions and poor documentation of the winning solution, as accurate as possible, reproduction of Guan’s approach was accomplished. The functional implementation was written in R and Python languages, and its performance was verified using as a reference the submitted in challenge prediction matrix. In order to improve the methodology feature selection workflow was established and run using a Lasso algorithm. Moreover, the performance of two alternative modeling methods was experimented including Support Vector Machine and Multivariate Adaptive Regression Splines (MARS). Several versions of merging equation were considered allowing determination of apparently optimal coefficients. As the result, the understanding of the best challenge solution was developed and the functional implementation was successfully constructed. The improvements were proposed and in the effect the SVM algorithm was verified to surpass others in solving this problem, the best-performing merging equation was established, and finally the list of 75 most informative molecular variables was provided. Among those genes, potential cancer biomarker candidates could be found

    Neutrophil extracellular traps generation and degradation in patients with granulomatosis with polyangiitis and systemic lupus erythematosus

    No full text
    Neutrophils are one of the first cells to arrive at the site of infection, where they apply several strategies to kill pathogens: degranulation, respiratory burst, phagocytosis, and release of neutrophil extracellular traps (NETs). Recent discoveries try to connect NETs formation with autoimmune diseases, like systemic lupus erythematosus (SLE) or granulomatosis with polyangiitis (GPA) and place them among one of the factors responsible for disease pathogenesis. The aim of the study was to assess the NETotic capabilities of neutrophils obtained from freshly diagnosed autoimmune patients versus healthy controls. Further investigation involved assessing NETs production among treated patients. In the latter step, NETs degradation potency of collected sera from non-treated patients was checked. Lastly, the polymorphisms of the DNASE I gene among tested subjects were checked. NETs formation was measured in a neutrophil culture by fluorometry, while degradation assessment was performed with patients’ sera and extracellular source of DNA. Additionally, Sanger sequencing was used to check potential SNP mutations between patients. About 121 subjects were enrolled into this study, 54 of them with a diagnosed autoimmune disorder. Neutrophils stimulated with NETosis inducers were able to release NETs in all cases. We have found that disease affected patients produce NETs more rapidly and in larger quantities than control groups, with up to 82.5% more released. Most importantly, we showed a difference between the diseases themselves. NETs release was 68.5% higher in GPA samples when compared to SLE ones while stimulated with Calcium Ionophore. Serum nucleases were less effective at degrading NETs in both autoimmune diseases, with a reduction in degradation of 20.9% observed for GPA and 18.2% for SLE when compared with the controls. Potential therapies targeting neutrophils and NETs should be specifically tailored to the type of the disease. Since there are significant differences between NETs release and disease type, a standard neutrophil targeted therapy could prevent over-generation of traps in some cases, while in others it would deplete the cells, leaving the immune system unresponsive to primary infections
    corecore