1,454 research outputs found

    Decoding Sequence Classification Models for Acquiring New Biological Insights

    Get PDF
    Classifying biological sequences is one of the most important tasks in computational biology. In the last decade, support vector machines (SVMs) in combination with sequence kernels have emerged as a de-facto standard. These methods are theoretically well-founded, reliable, and provide high-accuracy solutions at low computational cost. However, obtaining a highly accurate classifier is rarely the end of the story in many practical situations. Instead, one often aims to acquire biological knowledge about the principles underlying a given classification task. SVMs with traditional sequence kernels do not offer a straightforward way of accessing this knowledge.

In this contribution, we propose a new approach to analyzing biological sequences on the basis of support vector machines with sequence kernels. We first extract explicit pattern weights from a given SVM. When classifying a sequence, we then compute a prediction profile by distributing the weight of each pattern to the sequence positions that match the pattern. The final profile not only allows assessing the importance of a position, but also determining for which class it is indicative. Since it is unfeasible to analyze profiles of all sequences in a given data set, we advocate using affinity propagation (AP) clustering to narrow down the analysis to a small set of typical sequences.

The proposed approach is applicable to a wide range of biological sequences and a wide selection of sequence kernels. To illustrate our framework, we present the prediction of oligomerization tendencies of coiled coil proteins as a case study.
&#xa

    Crenças, aceitação e atitudes dos utentes perante os medicamentos genéricos: um estudo comparativo entre Portugal e Estónia

    Get PDF
    Medicamento genérico (MG) é definido como uma fiel imitação de um medicamento original, terapeuticamente equivalente apresentando a mesma forma farmacêutica, composição qualitativa e quantitativa destinado a ser intercambiável com o produto original. Os MGs só podem ser comercializados depois de todas as patentes e certificados complementares de protecção (SPCs) que cobrem o produto original terem expirado. O papel dos MGs tem sido providenciar medicamentos essenciais que são de boa qualidade e de preço acessível em toda a União Europeia e o seu uso aumentou a acessibilidade dos pacientes e proporcionou uma poupança económica significativa para os sistemas de saúde. À medida que as despesas totais em cuidados de saúde têm vindo a aumentar e a maioria dessas despesas é composta de custos fixos (nomeadamente os serviços hospitalares), a indústria farmacêutica tem sido um objectivo de poupança em todos os países da Europa, que têm reformulado os seus sistemas nacionais de saúde de modo a responder ao rápido crescimento dos gastos em saúde. Os governos preocupados com o aumento do custo de produtos farmacêuticos dentro dos seus orçamentos nacionais de saúde, estão a esforçar-se para promover a utilização de genéricos em relação aos produtos originais de preço mais elevado. Portugal e Estónia são dois países pertencentes à União Europeia. Existem algumas diferenças no sector da saúde entre os dois países, especialmente no que concerne a medicamentos, seus preços e reembolso pelos sistemas de seguro obrigatório de saúde e serviços nacionais de saúde, no entanto apresentam em comum a preocupação com o custo dos medicamentos, incentivando o uso de MGs. Actualmente a Estónia apresenta uma quota de mercado de MGs superior a Portugal, que ocupa uma posição inferior à média Europeia. À medida que os sistemas governamentais vão incentivando o uso de MGs e o seu consumo vai aumentando é importante perceber as opiniões que os consumidores têm acerca destes medicamentos. Este estudo teve como objectivo avaliar a aceitação e as crenças dos utentes sobre MGs em relação aos medicamentos de marca (MM), comparando resultados entre Portugal e Estónia

    Oil spill Hazard maps

    Get PDF
    This report contains the description of the methodology to produce coastal oil spill hazard mapping for the Atlantic Ocean coastlines and the description of the Web Portal used to disseminate the informatio

    09081 Abstracts Collection -- Similarity-based learning on structures

    Get PDF
    From 15.02. to 20.02.2009, the Dagstuhl Seminar 09081 ``Similarity-based learning on structures \u27\u27 was held in Schloss Dagstuhl~--~Leibniz Center for Informatics. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available

    DeepSynergy: predicting anti-cancer drug synergy with Deep Learning.

    Get PDF
    MOTIVATION: While drug combination therapies are a well-established concept in cancer treatment, identifying novel synergistic combinations is challenging due to the size of combinatorial space. However, computational approaches have emerged as a time- and cost-efficient way to prioritize combinations to test, based on recently available large-scale combination screening data. Recently, Deep Learning has had an impact in many research areas by achieving new state-of-the-art model performance. However, Deep Learning has not yet been applied to drug synergy prediction, which is the approach we present here, termed DeepSynergy. DeepSynergy uses chemical and genomic information as input information, a normalization strategy to account for input data heterogeneity, and conical layers to model drug synergies. RESULTS: DeepSynergy was compared to other machine learning methods such as Gradient Boosting Machines, Random Forests, Support Vector Machines and Elastic Nets on the largest publicly available synergy dataset with respect to mean squared error. DeepSynergy significantly outperformed the other methods with an improvement of 7.2% over the second best method at the prediction of novel drug combinations within the space of explored drugs and cell lines. At this task, the mean Pearson correlation coefficient between the measured and the predicted values of DeepSynergy was 0.73. Applying DeepSynergy for classification of these novel drug combinations resulted in a high predictive performance of an AUC of 0.90. Furthermore, we found that all compared methods exhibit low predictive performance when extrapolating to unexplored drugs or cell lines, which we suggest is due to limitations in the size and diversity of the dataset. We envision that DeepSynergy could be a valuable tool for selecting novel synergistic drug combinations. AVAILABILITY AND IMPLEMENTATION: DeepSynergy is available via www.bioinf.jku.at/software/DeepSynergy. CONTACT: [email protected]. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

    Complement C3 variant and the risk of age-related macular degeneration

    Get PDF
    Background: Age-related macular degeneration is the most common cause of blindness in Western populations. Susceptibility is influenced by age and by genetic and environmental factors. Complement activation is implicated in the pathogenesis.Methods: We tested for an association between age-related macular degeneration and 13 single-nucleotide polymorphisms (SNPs) spanning the complement genes C3 and C5 in case subjects and control subjects from the southeastern region of England. All subjects were examined by an ophthalmologist and had independent grading of fundus photographs to confirm their disease status. To test for replication of the most significant findings, we genotyped a set of Scottish cases and controls.Results: The common functional polymorphism rs2230199 (Arg80Gly) in the C3 gene, corresponding to the electrophoretic variants C3S (slow) and C3F (fast), was strongly associated with age-related macular degeneration in both the English group (603 cases and 350 controls, P=5.9 x 10(sup -5)) and the Scottish group (244 cases and 351 controls, P=5.0 x 10(sup -5)). The odds ratio for age-related macular degeneration in C3 S/F heterozygotes as compared with S/S homozygotes was 1.7 (95% confidence interval [CI], 1.3 to 2.1); for F/F homozygotes, the odds ratio was 2.6 (95% CI, 1.6 to 4.1). The estimated population attributable risk for C3F was 22%.Conclusions: Complement C3 is important in the pathogenesis of age-related macular degeneration. This finding further underscores the influence of the complement pathway in the pathogenesis of this disease

    COST 733 - WG4: Applications of weather type classification

    Get PDF
    The main objective of the COST Action 733 is to achieve a general numerical method for assessing, comparing and classifying typical weather situations in the European regions. To accomplish this goal, different workgroups are established, each with their specific aims: WG1: Existing methods and applications (finished); WG2: Implementation and development of weather types classification methods; WG3: Comparison of selected weather types classifications; WG4: Testing methods for various applications. The main task of Workgroup 4 (WG4) in COST 733 implies the testing of the selected weather type methods for various classifications. In more detail, WG4 focuses on the following topics:• Selection of dedicated applications (using results from WG1), • Performance of the selected applications using available weather types provided by WG2, • Intercomparison of the application results as a results of different methods • Final assessment of the results and uncertainties, • Presentation and release of results to the other WGs and external interested • Recommend specifications for a new (common) method WG2 Introduction In order to address these specific aims, various applications are selected and WG4 is divided in subgroups accordingly: 1.Air quality 2. Hydrology (& Climatological mapping) 3. Forest fires 4. Climate change and variability 5. Risks and hazards Simultaneously, the special attention is paid to the several wide topics concerning some other COST Actions such as: phenology (COST725), biometeorology (COST730), agriculture (COST 734) and mesoscale modelling and air pollution (COST728). Sub-groups are established to find advantages and disadvantages of different classification methods for different applications. Focus is given to data requirements, spatial and temporal scale, domain area, specifi

    Probing scalar particle and unparticle couplings in e+ e- -> t tbar with transversely polarized beams

    Full text link
    In searching for indications of new physics scalar particle and unparticle couplings in e^+ e^- \to t\bar t, we consider the role of transversely polarized initial beams at e^+ e^- colliders. By using a general relativistic spin density matrix formalism for describing the particles spin states, we find analytical expressions for the squared amplitude of the process with t or \bar t polarization measured, including the anomalous coupling contributions. Thanks to the transversely polarized initial beams these contributions are first order anomalous coupling corrections to the Standard Model (SM) contributions. We present and analyse the main features of the SM and anomalous coupling contributions. We show how differences between SM and anomalous coupling contributions provide means to search for anomalous coupling manifestations at future e^+ e^- linear colliders.Comment: 28 pages in LaTeX, including 7 encapsulated PostScript figures, published versio

    Assessing the role of EO in biodiversity monitoring: options for integrating in-situ observations with EO within the context of the EBONE concept

    Get PDF
    The European Biodiversity Observation Network (EBONE) is a European contribution on terrestrial monitoring to GEO BON, the Group on Earth Observations Biodiversity Observation Network. EBONE’s aims are to develop a system of biodiversity observation at regional, national and European levels by assessing existing approaches in terms of their validity and applicability starting in Europe, then expanding to regions in Africa. The objective of EBONE is to deliver: 1. A sound scientific basis for the production of statistical estimates of stock and change of key indicators; 2. The development of a system for estimating past changes and forecasting and testing policy options and management strategies for threatened ecosystems and species; 3. A proposal for a cost-effective biodiversity monitoring system. There is a consensus that Earth Observation (EO) has a role to play in monitoring biodiversity. With its capacity to observe detailed spatial patterns and variability across large areas at regular intervals, our instinct suggests that EO could deliver the type of spatial and temporal coverage that is beyond reach with in-situ efforts. Furthermore, when considering the emerging networks of in-situ observations, the prospect of enhancing the quality of the information whilst reducing cost through integration is compelling. This report gives a realistic assessment of the role of EO in biodiversity monitoring and the options for integrating in-situ observations with EO within the context of the EBONE concept (cfr. EBONE-ID1.4). The assessment is mainly based on a set of targeted pilot studies. Building on this assessment, the report then presents a series of recommendations on the best options for using EO in an effective, consistent and sustainable biodiversity monitoring scheme. The issues that we faced were many: 1. Integration can be interpreted in different ways. One possible interpretation is: the combined use of independent data sets to deliver a different but improved data set; another is: the use of one data set to complement another dataset. 2. The targeted improvement will vary with stakeholder group: some will seek for more efficiency, others for more reliable estimates (accuracy and/or precision); others for more detail in space and/or time or more of everything. 3. Integration requires a link between the datasets (EO and in-situ). The strength of the link between reflected electromagnetic radiation and the habitats and their biodiversity observed in-situ is function of many variables, for example: the spatial scale of the observations; timing of the observations; the adopted nomenclature for classification; the complexity of the landscape in terms of composition, spatial structure and the physical environment; the habitat and land cover types under consideration. 4. The type of the EO data available varies (function of e.g. budget, size and location of region, cloudiness, national and/or international investment in airborne campaigns or space technology) which determines its capability to deliver the required output. EO and in-situ could be combined in different ways, depending on the type of integration we wanted to achieve and the targeted improvement. We aimed for an improvement in accuracy (i.e. the reduction in error of our indicator estimate calculated for an environmental zone). Furthermore, EO would also provide the spatial patterns for correlated in-situ data. EBONE in its initial development, focused on three main indicators covering: (i) the extent and change of habitats of European interest in the context of a general habitat assessment; (ii) abundance and distribution of selected species (birds, butterflies and plants); and (iii) fragmentation of natural and semi-natural areas. For habitat extent, we decided that it did not matter how in-situ was integrated with EO as long as we could demonstrate that acceptable accuracies could be achieved and the precision could consistently be improved. The nomenclature used to map habitats in-situ was the General Habitat Classification. We considered the following options where the EO and in-situ play different roles: using in-situ samples to re-calibrate a habitat map independently derived from EO; improving the accuracy of in-situ sampled habitat statistics, by post-stratification with correlated EO data; and using in-situ samples to train the classification of EO data into habitat types where the EO data delivers full coverage or a larger number of samples. For some of the above cases we also considered the impact that the sampling strategy employed to deliver the samples would have on the accuracy and precision achieved. Restricted access to European wide species data prevented work on the indicator ‘abundance and distribution of species’. With respect to the indicator ‘fragmentation’, we investigated ways of delivering EO derived measures of habitat patterns that are meaningful to sampled in-situ observations
    • …
    corecore