138 research outputs found

    On Using Machine Learning to Identify Knowledge in API Reference Documentation

    Full text link
    Using API reference documentation like JavaDoc is an integral part of software development. Previous research introduced a grounded taxonomy that organizes API documentation knowledge in 12 types, including knowledge about the Functionality, Structure, and Quality of an API. We study how well modern text classification approaches can automatically identify documentation containing specific knowledge types. We compared conventional machine learning (k-NN and SVM) and deep learning approaches trained on manually annotated Java and .NET API documentation (n = 5,574). When classifying the knowledge types individually (i.e., multiple binary classifiers) the best AUPRC was up to 87%. The deep learning and SVM classifiers seem complementary. For four knowledge types (Concept, Control, Pattern, and Non-Information), SVM clearly outperforms deep learning which, on the other hand, is more accurate for identifying the remaining types. When considering multiple knowledge types at once (i.e., multi-label classification) deep learning outperforms na\"ive baselines and traditional machine learning achieving a MacroAUC up to 79%. We also compared classifiers using embeddings pre-trained on generic text corpora and StackOverflow but did not observe significant improvements. Finally, to assess the generalizability of the classifiers, we re-tested them on a different, unseen Python documentation dataset. Classifiers for Functionality, Concept, Purpose, Pattern, and Directive seem to generalize from Java and .NET to Python documentation. The accuracy related to the remaining types seems API-specific. We discuss our results and how they inform the development of tools for supporting developers sharing and accessing API knowledge. Published article: https://doi.org/10.1145/3338906.333894

    Self-reported side effects and adherence to antiretroviral therapy in HIV-infected pregnant women under option B+: a prospective study

    Get PDF
    BACKGROUND: Antiretroviral therapy (ART) regimens containing efavirenz (EFV) are recommended as part of universal ART for pregnant and breastfeeding women. EFV may have appreciable side effects (SE), and ART adherence in pregnancy is a major concern, but little is known about ART SE and associations with adherence in pregnancy. METHODS: We investigated the distribution of patient-reported SE (based on Division of AIDS categories) and the association of SE with missed ART doses in a cohort of 517 women starting EFV+3TC/FTC+TDF during pregnancy. In analysis, SE were considered in terms of their overall frequency, by systems category, and by latent classes. RESULTS: Overall 97% of women reported experiencing at least one SE after ART initiation, with 48% experiencing more than five SE. Gastrointestinal, central nervous system, systemic and skin SE were reported by 81%, 85%, 79% and 31% of women, respectively, with considerable overlap across groups. At least one missed dose was reported by 32% of women. In multivariable models, ART non-adherence was associated with systemic SE compared to other systems categories, and measures of the overall burden of SE experienced were most strongly associated with missed ART doses. CONCLUSION: These data demonstrate very high levels of SE in pregnant women initiating EFV-based ART and a strong association between SE burden and ART adherence. ART regimens with reduced SE profiles may enhance adherence, and as countries expand universal ART for all adult patients, counseling must include preparation for ART SE

    Inequity in the distribution of non-communicable disease multimorbidity in adults in South Africa: An analysis of prevalence and patterns

    Get PDF
    The present study examined the prevalence and patterns of noncommunicable disease multimorbidity by wealth quintile among adults in South Africa. The South African National Income Dynamics Study Wave 5 was conducted in 2017 to examine the livelihoods of individuals and households. We analysed data in people aged 15 years and older (N = 27,042), including self-reported diagnosis of diabetes, stroke, heart disease and anthropometric measurements. Logistic regression and latent class analysis were used to analyse factors associated with multimorbidity and common disease patterns

    Efficient Parallel Statistical Model Checking of Biochemical Networks

    Full text link
    We consider the problem of verifying stochastic models of biochemical networks against behavioral properties expressed in temporal logic terms. Exact probabilistic verification approaches such as, for example, CSL/PCTL model checking, are undermined by a huge computational demand which rule them out for most real case studies. Less demanding approaches, such as statistical model checking, estimate the likelihood that a property is satisfied by sampling executions out of the stochastic model. We propose a methodology for efficiently estimating the likelihood that a LTL property P holds of a stochastic model of a biochemical network. As with other statistical verification techniques, the methodology we propose uses a stochastic simulation algorithm for generating execution samples, however there are three key aspects that improve the efficiency: first, the sample generation is driven by on-the-fly verification of P which results in optimal overall simulation time. Second, the confidence interval estimation for the probability of P to hold is based on an efficient variant of the Wilson method which ensures a faster convergence. Third, the whole methodology is designed according to a parallel fashion and a prototype software tool has been implemented that performs the sampling/verification process in parallel over an HPC architecture

    The strategic relevance of manufacturing technology : an overall quality concept to promote innovation preventing drug shortage

    Get PDF
    Manufacturing is the bridge between research and patient: without product, there is no clinical outcome. Shortage has a variety of causes, in this paper we analyse only causes related to manufacturing technology and we use shortage as a paradigm highliting the relevance of Pharmaceutical Technology. Product and process complexity and capacity issues are the main challenge for the Pharmaceutical Industry Supply chain. Manufacturing Technology should be acknowledged as a R&D step and as a very important matter during University degree in Pharmacy and related disciplines, promoting collaboration between Academia and Industry, measured during HTA step and rewarded in terms of price and reimbursement. The above elements are not yet properly recognised, and manufacturing technology is taken in to consideration only when a shortage is in place. In a previous work, Panzitta et al. proposed to perform a full technology assessment at the Health Technological Assessment stage, evaluating three main technical aspects of a medicine: manufacturing process, physicochemical properties, and formulation characteristics. In this paper, we develop the concept of manufacturing appraisal, providing a technical overview of upcoming challenges, a risk based approach and an economic picture of shortage costs. We develop also an overall quality concept, not limited to GMP factors but broaden to all elements leading to a robust supply and promoting technical innovation
    • …
    corecore