122 research outputs found

    Transductive Learning with String Kernels for Cross-Domain Text Classification

    Full text link
    For many text classification tasks, there is a major problem posed by the lack of labeled data in a target domain. Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of such classifiers is usually lower in the cross-domain setting. Recently, string kernels have obtained state-of-the-art results in various text classification tasks such as native language identification or automatic essay scoring. Moreover, classifiers based on string kernels have been found to be robust to the distribution gap between different domains. In this paper, we formally describe an algorithm composed of two simple yet effective transductive learning approaches to further improve the results of string kernels in cross-domain settings. By adapting string kernels to the test set without using the ground-truth test labels, we report significantly better accuracy rates in cross-domain English polarity classification.Comment: Accepted at ICONIP 2018. arXiv admin note: substantial text overlap with arXiv:1808.0840

    How do women prepare for pregnancy? Preconception experiences of women attending antenatal services and views of health professionals

    Get PDF
    Copyright: © 2014 Stephenson et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Main objective - To determine the extent to which women plan and prepare for pregnancy. Methods - Cross-sectional questionnaire survey of pregnant women attending three maternity services in London about knowledge and uptake of preconception care; including a robust measure of pregnancy planning, and phone interviews with a range of health care professionals. Main results - We recruited 1173/1288 (90%) women, median age of 32 years. 73% had clearly planned their pregnancy, 24% were ambivalent and only 3% of pregnancies were unplanned. 51% of all women and 63% of those with a planned pregnancy took folic acid before pregnancy. 21% of all women reported smoking and 61% reported drinking alcohol in the 3 months before pregnancy; 48% of smokers and 41% of drinkers reduced or stopped before pregnancy. The 51% of all women who reported advice from a health professional before becoming pregnant were more likely to adopt healthier behaviours before pregnancy [adjusted odds ratios for greatest health professional input compared with none were 2.34 (95% confidence interval 1.54–3.54) for taking folic acid and 2.18 (95% CI 1.42–3.36) for adopting a healthier diet before pregnancy]. Interviews with 20 health professionals indicated low awareness of preconception health issues, missed opportunities and confusion about responsibility for delivery of preconception care. Significance of the findings - Despite a high level of pregnancy planning, awareness of preconception health among women and health professionals is low, and responsibility for providing preconception care is unclear. However, many women are motivated to adopt healthier behaviours in the preconception period, as indicated by halving of reported smoking rates in this study. The link between health professional input and healthy behaviour change before pregnancy is a new finding that should invigorate strategies to improve awareness and uptake of pre-pregnancy health care, and bring wider benefits for public health.Department of Healt

    CPC: Crime, Policing and Citizenship - Intelligent Policing and Big Data

    Get PDF
    Crime, Policing and Citizenship (CPC) – Space-Time Interactions of Dynamic Networks has been a major UK EPSRC-funded research project. It has been a multidisciplinary collaboration of geoinformatics, crime science, computer science and geography within University College London (UCL), in partnership with the Metropolitan Police Service (MPS). The aim of the project has been to develop new methods and applications in space-time analytics and emergent network complexity, in order to uncover patterning and interactions in crime, policing and citizen perceptions. The work carried out throughout the project will help inform policing at a range of scales, from the local to the city-wide, with the goal of reducing both crime and the fear of crime. The CPC project is timely given the tremendous challenging facing policing in big cities nationally and globally, as consequences of changes in society, population structure and economic well-being. It addresses these issues through an intelligent approach to data-driven policing, using daily reported crime statistics, GPS traces of foot and vehicular patrols, surveys of public attitudes and geo-temporal demographic data of changing community structure. The analytic focus takes a spatio-temporal perspective, reflecting the strong spatial and temporal integration of criminal, policing and citizen activities. Street networks are used throughout as a basis for analysis, reflecting their role as a key determinant of urban structure and the substrate on which crime and policing take place. The project has presented a manifesto for ‘intelligent policing’ which embodies the key issues arising in the transition from Big Data into actionable insights. Police intelligence should go beyond current practice, incorporating not only the prediction of events, but also how to respond to them, and how to evaluate the actions taken. Cutting-edge network-based crime prediction methods have been developed to accurately predict crime risks at street segment level, helping police forces to focus resources in the right places at the right times. Methods and tools have been implemented to support senior offices in strategic planning, and to provide guidance to frontline officers in daily patrolling. To evaluate police performance, models and tools have been developed to aid identification of areas requiring greater attention, and to analyse the patrolling behaviours of officers. Methods to understand and model confidence in policing have also been explored, suggesting strategies by which confidence in the police can be improved in different population segments and neighbourhood areas. A number of tools have been developed during the course of the project include data-driven methods for crime prediction and for performance evaluation. We anticipate that these will ultimately be adopted in daily policing practice and will play an important role in the modernisation of policing. Furthermore, we believe that the approaches to the building of public trust and confidence that we suggest will contribute to the transformation and improvement of the relationship between the public and police

    A comparison of variational and Markov chain Monte Carlo methods for inference in partially observed stochastic dynamic systems

    Get PDF
    In recent work we have developed a novel variational inference method for partially observed systems governed by stochastic differential equations. In this paper we provide a comparison of the Variational Gaussian Process Smoother with an exact solution computed using a Hybrid Monte Carlo approach to path sampling, applied to a stochastic double well potential model. It is demonstrated that the variational smoother provides us a very accurate estimate of mean path while conditional variance is slightly underestimated. We conclude with some remarks as to the advantages and disadvantages of the variational smoother. © 2008 Springer Science + Business Media LLC

    A Kernel-Based Membrane Clustering Algorithm

    Get PDF
    The existing membrane clustering algorithms may fail to handle the data sets with non-spherical cluster boundaries. To overcome the shortcoming, this paper introduces kernel methods into membrane clustering algorithms and proposes a kernel-based membrane clustering algorithm, KMCA. By using non-linear kernel function, samples in original data space are mapped to data points in a high-dimension feature space, and the data points are clustered by membrane clustering algorithms. Therefore, a data clustering problem is formalized as a kernel clustering problem. In KMCA algorithm, a tissue-like P system is designed to determine the optimal cluster centers for the kernel clustering problem. Due to the use of non-linear kernel function, the proposed KMCA algorithm can well deal with the data sets with non-spherical cluster boundaries. The proposed KMCA algorithm is evaluated on nine benchmark data sets and is compared with four existing clustering algorithms

    Why do women invest in pre-pregnancy health and care? A qualitative investigation with women attending maternity services

    Get PDF
    Background Despite the importance attributed to good pre-pregnancy care and its potential to improve pregnancy and child health outcomes, relatively little is known about why women invest in pre-pregnancy health and care. We sought to gain insight into why women invested in pre-pregnancy health and care. Methods We carried out 20 qualitative in-depth interviews with pregnant or recently pregnant women who were drawn from a survey of antenatal clinic attendees in London, UK. Interviewees were purposively sampled to include high and low investors in pre-pregnancy health and care, with variation in age, partnership status, ethnicity and pre-existing medical conditions. Data analysis was conducted using the Framework method. Results We identified three groups in relation to pre-pregnancy health and care: 1) The “prepared” group, who had high levels of pregnancy planning and mostly positive attitudes to micronutrient supplementation outside of pregnancy, carried out pre-pregnancy activities such as taking folic acid and making changes to diet and lifestyle. 2) The “poor knowledge” group, who also had high levels of pregnancy planning, did not carry out pre-pregnancy activities and described themselves as having poor knowledge. Elsewhere in their interviews they expressed a strong dislike of micronutrient supplementation. 3) The “absent pre-pregnancy period” group, had the lowest levels of pregnancy planning and also expressed anti-supplement views. Even discussing the pre-pregnancy period with this group was difficult as responses to questions quickly shifted to focus on pregnancy itself. Knowledge of folic acid was poor in all groups. Conclusion Different pre-pregnancy care approaches are likely to be needed for each of the groups. Among the “prepared” group, who were proactive and receptive to health messages, greater availability of information and better response from health professionals could improve the range of pre-pregnancy activities carried out. Among the “poor knowledge” group, better response from health professionals might yield greater uptake of pre-pregnancy information. A different, general health strategy might be more appropriate for the “absent pre-pregnancy period” group. The fact that general attitudes to micronutrient supplementation were closely related to whether or not women invested in pre-pregnancy health and care was an unanticipated finding and warrants further investigation.This report is independent research commissioned and funded by the Department of Health Policy Research Programme Pre-Pregnancy Health and Care in England: Exploring Implementation and Public Health Impact, 006/0068

    SiteSeek: Post-translational modification analysis using adaptive locality-effective kernel methods and new profiles

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Post-translational modifications have a substantial influence on the structure and functions of protein. Post-translational phosphorylation is one of the most common modification that occur in intracellular proteins. Accurate prediction of protein phosphorylation sites is of great importance for the understanding of diverse cellular signalling processes in both the human body and in animals. In this study, we propose a new machine learning based protein phosphorylation site predictor, SiteSeek. SiteSeek is trained using a novel compact evolutionary and hydrophobicity profile to detect possible protein phosphorylation sites for a target sequence. The newly proposed method proves to be more accurate and exhibits a much stable predictive performance than currently existing phosphorylation site predictors.</p> <p>Results</p> <p>The performance of the proposed model was compared to nine existing different machine learning models and four widely known phosphorylation site predictors with the newly proposed PS-Benchmark_1 dataset to contrast their accuracy, sensitivity, specificity and correlation coefficient. SiteSeek showed better predictive performance with 86.6% accuracy, 83.8% sensitivity, 92.5% specificity and 0.77 correlation-coefficient on the four main kinase families (CDK, CK2, PKA, and PKC).</p> <p>Conclusion</p> <p>Our newly proposed methods used in SiteSeek were shown to be useful for the identification of protein phosphorylation sites as it performed much better than widely known predictors on the newly built PS-Benchmark_1 dataset.</p
    corecore