48,107 research outputs found

    Sampling labelled profile data for identity resolution

    Get PDF
    Identity resolution capability for social networking profiles is important for a range of purposes, from open-source intelligence applications to forming semantic web connections. Yet replication of research in this area is hampered by the lack of access to ground-truth data linking the identities of profiles from different networks. Almost all data sources previously used by researchers are no longer available, and historic datasets are both of decreasing relevance to the modern social networking landscape and ethically troublesome regarding the preservation and publication of personal data. We present and evaluate a method which provides researchers in identity resolution with easy access to a realistically-challenging labelled dataset of online profiles, drawing on four of the currently largest and most influential online social networks. We validate the comparability of samples drawn through this method and discuss the implications of this mechanism for researchers as well as potential alternatives and extensions

    Microbiology and atmospheric processes: Biological, physical and chemical characterization of aerosol particles

    Get PDF
    The interest in bioaerosols has traditionally been linked to health hazards for humans, animals and plants. However, several components of bioaerosols exhibit physical properties of great significance for cloud processes, such as ice nucleation and cloud condensation. To gain a better understanding of their influence on climate, it is therefore important to determine the composition, concentration, seasonal fluctuation, regional diversity and evolution of bioaerosols. In this paper, we will review briefly the existing techniques for detection, quantification, physical and chemical analysis of biological particles, attempting to bridge physical, chemical and biological methods for analysis of biological particles and integrate them with aerosol sampling techniques. We will also explore some emerging spectroscopy techniques for bulk and single-particle analysis that have potential for in-situ physical and chemical analysis. Lastly, we will outline open questions and further desired capabilities (e. g., in-situ, sensitive, both broad and selective, on-line, time-resolved, rapid, versatile, cost-effective techniques) required prior to comprehensive understanding of chemical and physical characterization of bioaerosols

    Improving Landmark Localization with Semi-Supervised Learning

    Full text link
    We present two techniques to improve landmark localization in images from partially annotated datasets. Our primary goal is to leverage the common situation where precise landmark locations are only provided for a small data subset, but where class labels for classification or regression tasks related to the landmarks are more abundantly available. First, we propose the framework of sequential multitasking and explore it here through an architecture for landmark localization where training with class labels acts as an auxiliary signal to guide the landmark localization on unlabeled data. A key aspect of our approach is that errors can be backpropagated through a complete landmark localization model. Second, we propose and explore an unsupervised learning technique for landmark localization based on having a model predict equivariant landmarks with respect to transformations applied to the image. We show that these techniques, improve landmark prediction considerably and can learn effective detectors even when only a small fraction of the dataset has landmark labels. We present results on two toy datasets and four real datasets, with hands and faces, and report new state-of-the-art on two datasets in the wild, e.g. with only 5\% of labeled images we outperform previous state-of-the-art trained on the AFLW dataset.Comment: Published as a conference paper in CVPR 201
    • ā€¦
    corecore