51,931 research outputs found

    Multiple Instance Learning: A Survey of Problem Characteristics and Applications

    Full text link
    Multiple instance learning (MIL) is a form of weakly supervised learning where training instances are arranged in sets, called bags, and a label is provided for the entire bag. This formulation is gaining interest because it naturally fits various problems and allows to leverage weakly labeled data. Consequently, it has been used in diverse application fields such as computer vision and document classification. However, learning from bags raises important challenges that are unique to MIL. This paper provides a comprehensive survey of the characteristics which define and differentiate the types of MIL problems. Until now, these problem characteristics have not been formally identified and described. As a result, the variations in performance of MIL algorithms from one data set to another are difficult to explain. In this paper, MIL problem characteristics are grouped into four broad categories: the composition of the bags, the types of data distribution, the ambiguity of instance labels, and the task to be performed. Methods specialized to address each category are reviewed. Then, the extent to which these characteristics manifest themselves in key MIL application areas are described. Finally, experiments are conducted to compare the performance of 16 state-of-the-art MIL methods on selected problem characteristics. This paper provides insight on how the problem characteristics affect MIL algorithms, recommendations for future benchmarking and promising avenues for research

    Voids in cosmological simulations over cosmic time

    Full text link
    We study evolution of voids in cosmological simulations using a new method for tracing voids over cosmic time. The method is based on tracking watershed basins (contiguous regions around density minima) of well developed voids at low redshift, on a regular grid of density field. It enables us to construct a robust and continuous mapping between voids at different redshifts, from initial conditions to the present time. We discuss how the new approach eliminates strong spurious effects of numerical origin when voids evolution is traced by matching voids between successive snapshots (by analogy to halo merger trees). We apply the new method to a cosmological simulation of a standard LambdaCDM cosmological model and study evolution of basic properties of typical voids (with effective radii between 6Mpc/h and 20Mpc/h at redshift z=0) such as volumes, shapes, matter density distributions and relative alignments. The final voids at low redshifts appear to retain a significant part of the configuration acquired in initial conditions. Shapes of voids evolve in a collective way which barely modifies the overall distribution of the axial ratios. The evolution appears to have a weak impact on mutual alignments of voids implying that the present state is in large part set up by the primordial density field. We present evolution of dark matter density profiles computed on iso-density surfaces which comply with the actual shapes of voids. Unlike spherical density profiles, this approach enables us to demonstrate development of theoretically predicted bucket-like shape of the final density profiles indicating a wide flat core and a sharp transition to high-density void walls.Comment: 13 pages, 13 figures; accepted for publication in MNRA

    STV-based Video Feature Processing for Action Recognition

    Get PDF
    In comparison to still image-based processes, video features can provide rich and intuitive information about dynamic events occurred over a period of time, such as human actions, crowd behaviours, and other subject pattern changes. Although substantial progresses have been made in the last decade on image processing and seen its successful applications in face matching and object recognition, video-based event detection still remains one of the most difficult challenges in computer vision research due to its complex continuous or discrete input signals, arbitrary dynamic feature definitions, and the often ambiguous analytical methods. In this paper, a Spatio-Temporal Volume (STV) and region intersection (RI) based 3D shape-matching method has been proposed to facilitate the definition and recognition of human actions recorded in videos. The distinctive characteristics and the performance gain of the devised approach stemmed from a coefficient factor-boosted 3D region intersection and matching mechanism developed in this research. This paper also reported the investigation into techniques for efficient STV data filtering to reduce the amount of voxels (volumetric-pixels) that need to be processed in each operational cycle in the implemented system. The encouraging features and improvements on the operational performance registered in the experiments have been discussed at the end
    corecore