2,411 research outputs found

    A Deep-structured Conditional Random Field Model for Object Silhouette Tracking

    Full text link
    In this work, we introduce a deep-structured conditional random field (DS-CRF) model for the purpose of state-based object silhouette tracking. The proposed DS-CRF model consists of a series of state layers, where each state layer spatially characterizes the object silhouette at a particular point in time. The interactions between adjacent state layers are established by inter-layer connectivity dynamically determined based on inter-frame optical flow. By incorporate both spatial and temporal context in a dynamic fashion within such a deep-structured probabilistic graphical model, the proposed DS-CRF model allows us to develop a framework that can accurately and efficiently track object silhouettes that can change greatly over time, as well as under different situations such as occlusion and multiple targets within the scene. Experiment results using video surveillance datasets containing different scenarios such as occlusion and multiple targets showed that the proposed DS-CRF approach provides strong object silhouette tracking performance when compared to baseline methods such as mean-shift tracking, as well as state-of-the-art methods such as context tracking and boosted particle filtering.Comment: 17 page

    Temporal and phylogenetic evolution of the sauropod dinosaur body plan

    Get PDF
    The colossal size and body plan of sauropod dinosaurs are unparalleled in terrestrial vertebrates. However, to date, there have been only limited attempts to examine temporal and phylogenetic patterns in the sauropod bauplan. Here, we combine three-dimensional computational models with phylogenetic reconstructions to quantify the evolution of whole-body shape and body segment properties across the sauropod radiation. Limitations associated with the absence of soft tissue preservation in fossils result in large error bars about mean absolute body shape predictions. However, applying any consistent skeleton : body volume ratio to all taxa does yield changes in body shape that appear concurrent with major macroevolutionary events in sauropod history. A caudad shift in centre-of-mass (CoM) in Middle Triassic Saurischia, associated with the evolution of bipedalism in various dinosaur lineages, was reversed in Late Triassic sauropodomorphs. A craniad CoM shift coincided with the evolution of quadrupedalism in the Late Triassic, followed by a more striking craniad shift in Late Jurassic–Cretaceous titanosauriforms, which included the largest sauropods. These craniad CoM shifts are strongly correlated with neck enlargement, a key innovation in sauropod evolution and pivotal to their gigantism. By creating a much larger feeding envelope, neck elongation is thought to have increased feeding efficiency and opened up trophic niches that were inaccessible to other herbivores. However, we find that relative neck size and CoM position are not strongly correlated with inferred feeding habits. Instead the craniad CoM positions of titanosauriforms appear closely linked with locomotion and environmental distributions, potentially contributing to the continued success of this group until the end-Cretaceous, with all other sauropods having gone extinct by the early Late Cretaceous

    End-to-end Recovery of Human Shape and Pose

    Full text link
    We describe Human Mesh Recovery (HMR), an end-to-end framework for reconstructing a full 3D mesh of a human body from a single RGB image. In contrast to most current methods that compute 2D or 3D joint locations, we produce a richer and more useful mesh representation that is parameterized by shape and 3D joint angles. The main objective is to minimize the reprojection loss of keypoints, which allow our model to be trained using images in-the-wild that only have ground truth 2D annotations. However, the reprojection loss alone leaves the model highly under constrained. In this work we address this problem by introducing an adversary trained to tell whether a human body parameter is real or not using a large database of 3D human meshes. We show that HMR can be trained with and without using any paired 2D-to-3D supervision. We do not rely on intermediate 2D keypoint detections and infer 3D pose and shape parameters directly from image pixels. Our model runs in real-time given a bounding box containing the person. We demonstrate our approach on various images in-the-wild and out-perform previous optimization based methods that output 3D meshes and show competitive results on tasks such as 3D joint location estimation and part segmentation.Comment: CVPR 2018, Project page with code: https://akanazawa.github.io/hmr

    Single camera pose estimation using Bayesian filtering and Kinect motion priors

    Full text link
    Traditional approaches to upper body pose estimation using monocular vision rely on complex body models and a large variety of geometric constraints. We argue that this is not ideal and somewhat inelegant as it results in large processing burdens, and instead attempt to incorporate these constraints through priors obtained directly from training data. A prior distribution covering the probability of a human pose occurring is used to incorporate likely human poses. This distribution is obtained offline, by fitting a Gaussian mixture model to a large dataset of recorded human body poses, tracked using a Kinect sensor. We combine this prior information with a random walk transition model to obtain an upper body model, suitable for use within a recursive Bayesian filtering framework. Our model can be viewed as a mixture of discrete Ornstein-Uhlenbeck processes, in that states behave as random walks, but drift towards a set of typically observed poses. This model is combined with measurements of the human head and hand positions, using recursive Bayesian estimation to incorporate temporal information. Measurements are obtained using face detection and a simple skin colour hand detector, trained using the detected face. The suggested model is designed with analytical tractability in mind and we show that the pose tracking can be Rao-Blackwellised using the mixture Kalman filter, allowing for computational efficiency while still incorporating bio-mechanical properties of the upper body. In addition, the use of the proposed upper body model allows reliable three-dimensional pose estimates to be obtained indirectly for a number of joints that are often difficult to detect using traditional object recognition strategies. Comparisons with Kinect sensor results and the state of the art in 2D pose estimation highlight the efficacy of the proposed approach.Comment: 25 pages, Technical report, related to Burke and Lasenby, AMDO 2014 conference paper. Code sample: https://github.com/mgb45/SignerBodyPose Video: https://www.youtube.com/watch?v=dJMTSo7-uF
    • …
    corecore