1,055 research outputs found
Content-based Video Retrieval by Integrating Spatio-Temporal and Stochastic Recognition of Events
As amounts of publicly available video data grow the need to query this data efficiently becomes significant. Consequently content-based retrieval of video data turns out to be a challenging and important problem. We address the specific aspect of inferring semantics automatically from raw video data. In particular, we introduce a new video data model that supports the integrated use of two different approaches for mapping low-level features to high-level concepts. Firstly, the model is extended with a rule-based approach that supports spatio-temporal formalization of high-level concepts, and then with a stochastic approach. Furthermore, results on real tennis video data are presented, demonstrating the validity of both approaches, as well us advantages of their integrated us
Data preparation for artificial intelligence in medical imaging: A comprehensive guide to open-access platforms and tools
The vast amount of data produced by today's medical imaging systems has led medical professionals to turn to novel technologies in order to efficiently handle their data and exploit the rich information present in them. In this context, artificial intelligence (AI) is emerging as one of the most prominent solutions, promising to revolutionise every day clinical practice and medical research. The pillar supporting the development of reliable and robust AI algorithms is the appropriate preparation of the medical images to be used by the AI-driven solutions. Here, we provide a comprehensive guide for the necessary steps to prepare medical images prior to developing or applying AI algorithms. The main steps involved in a typical medical image preparation pipeline include: (i) image acquisition at clinical sites, (ii) image de-identification to remove personal information and protect patient privacy, (iii) data curation to control for image and associated information quality, (iv) image storage, and (v) image annotation. There exists a plethora of open access tools to perform each of the aforementioned tasks and are hereby reviewed. Furthermore, we detail medical image repositories covering different organs and diseases. Such repositories are constantly increasing and enriched with the advent of big data. Lastly, we offer directions for future work in this rapidly evolving field
Learning Multi-Object Tracking and Segmentation from Automatic Annotations
In this work we contribute a novel pipeline to automatically generate
training data, and to improve over state-of-the-art multi-object tracking and
segmentation (MOTS) methods. Our proposed track mining algorithm turns raw
street-level videos into high-fidelity MOTS training data, is scalable and
overcomes the need of expensive and time-consuming manual annotation
approaches. We leverage state-of-the-art instance segmentation results in
combination with optical flow predictions, also trained on automatically
harvested training data. Our second major contribution is MOTSNet - a deep
learning, tracking-by-detection architecture for MOTS - deploying a novel
mask-pooling layer for improved object association over time. Training MOTSNet
with our automatically extracted data leads to significantly improved sMOTSA
scores on the novel KITTI MOTS dataset (+1.9%/+7.5% on cars/pedestrians), and
MOTSNet improves by +4.1% over previously best methods on the MOTSChallenge
dataset. Our most impressive finding is that we can improve over previous
best-performing works, even in complete absence of manually annotated MOTS
training data
- âŠ