67 research outputs found
Adaptive Target Recognition: A Case Study Involving Airport Baggage Screening
This work addresses the question whether it is possible to design a
computer-vision based automatic threat recognition (ATR) system so that it can
adapt to changing specifications of a threat without having to create a new ATR
each time. The changes in threat specifications, which may be warranted by
intelligence reports and world events, are typically regarding the physical
characteristics of what constitutes a threat: its material composition, its
shape, its method of concealment, etc. Here we present our design of an AATR
system (Adaptive ATR) that can adapt to changing specifications in materials
characterization (meaning density, as measured by its x-ray attenuation
coefficient), its mass, and its thickness. Our design uses a two-stage cascaded
approach, in which the first stage is characterized by a high recall rate over
the entire range of possibilities for the threat parameters that are allowed to
change. The purpose of the second stage is to then fine-tune the performance of
the overall system for the current threat specifications. The computational
effort for this fine-tuning for achieving a desired PD/PFA rate is far less
than what it would take to create a new classifier with the same overall
performance for the new set of threat specifications
RMPD - A Recursive Mid-Point Displacement Algorithm for Path Planning
Motivated by what is required for real-time path planning, the paper starts
out by presenting sRMPD, a new recursive "local" planner founded on the key
notion that, unless made necessary by an obstacle, there must be no deviation
from the shortest path between any two points, which would normally be a
straight line path in the configuration space. Subsequently, we increase the
power of sRMPD by using it as a "connect" subroutine call in a higher-level
sampling-based algorithm mRMPD that is inspired by multi-RRT. As a consequence,
mRMPD spawns a larger number of space exploring trees in regions of the
configuration space that are characterized by a higher density of obstacles.
The overall effect is a hybrid tree growing strategy with a trade-off between
random exploration as made possible by multi-RRT based logic and immediate
exploitation of opportunities to connect two states as made possible by sRMPD.
The mRMPD planner can be biased with regard to this trade-off for solving
different kinds of planning problems efficiently. Based on the test cases we
have run, our experiments show that mRMPD can reduce planning time by up to 80%
compared to basic RRT
Self-Supervised One-Shot Learning for Automatic Segmentation of StyleGAN Images
We propose a framework for the automatic one-shot segmentation of synthetic
images generated by a StyleGAN. Our framework is based on the observation that
the multi-scale hidden features in the GAN generator hold useful semantic
information that can be utilized for automatic on-the-fly segmentation of the
generated images. Using these features, our framework learns to segment
synthetic images using a self-supervised contrastive clustering algorithm that
projects the hidden features into a compact space for per-pixel classification.
This contrastive learner is based on using a novel data augmentation strategy
and a pixel-wise swapped prediction loss that leads to faster learning of the
feature vectors for one-shot segmentation. We have tested our implementation on
five standard benchmarks to yield a segmentation performance that not only
outperforms the semi-supervised baselines by an average wIoU margin of 1.02 %
but also improves the inference speeds by a factor of 4.5. Finally, we also
show the results of using the proposed one-shot learner in implementing BagGAN,
a framework for producing annotated synthetic baggage X-ray scans for threat
detection. This framework was trained and tested on the PIDRay baggage
benchmark to yield a performance comparable to its baseline segmenter based on
manual annotations
Purdue RVL-SLLL American Sign Language Database
Development of automatic recognition systems for American Sign Language (ASL) needs a comprehensive database that provides a range of signed material under controlled and less-controlled lighting conditions. The database we created contains (a) handshapes in isolation and in single signs, (b) the American fingerspelling alphabet, (c) numbers, (d) movement in single signs, and (e) examples of short discourse narratives for testing sign recognition in connected linguistic contexts.
All of these are produced by 14 fluent Deaf ASL signers under controlled lighting conditions in a professional studio. All except the short narratives are also produced in less than superior lighting conditions.
These data can provide the recognition algorithm developer with the opportunity to move from simple recognition situations in the best of circumstances to more complex recognition situations with challenging lighting situations.
The database was collected with support from the National Science Foundation Linguistics Program under Grant No. 99-05848 and 0414953
Scripting with Objects: A Comparative Presentation of Object-Oriented Scripting with Perl and Python
Designing with objects: object-oriented design patterns explained with stories from Harry Potter
 All code examples in the book are available for download on a companion site with resources for readers and instructors A refreshing alternative to the rather abstract and dry explanations of the object-oriented design patterns in much of the existing literature on the subject In 24 chapters, Designing with Objects explains well-known design patterns by relating them to stories from the Harry Potter serie
- …