268 research outputs found
SCHLIEREN SEQUENCE ANALYSIS USING COMPUTER VISION
Computer vision-based methods are proposed for extraction and measurement of flow structures of interest in schlieren video. As schlieren data has increased with faster frame rates, we are faced with thousands of images to analyze. This presents an opportunity to study global flow structures over time that may not be evident from surface measurements. A degree of automation is desirable to extract flow structures and features to give information on their behavior through the sequence. Using an interdisciplinary approach, the analysis of large schlieren data is recast as a computer vision problem. The double-cone schlieren sequence is used as a testbed for the methodology; it is unique in that it contains 5,000 images, complex phenomena, and is feature rich.
Oblique structures such as shock waves and shear layers are common in schlieren images. A vision-based methodology is used to provide an estimate of oblique structure angles through the unsteady sequence. The methodology has been applied to a complex flowfield with multiple shocks. A converged detection success rate between 94% and 97% for these structures is obtained. The modified curvature scale space is used to define features at salient points on shock contours. A challenge in developing methods for feature extraction in schlieren images is the reconciliation of existing techniques with features of interest to an aerodynamicist. Domain-specific knowledge of physics must therefore be incorporated into the definition and detec- tion phases. Known location and physically possible structure representations form a knowledge base that provides a unique feature definition and extraction. Model tip location and the motion of a shock intersection across several thousand frames are identified, localized, and tracked.
Images are parsed into physically meaningful labels using segmentation. Using this representation, it is shown that in the double-cone flowfield, the dominant unsteady motion is associated with large scale random events within the aft-cone bow shock. Small scale organized motion is associated with the shock-separated flow on the fore-cone surface. We show that computer vision is a natural and useful extension to the evaluation of schlieren data, and that segmentation has the potential to permit new large scale measurements of flow motion
Markerless deformation capture of hoverfly wings using multiple calibrated cameras
This thesis introduces an algorithm for the automated deformation capture of hoverfly
wings from multiple camera image sequences. The algorithm is capable of extracting
dense surface measurements, without the aid of fiducial markers, over an arbitrary number
of wingbeats of hovering flight and requires limited manual initialisation. A novel motion
prediction method, called the ânormalised stroke modelâ, makes use of the similarity of adjacent
wing strokes to predict wing keypoint locations, which are then iteratively refined in
a stereo image registration procedure. Outlier removal, wing fitting and further refinement
using independently reconstructed boundary points complete the algorithm. It was tested
on two hovering data sets, as well as a challenging flight manoeuvre. By comparing the
3-d positions of keypoints extracted from these surfaces with those resulting from manual
identification, the accuracy of the algorithm is shown to approach that of a fully manual
approach. In particular, half of the algorithm-extracted keypoints were within 0.17mm of
manually identified keypoints, approximately equal to the error of the manual identification
process. This algorithm is unique among purely image based flapping flight studies in the
level of automation it achieves, and its generality would make it applicable to wing tracking
of other insects
Disparate View Matching
Matching of disparate views has gained significance in computer vision due to its role in many novel application areas. Being able to match images of the same scene captured during day and night, between a historic and contemporary picture of a scene, and between aerial and ground-level views of a building facade all enable novel applications ranging from loop-closure detection for structure-from-motion and re-photography to geo-localization of a street-level image using reference imagery captured from the air. The goal of this work is to develop novel features and methods that address matching problems where direct appearance-based correspondences are either difficult to obtain or infeasible because of the lack of appearance similarity altogether. To address these problems, we propose methods that span the appearance-geometry spectrum in terms of both the use of these cues as well as the ability of each method to handle variations in appearance and geometry. First, we consider the problem of geo-localization of a query street-level image using a reference database of building facades captured from a bird\u27s eye view. To address this wide-baseline facade matching problem, a novel scale-selective self-similarity feature that avoids direct comparison of appearance between disparate facade images is presented. Next, to address image matching problems with more extreme appearance variation, a novel representation for matchable images expressed in terms of the eigen-functions of the joint graph of the two images is presented. This representation is used to derive features that are persistent across wide variations in appearance. Next, the problem setting of matching between a street-level image and a digital elevation map (DEM) is considered. Given the limited appearance information available in this scenario, the matching approach has to rely more significantly on geometric cues. Therefore, a purely geometric method to establish correspondences between building corners in the DEM and the visible corners in the query image is presented. Finally, to generalize this problem setting we address the problem of establishing correspondences between 3D and 2D point clouds using geometric means alone. A novel framework for incorporating purely geometric constraints into a higher-order graph matching framework is presented with specific formulations for the three-point calibrated absolute camera pose problem (P3P), two-point upright camera pose problem (Up2p) and the three-plus-one relative camera pose problem
Automatic Spatiotemporal Analysis of Cardiac Image Series
RĂSUMĂ
Ă ce jour, les maladies cardiovasculaires demeurent au premier rang des principales causes de
dĂ©cĂšs en AmĂ©rique du Nord. Chez lâadulte et au sein de populations de plus en plus jeunes,
la soi-disant Ă©pidĂ©mie dâobĂ©sitĂ© entraĂźnĂ©e par certaines habitudes de vie tels que la mauvaise
alimentation, le manque dâexercice et le tabagisme est lourde de consĂ©quences pour les personnes
affectées, mais aussi sur le systÚme de santé. La principale cause de morbidité et de
mortalitĂ© chez ces patients est lâathĂ©rosclĂ©rose, une accumulation de plaque Ă lâintĂ©rieur des
vaisseaux sanguins à hautes pressions telles que les artÚres coronaires. Les lésions athérosclérotiques
peuvent entraĂźner lâischĂ©mie en bloquant la circulation sanguine et/ou en provoquant
une thrombose. Cela mĂšne souvent Ă de graves consĂ©quences telles quâun infarctus. Outre les
problÚmes liés à la sténose, les parois artérielles des régions criblées de plaque augmentent la
rigidité des parois vasculaires, ce qui peut aggraver la condition du patient. Dans la population
pédiatrique, la pathologie cardiovasculaire acquise la plus fréquente est la maladie de
Kawasaki. Il sâagit dâune vasculite aigĂŒe pouvant affecter lâintĂ©gritĂ© structurale des parois des
artĂšres coronaires et mener Ă la formation dâanĂ©vrismes. Dans certains cas, ceux-ci entravent
lâhĂ©modynamie artĂ©rielle en engendrant une perfusion myocardique insuffisante et en activant
la formation de thromboses.
Le diagnostic de ces deux maladies coronariennes sont traditionnellement effectuĂ©s Ă lâaide
dâangiographies par fluoroscopie. Pendant ces examens paracliniques, plusieurs centaines de
projections radiographiques sont acquises en sĂ©ries suite Ă lâinfusion artĂ©rielle dâun agent de
contraste. Ces images révÚlent la lumiÚre des vaisseaux sanguins et la présence de lésions
potentiellement pathologiques, sâil y a lieu. Parce que les sĂ©ries acquises contiennent de lâinformation
trĂšs dynamique en termes de mouvement du patient volontaire et involontaire (ex.
battements cardiaques, respiration et dĂ©placement dâorganes), le clinicien base gĂ©nĂ©ralement
son interprĂ©tation sur une seule image angiographique oĂč des mesures gĂ©omĂ©triques sont effectuĂ©es
manuellement ou semi-automatiquement par un technicien en radiologie. Bien que
lâangiographie par fluoroscopie soit frĂ©quemment utilisĂ© partout dans le monde et souvent
considĂ©rĂ© comme lâoutil de diagnostic âgold-standardâ pour de nombreuses maladies vasculaires,
la nature bidimensionnelle de cette modalitĂ© dâimagerie est malheureusement trĂšs
limitante en termes de spécification géométrique des différentes régions pathologiques. En effet,
la structure tridimensionnelle des stĂ©noses et des anĂ©vrismes ne peut pas ĂȘtre pleinement
appréciée en 2D car les caractéristiques observées varient selon la configuration angulaire de
lâimageur. De plus, la prĂ©sence de lĂ©sions affectant les artĂšres coronaires peut ne pas reflĂ©ter
la véritable santé du myocarde, car des mécanismes compensatoires naturels (ex. vaisseaux----------ABSTRACT
Cardiovascular disease continues to be the leading cause of death in North America. In adult
and, alarmingly, ever younger populations, the so-called obesity epidemic largely driven by
lifestyle factors that include poor diet, lack of exercise and smoking, incurs enormous stresses
on the healthcare system. The primary cause of serious morbidity and mortality for these
patients is atherosclerosis, the build up of plaque inside high pressure vessels like the coronary
arteries. These lesions can lead to ischemic disease and may progress to precarious blood
flow blockage or thrombosis, often with infarction or other severe consequences. Besides
the stenosis-related outcomes, the arterial walls of plaque-ridden regions manifest increased
stiffness, which may exacerbate negative patient prognosis. In pediatric populations, the
most prevalent acquired cardiovascular pathology is Kawasaki disease. This acute vasculitis
may affect the structural integrity of coronary artery walls and progress to aneurysmal lesions.
These can hinder the blood flowâs hemodynamics, leading to inadequate downstream
perfusion, and may activate thrombus formation which may lead to precarious prognosis.
Diagnosing these two prominent coronary artery diseases is traditionally performed using
fluoroscopic angiography. Several hundred serial x-ray projections are acquired during selective
arterial infusion of a radiodense contrast agent, which reveals the vesselsâ luminal
area and possible pathological lesions. The acquired series contain highly dynamic information
on voluntary and involuntary patient movement: respiration, organ displacement and
heartbeat, for example. Current clinical analysis is largely limited to a single angiographic
image where geometrical measures will be performed manually or semi-automatically by a
radiological technician. Although widely used around the world and generally considered
the gold-standard diagnosis tool for many vascular diseases, the two-dimensional nature of
this imaging modality is limiting in terms of specifying the geometry of various pathological
regions. Indeed, the 3D structures of stenotic or aneurysmal lesions may not be fully appreciated
in 2D because their observable features are dependent on the angular configuration of
the imaging gantry. Furthermore, the presence of lesions in the coronary arteries may not
reflect the true health of the myocardium, as natural compensatory mechanisms may obviate
the need for further intervention. In light of this, cardiac magnetic resonance perfusion
imaging is increasingly gaining attention and clinical implementation, as it offers a direct
assessment of myocardial tissue viability following infarction or suspected coronary artery
disease. This type of modality is plagued, however, by motion similar to that present in fluoroscopic
imaging. This issue predisposes clinicians to laborious manual intervention in order
to align anatomical structures in sequential perfusion frames, thus hindering automation o
Automatic Spatiotemporal Analysis of Cardiac Image Series
RĂSUMĂ
Ă ce jour, les maladies cardiovasculaires demeurent au premier rang des principales causes de
dĂ©cĂšs en AmĂ©rique du Nord. Chez lâadulte et au sein de populations de plus en plus jeunes,
la soi-disant Ă©pidĂ©mie dâobĂ©sitĂ© entraĂźnĂ©e par certaines habitudes de vie tels que la mauvaise
alimentation, le manque dâexercice et le tabagisme est lourde de consĂ©quences pour les personnes
affectées, mais aussi sur le systÚme de santé. La principale cause de morbidité et de
mortalitĂ© chez ces patients est lâathĂ©rosclĂ©rose, une accumulation de plaque Ă lâintĂ©rieur des
vaisseaux sanguins à hautes pressions telles que les artÚres coronaires. Les lésions athérosclérotiques
peuvent entraĂźner lâischĂ©mie en bloquant la circulation sanguine et/ou en provoquant
une thrombose. Cela mĂšne souvent Ă de graves consĂ©quences telles quâun infarctus. Outre les
problÚmes liés à la sténose, les parois artérielles des régions criblées de plaque augmentent la
rigidité des parois vasculaires, ce qui peut aggraver la condition du patient. Dans la population
pédiatrique, la pathologie cardiovasculaire acquise la plus fréquente est la maladie de
Kawasaki. Il sâagit dâune vasculite aigĂŒe pouvant affecter lâintĂ©gritĂ© structurale des parois des
artĂšres coronaires et mener Ă la formation dâanĂ©vrismes. Dans certains cas, ceux-ci entravent
lâhĂ©modynamie artĂ©rielle en engendrant une perfusion myocardique insuffisante et en activant
la formation de thromboses.
Le diagnostic de ces deux maladies coronariennes sont traditionnellement effectuĂ©s Ă lâaide
dâangiographies par fluoroscopie. Pendant ces examens paracliniques, plusieurs centaines de
projections radiographiques sont acquises en sĂ©ries suite Ă lâinfusion artĂ©rielle dâun agent de
contraste. Ces images révÚlent la lumiÚre des vaisseaux sanguins et la présence de lésions
potentiellement pathologiques, sâil y a lieu. Parce que les sĂ©ries acquises contiennent de lâinformation
trĂšs dynamique en termes de mouvement du patient volontaire et involontaire (ex.
battements cardiaques, respiration et dĂ©placement dâorganes), le clinicien base gĂ©nĂ©ralement
son interprĂ©tation sur une seule image angiographique oĂč des mesures gĂ©omĂ©triques sont effectuĂ©es
manuellement ou semi-automatiquement par un technicien en radiologie. Bien que
lâangiographie par fluoroscopie soit frĂ©quemment utilisĂ© partout dans le monde et souvent
considĂ©rĂ© comme lâoutil de diagnostic âgold-standardâ pour de nombreuses maladies vasculaires,
la nature bidimensionnelle de cette modalitĂ© dâimagerie est malheureusement trĂšs
limitante en termes de spécification géométrique des différentes régions pathologiques. En effet,
la structure tridimensionnelle des stĂ©noses et des anĂ©vrismes ne peut pas ĂȘtre pleinement
appréciée en 2D car les caractéristiques observées varient selon la configuration angulaire de
lâimageur. De plus, la prĂ©sence de lĂ©sions affectant les artĂšres coronaires peut ne pas reflĂ©ter
la véritable santé du myocarde, car des mécanismes compensatoires naturels (ex. vaisseaux----------ABSTRACT
Cardiovascular disease continues to be the leading cause of death in North America. In adult
and, alarmingly, ever younger populations, the so-called obesity epidemic largely driven by
lifestyle factors that include poor diet, lack of exercise and smoking, incurs enormous stresses
on the healthcare system. The primary cause of serious morbidity and mortality for these
patients is atherosclerosis, the build up of plaque inside high pressure vessels like the coronary
arteries. These lesions can lead to ischemic disease and may progress to precarious blood
flow blockage or thrombosis, often with infarction or other severe consequences. Besides
the stenosis-related outcomes, the arterial walls of plaque-ridden regions manifest increased
stiffness, which may exacerbate negative patient prognosis. In pediatric populations, the
most prevalent acquired cardiovascular pathology is Kawasaki disease. This acute vasculitis
may affect the structural integrity of coronary artery walls and progress to aneurysmal lesions.
These can hinder the blood flowâs hemodynamics, leading to inadequate downstream
perfusion, and may activate thrombus formation which may lead to precarious prognosis.
Diagnosing these two prominent coronary artery diseases is traditionally performed using
fluoroscopic angiography. Several hundred serial x-ray projections are acquired during selective
arterial infusion of a radiodense contrast agent, which reveals the vesselsâ luminal
area and possible pathological lesions. The acquired series contain highly dynamic information
on voluntary and involuntary patient movement: respiration, organ displacement and
heartbeat, for example. Current clinical analysis is largely limited to a single angiographic
image where geometrical measures will be performed manually or semi-automatically by a
radiological technician. Although widely used around the world and generally considered
the gold-standard diagnosis tool for many vascular diseases, the two-dimensional nature of
this imaging modality is limiting in terms of specifying the geometry of various pathological
regions. Indeed, the 3D structures of stenotic or aneurysmal lesions may not be fully appreciated
in 2D because their observable features are dependent on the angular configuration of
the imaging gantry. Furthermore, the presence of lesions in the coronary arteries may not
reflect the true health of the myocardium, as natural compensatory mechanisms may obviate
the need for further intervention. In light of this, cardiac magnetic resonance perfusion
imaging is increasingly gaining attention and clinical implementation, as it offers a direct
assessment of myocardial tissue viability following infarction or suspected coronary artery
disease. This type of modality is plagued, however, by motion similar to that present in fluoroscopic
imaging. This issue predisposes clinicians to laborious manual intervention in order
to align anatomical structures in sequential perfusion frames, thus hindering automation o
On-board three-dimensional object tracking: Software and hardware solutions
We describe a real time system for recognition and tracking 3D objects such as UAVs, airplanes, fighters with the optical sensor. Given a 2D image, the system has to perform background subtraction, recognize relative rotation, scale and translation of the object to sustain a prescribed topology of the fleet. In the thesis a comparative study of different algorithms and performance evaluation is carried out based on time and accuracy constraints. For background subtraction task we evaluate frame differencing, approximate median filter, mixture of Gaussians and propose classification based on neural network methods. For object detection we analyze the performance of invariant moments, scale invariant feature transform and affine scale invariant feature transform methods. Various tracking algorithms such as mean shift with variable and a fixed sized windows, scale invariant feature transform, Harris and fast full search based on fast fourier transform algorithms are evaluated. We develop an algorithm for the relative rotations and the scale change calculation based on Zernike moments. Based on the design criteria the selection is made for on-board implementation. The candidate techniques have been implemented on the Texas Instrument TMS320DM642 EVM board. It is shown in the thesis that 14 frames per second can be processed; that supports the real time implementation of the tracking system under reasonable accuracy limits
Spatiotemporal visual analysis of human actions
In this dissertation we propose four methods for the recognition of human activities. In all four of
them, the representation of the activities is based on spatiotemporal features that are automatically
detected at areas where there is a significant amount of independent motion, that is, motion that is
due to ongoing activities in the scene. We propose the use of spatiotemporal salient points as features
throughout this dissertation. The algorithms presented, however, can be used with any kind of features,
as long as the latter are well localized and have a well-defined area of support in space and time. We
introduce the utilized spatiotemporal salient points in the first method presented in this dissertation.
By extending previous work on spatial saliency, we measure the variations in the information content of
pixel neighborhoods both in space and time, and detect the points at the locations and scales for which
this information content is locally maximized. In this way, an activity is represented as a collection of
spatiotemporal salient points. We propose an iterative linear space-time warping technique in order
to align the representations in space and time and propose to use Relevance Vector Machines (RVM)
in order to classify each example into an action category. In the second method proposed in this
dissertation we propose to enhance the acquired representations of the first method. More specifically,
we propose to track each detected point in time, and create representations based on sets of trajectories,
where each trajectory expresses how the information engulfed by each salient point evolves over time.
In order to deal with imperfect localization of the detected points, we augment the observation model
of the tracker with background information, acquired using a fully automatic background estimation
algorithm. In this way, the tracker favors solutions that contain a large number of foreground pixels.
In addition, we perform experiments where the tracked templates are localized on specific parts of the
body, like the hands and the head, and we further augment the trackerâs observation model using a
human skin color model. Finally, we use a variant of the Longest Common Subsequence algorithm
(LCSS) in order to acquire a similarity measure between the resulting trajectory representations, and
RVMs for classification. In the third method that we propose, we assume that neighboring salient
points follow a similar motion. This is in contrast to the previous method, where each salient point was
tracked independently of its neighbors. More specifically, we propose to extract a novel set of visual
descriptors that are based on geometrical properties of three-dimensional piece-wise polynomials. The
latter are fitted on the spatiotemporal locations of salient points that fall within local spatiotemporal
neighborhoods, and are assumed to follow a similar motion. The extracted descriptors are invariant in
translation and scaling in space-time. Coupling the neighborhood dimensions to the scale at which the
corresponding spatiotemporal salient points are detected ensures the latter. The descriptors that are
extracted across the whole dataset are subsequently clustered in order to create a codebook, which is
used in order to represent the overall motion of the subjects within small temporal windows.Finally,we use boosting in order to select the most discriminative of these windows for each class, and RVMs for
classification. The fourth and last method addresses the joint problem of localization and recognition
of human activities depicted in unsegmented image sequences. Its main contribution is the use of an
implicit representation of the spatiotemporal shape of the activity, which relies on the spatiotemporal
localization of characteristic ensembles of spatiotemporal features. The latter are localized around
automatically detected salient points. Evidence for the spatiotemporal localization of the activity
is accumulated in a probabilistic spatiotemporal voting scheme. During training, we use boosting in
order to create codebooks of characteristic feature ensembles for each class. Subsequently, we construct
class-specific spatiotemporal models, which encode where in space and time each codeword ensemble
appears in the training set. During testing, each activated codeword ensemble casts probabilistic
votes concerning the spatiotemporal localization of the activity, according to the information stored
during training. We use a Mean Shift Mode estimation algorithm in order to extract the most probable
hypotheses from each resulting voting space. Each hypothesis corresponds to a spatiotemporal volume
which potentially engulfs the activity, and is verified by performing action category classification with
an RVM classifier
- âŠ