37 research outputs found
Image enhancement in digital X-ray angiography
Anyone who does not look back to the beginning throughout a
course of action, does not look forward to the end. Hence it
necessarily follows that an intention which looks ahead, depends
on a recollection which looks back.
| Aurelius Augustinus, De civitate Dei, VII.7 (417 A.D.)
Chapter 1
Introduction and Summary
D
espite the development of imaging techniques based on alternative physical
phenomena, such as nuclear magnetic resonance, emission of single photons
( -radiation) by radio-pharmaceuticals and photon pairs by electron-positron
annihilations, re ection of ultrasonic waves, and the Doppler eect, X-ray based im-
age acquisition is still daily practice in medicine. Perhaps this can be attributed to
the fact that, contrary to many other phenomena, X-rays lend themselves naturally
for registration by means of materials and methods widely available at the time of
their discovery | a fact that gave X-ray based medical imaging an at least 50-year
head start over possible alternatives. Immediately after the preliminary communica-
tion on the discovery of the \new light" by R¨ ontgen [317], late December 1895, the
possible applications of X-rays were investigated intensively. In 1896 alone, almost
one 1,000 articles about the new phenomenon appeared in print (Glasser [119] lists
all of them). Although most of the basics of the diagnostic as well as the therapeutic
uses of X-rays had been worked out by the end of that year [289], research on im-
proved acquisition and reduction of potential risks for humans continued steadily in
the century to follow. The development of improved X-ray tubes, rapid lm changers,
image intensiers, the introduction of television cameras into uoroscopy, and com-
puters in digital radiography and computerized tomography, formed a succession of
achievements which increased the diagnostic potential of X-ray based imaging.
One of the areas in medical imaging where X-rays have always played an im-
portant role is angiography,y which concerns the visualization of blood vessels in the
human body. As already suggested, research on the possibility of visualization of the
human vasculature was initiated shortly after the discovery of X-rays. A photograph
of a rst \angiogram" | obtained by injection of a mixture of chalk, red mercury,
and petroleum into an amputated hand, followed by almost an hour of exposure to
X-rays | was published as early as January 1896, by Hascheck & Lindenthal [139].
Although studies on cadavers led to greatly improved knowledge of the anatomy of
the human vascular system, angiography in living man for the purpose of diagnosis
and intervention became feasible only after substantial progress in the development
yA term originating from the Greek words o (aggeion), meaning \vessel" or \bucket", and
-' (graphein), meaning \to write" or \to record". 2 1 Introduction and Summary
of relatively safe contrast media and methods of administration, as well as advance-
ments in radiological equipment. Of special interest in the context of this thesis is
the improvement brought by photographic subtraction, a technique known since the
early 1900s and since then used successfully in e.g. astronomy, but rst introduced
in X-ray angiography in 1934, by Ziedses des Plantes [425, 426]. This technique al-
lowed for a considerable enhancement of vessel visibility by cancellation of unwanted
background structures. In the 1960s, the time consuming lm subtraction process
was replaced by analog video subtraction techniques [156, 275] which, with the in-
troduction of digital computers, gave rise to the development of digital subtraction
angiography [194] | a technique still considered by many the \gold standard" for de-
tection and quantication of vascular anomalies. Today, research on improved X-ray
based imaging techniques for angiography continues, witness the recent developments
in three-dimensional rotational angiography [88, 185, 186, 341,373].
The subject of this thesis is enhancement of digital X-ray angiography images. In
contrast with the previously mentioned developments, the emphasis is not on the
further improvement of image acquisition techniques, but rather on the development
and evaluation of digital image processing techniques for retrospective enhancement
of images acquired with existing techniques. In the context of this thesis, the term
\enhancement" must be regarded in a rather broad sense. It does not only refer
to improvement of image quality by reduction of disturbing artifacts and noise, but
also to minimization of possible image quality degradation and loss of quantitative
information, inevitably introduced by required image processing operations. These
two aspects of image enhancement will be claried further in a brief summary of each
of the chapters of this thesis.
The rst three chapters deal with the problem of patient motion artifacts in digital
subtraction angiography (DSA). In DSA imaging, a sequence of 2D digital X-ray
projection images is acquired, at a rate of e.g. two per second, following the injection
of contrast material into one of the arteries or veins feeding the part of the vasculature
to be diagnosed. Acquisition usually starts about one or two seconds prior to arrival
of the contrast bolus in the vessels of interest, so that the rst few images included
in the sequence do not show opacied vessels. In a subsequent post-processing step,
one of these \pre-bolus" images is then subtracted automatically from each of the
contrast images so as to mask out background structures such as bone and soft-
tissue shadows. However, it is clear that in the resulting digital subtraction images,
the unwanted background structures will have been removed completely only when
the patient lied perfectly still during acquisition of the original images. Since most
patients show at least some physical reaction to the passage of a contrast medium, this
proviso is generally not met. As a result, DSA images frequently show patient-motion
induced artifacts (see e.g. the bottom-left image in Fig. 1.1), which may in uence the
subsequent analysis and diagnosis carried out by radiologists.
Since the introduction of DSA, in the early 1980s, many solutions to the problem
of patient motion artifacts have been put forward. Chapter 2 presents an overview
of the possible types of motion artifacts reported in the literature and the techniques
that have been proposed to avoid them. The main purpose of that chapter is to
review and discuss the techniques proposed over the past two decades to correct for 1 Introduction and Summary 3
Figure 1.1. Example of creation and reduction of patient motion artifacts in
cerebral DSA imaging. Top left: a \pre-bolus" or mask image acquired just prior
to the arrival of the contrast medium. Top right: one of the contrast or live images
showing opacied vessels. Bottom left: DSA image obtained after subtraction of
the mask from the contrast image, followed by contrast enhancement. Due to patient
motion, the background structures in the mask and contrast image were not perfectly
aligned, as a result of which the DSA image does not only show blood vessels, but
also additional undesired structures (in this example primarily in the bottom-left
part of the image). Bottom right: DSA image resulting from subtraction of the
mask and contast image after application of the automatic registration algorithm
described in Chapter 3. 4 1 Introduction and Summary
patient motion artifacts retrospectively, by means of digital image processing. The
chapter addresses fundamental problems, such as whether it is possible to construct
a 2D geometrical transformation that exactly describes the projective eects of an
originally 3D transformation, as well as practical problems, such as how to retrieve
the correspondence between mask and contrast images by using only the grey-level
information contained in the images, and how to align the images according to that
correspondence in a computationally ecient manner.
The review in Chapter 2 reveals that there exists quite some literature on the
topic of (semi-)automatic image alignment, or image registration, for the purpose of
motion artifact reduction in DSA images. However, to the best of our knowledge,
research in this area has never led to algorithms which are suciently fast and robust
to be acceptable for routine use in clinical practice. By drawing upon the suggestions
put forward in Chapter 2, a new approach to automatic registration of digital X-ray
angiography images is presented in Chapter 3. Apart from describing the functionality
of the components of the algorithm, special attention is paid to their computationally
optimal implementation. The results of preliminary experiments described in that
chapter indicate that the algorithm is eective, very fast, and outperforms alterna-
tive approaches, in terms of both image quality and required computation time. It is
concluded that the algorithm is most eective in cerebral and peripheral DSA imag-
ing. An example of the image quality enhancement obtained after application of the
algorithm in the case of a cerebral DSA image is provided in Fig 1.1.
Chapter 4 reports on a clinical evaluation of the automatic registration technique.
The evaluation involved 104 cerebral DSA images, which were corrected for patient
motion artifacts by the automatic technique, as well as by pixel shifting | a manual
correction technique currently used in clinical practice. The quality of the DSA images
resulting from the two techniques was assessed by four observers, who compared the
images both mutually and to the corresponding original images. The results of the
evaluation presented in Chapter 4 indicate that the dierence in performance between
the two correction techniques is statistically signicant. From the results of the mutual
comparisons it is concluded that, on average, the automatic registration technique
performs either comparably, better than, or even much better than manual pixel
shifting in 95% of all cases. In the other 5% of the cases, the remaining artifacts are
located near the borders of the image, which are generally diagnostically non-relevant.
In addition, the results show that the automatic technique implies a considerable
reduction of post-processing time compared to manual pixel shifting (on average, one
second versus 12 seconds per DSA image).
The last two chapters deal with somewhat dierent topics. Chapter 5 is concerned
with visualization and quantication of vascular anomalies in three-dimensional rota-
tional angiography (3DRA). Similar to DSA imaging, 3DRA involves the acquisition
of a sequence of 2D digital X-ray projection images, following a single injection of
contrast material. Contrary to DSA, however, this sequence is acquired during a 180
rotation of the C-arch on which the X-ray source and detector are mounted antipo-
dally, with the object of interest positioned in its iso-center. The rotation is completed
in about eight seconds and the resulting image sequence typically contains 100 images,
which form the input to a ltered back-projection algorithm for 3D reconstruction. In
contrast with most other 3D medical imaging techniques, 3DRA is capable of provid- 1 Introduction and Summary 5
Figure 1.2. Visualizations of a clinical 3DRA dataset, illustrating the qualitative
improvement obtained after noise reduction ltering. Left: volume rendering of
the original, raw image. Right: volume rendering of the image after application
of edge-enhancing anisotropic diusion ltering (see Chapter 5 for a description of
this technique). The visualizations were obtained by using the exact same settings
for the parameters of the volume rendering algorithm.
ing high-resolution isotropic datasets. However, due to the relatively high noise level
and the presence of other unwanted background variations caused by surrounding
tissue, the use of noise reduction techniques is inevitable in order to obtain smooth
visualizations of these datasets (see Fig. 1.2). Chapter 5 presents an inquiry into the
eects of several linear and nonlinear noise reduction techniques on the visualization
and subsequent quantication of vascular anomalies in 3DRA images. The evalua-
tion is focussed on frequently occurring anomalies such as a narrowing (or stenosis)
of the internal carotid artery or a circumscribed dilation (or aneurysm) of intracra-
nial arteries. Experiments on anthropomorphic vascular phantoms indicate that, of
the techniques considered, edge-enhancing anisotropic diusion ltering is most suit-
able, although the practical use of this technique may currently be limited due to its
memory and computation-time requirements.
Finally, Chapter 6 addresses the problem of interpolation of sampled data, which
occurs e.g. when applying geometrical transformations to digital medical images for
the purpose of registration or visualization. In most practical situations, interpola-
tion of a sampled image followed by resampling of the resulting continuous image
on a geometrically transformed grid, inevitably implies loss of grey-level information,
and hence image degradation, the amount of which is dependent on image content,
but also on the employed interpolation scheme (see Fig. 1.3). It follows that the
choice for a particular interpolation scheme is important, since it in uences the re-
sults of registrations and visualizations, and the outcome of subsequent quantitative
analyses which rely on grey-level information contained in transformed images. Al-
though many interpolation techniques have been developed over the past decades, 6 1 Introduction and Summary
Figure 1.3. Illustration of the fact that the loss of information due to interpola-
tion and resampling operations is dependent on the employed interpolation scheme.
Left: slice of a 3DRA image after rotation over 5:0, by using linear interpolation.
Middle: the same slice, after rotation by using cubic spline interpolation. Right:
the dierence between the two rotated images. Although it is not possible with
such a comparison to come to conclusions as to which of the two methods yields
the smallest loss of grey-level information, this example clearly illustrates the point
that dierent interpolation methods usually yield dierent results.
thorough quantitative evaluations and comparisons of these techniques for medical
image transformation problems are still lacking. Chapter 6 presents such a compar-
ative evaluation. The study is limited to convolution-based interpolation techniques,
as these are most frequently used for registration and visualization of medical image
data. Because of the ubiquitousness of interpolation in medical image processing and
analysis, the study is not restricted to XRA and 3DRA images, but also includes
datasets from many other modalities. It is concluded that for all modalities, spline
interpolation constitutes the best trade-o between accuracy and computational cost,
and therefore is to be preferred over all other methods.
In summary, this thesis is concerned with the improvement of image quality and the
reduction of image quality degradation and loss of quantitative information. The
subsequent chapters describe techniques for reduction of patient motion artifacts in
DSA images, noise reduction techniques for improved visualization and quantication
of vascular anomalies in 3DRA images, and interpolation techniques for the purpose
of accurate geometrical transformation of medical image data. The results and con-
clusions of the evaluations described in this thesis provide general guidelines for the
applicability and practical use of these techniques
Artifacts due to dental and maxillofacial restoration materials in cone beam computed tomography images
Over the last 20 years, three-dimensional X-ray imaging, cone beam computed tomography (CBCT), has become an important method when making the diagnoses in the dental and maxillofacial area. There has been rapid development in CBCT devices, and the image quality has improved considerably during the last two decades. Despite the many improvements in CBCT image quality, artifacts induced by dental and maxillofacial restoration materials are still a problem, especially when diagnosing the dental area. CBCT manufacturers produce artifact reduction algorithms, which are intended to decrease or remove the artifacts in the image. However, the results of the studies on artifact reduction algorithms vary and there is no final consensus, as yet, on their efficacy. The studies of the present thesis focus on the arti-facts induced by different dental restoration materials in CBCT images. Another aim was to compare how the different materials interfere with the radiologic diagnosis. The materials investigated were titanium, zirconia, composite, and fiber reinforced composite (FRC).
The results showed that composites with ra-dio-opacifying BaAlSiO2 20% (weight%) or more caused artifacts in the CBCT images. Composites with BaAlSiO2 68% (weight%) or more caused artifacts with similar intensity as titanium. Titanium orbital floor implant caused artifacts in the CBCT images, whereas nonmetallic fiber reinforced composite (FRC) orbital floor implant did not cause hampering artifacts in the CBCT images. The diagnosis of apical perio-dontitis can be complicated in 70% of the CBCT images of paranasal sinuses because of the artifacts induced by dental and endodontic restorations. In the CBCT images, zirconia dental implants caused in-tense artifacts despite the artifact reduction algorithm.
To conclude, different dental restoration materials cause image hampering artifacts of different intensities in CBCT images. Zirconia is especially problem-atic in CBCT images. More studies are needed on artifact reduction methods to achieve an image quality without artifacts to make the correct diagnosis. In addition, the consequences of restoration and implant material options should be considered in postoperative CBCT images
Medical Image Enhancement using Deep Learning and Tensor Factorization Techniques
La résolution spatiale des images acquises par tomographie volumique à faisceau conique (CBCT) est limitée par la géométrie des capteurs, leur sensibilité, les mouvements du patient, les techniques de reconstruction d'images et la limitation de la dose de rayonnement. Le modèle de dégradation d'image considéré dans cette thèse consiste en un opérateur de ou avec la fonction d'étalement du système d'imagerie (PSF), un opérateur de décimation, et du bruit, qui relient les volumes CBCT à une image 3D super-résolue à estimer. Les méthodes proposées dans cette thèse (SISR - single image super-résolution) ont comme objectif d'inverser ce modèle direct, c'est à dire d'estimer un volume haute résolution à partir d'une image CBCT. Les algorithmes ont été évalués dans le cadre d'une application dentaire, avec comme vérité terrain les images haute résolution acquises par micro CT (µCT), qui utilise des doses de rayonnement très importantes, incompatibles avec les applications cliniques. Nous avons proposé une approche de SISR par deep learning, appliquée individuellement à des coupes CBCT. Deux types de réseaux ont été évalués : U-net et subpixel. Les deux ont amélioré les volumes CBCT, avec un gain en PSNR de 21 à 22 dB et en coefficient de Dice pour la segmentation canalaire de 1 à 2.2 %. Le gain a été plus particulièrement important dans la partie apicale des dents, ce qui représente un résultat important étant donnée son importance pour les applications cliniques. Nous avons proposé des algorithmes de SISR basés sur la décomposition canonique polyadique des tenseurs. Le principal avantage de cette méthode, lié à l'utilisation de la théorie des tenseur, est d'utiliser la structure 3D des volumes CBCT. L'algorithme proposé regroupe plusieurs étapes: débruitage base sur la factorisation des tenseurs, déconvolution et super-résolution, avec un faible nombre d'hyperparamètres. Le temps d'exécution est très faible par rapport aux algorithmes existants (deux ordres de magnitude plus petit), pour des performances légèrement supérieures (gain de 1.2 à 1.5 dB en PSNR). La troisième contribution de la thèse est en lien avec la contribution 2 : l'algorithme de SISR basé sur la décomposition canonique polyadique des tenseurs est combiné avec une méthode d'estimation de la PSF, inconnues dans les applications pratiques. L'algorithme résultant effectue les deux tâche de manière alternée, et s'avère précis et rapide sur des données de simulation et expérimentales. La dernière contribution de la thèse a été d'évaluer l'intérêt d'un autre type de décomposition tensorielle, la décomposition de Tucker, dans le cadre d'un algorithme de SISR. Avant la déconvolution, le volume CBCT est débruité en tronquant sa décomposition de Tucker. Comparé à l'algorithme de la contribution 2, cette approche permet de diminuer encore plus le temps de calcul, d'un facteur 10, pour des performances similaires pour des SNR importants et légèrement supérieures pour de faibles SNR. Le lien entre cette méthode et les algorithmes 2D basés sur une SVD facilite le réglage des hyperparamètres comparé à la décomposition canonique polyadique.The resolution of dental cone beam computed tomography (CBCT) images is imited by detector geometry, sensitivity, patient movement, the reconstruction technique and the need to minimize radiation dose. The corresponding image degradation model assumes that the CBCT image is a blurred (with a point spread function, PSF), downsampled, noisy version of a high resolution image. The quality of the image is crucial for precise diagnosis and treatment planning. The methods proposed in this thesis aim to give a solution for the single image super-resolution (SISR) problem. The algorithms were evaluated on dental CBCT and corresponding highresolution (and high radiation-dose) µCT image pairs of extracted teeth. I have designed a deep learning framework for the SISR problem, applied to CBCT slices. I have tested the U-net and subpixel neural networks, which both improved the PSNR by 21-22 dB, and the Dice coe_cient of the canal segmentation by 1-2.2%, more significantly in the medically critical apical region. I have designed an algorithm for the 3D SISR problem, using the canonical polyadic decomposition of tensors. This implementation conserves the 3D structure of the volume, integrating the factorization-based denoising, deblurring with a known PSF, and upsampling of the image in a lightweight algorithm with a low number of parameters. It outperforms the state-of-the-art 3D reconstruction-based algorithms with two orders of magnitude faster run-time and provides similar PSNR (improvement of 1.2-1.5 dB) and segmentation metrics (Dice coe_cient increased on average to 0.89 and 0.90). Thesis II b: I have implemented a joint alternating recovery of the unknown PSF parameters and of the high-resolution 3D image using CPD-SISR. The algorithm was compared to a state-of-the-art 3D reconstruction-based algorithm, combined with the proposed alternating PSF-optimization. The two algorithms have shown similar improvement in PSNR, but CPD-SISR-blind converged roughly 40 times faster, under 6 minutes both in simulation and on experimental dental computed tomography data. I have proposed a solution for the 3D SISR problem using the Tucker decomposition (TD-SISR). The denoising step is realized _rst by TD in order to mitigate the ill-posedness of the subsequent deconvolution. Compared to CPDSISR the algorithm runs ten times faster. Depending on the amount of noise, higher PSNR (0.3 - 3.5 dB), SSI (0.58 - 2.43%) and segmentation values (Dice coefficient, 2% improvement) were measured. The parameters in TD-SISR are familiar from 2D SVD-based algorithms, so their tuning is easier compared to CPD-SISR
Efficient sampling strategies for x-ray micro computed tomography with an intensity-modulated beam
The term "cycloidal CT" refers to a family of efficient sampling strategies that can be applied to x-ray micro-computed tomography (CT) systems which operate with an intensity-modulated beam. Such a beam can be employed to provide access to a phase contrast channel and high spatial resolutions (a few um). Phase contrast can offer better image contrast of samples which have traditionally been "invisible” to x-rays due to their weak attenuation, and high resolutions help view crucial details in samples.
Cycloidal sampling strategies provide images more quickly than the gold standard in the field ("dithering”). I conceived and compared four practical implementation strategies for cycloidal CT, three of which are "flyscans” (the sample moves continuously). Flyscans acquire images of similar resolution to dithering with no overheads, reducing acquisition time to exposure time. I also developed a "knife-edge” position tracking method which tracks subpixel motions of the sample stage. This information can be used to facilitate, automate, and improve the reconstruction of cycloidal data. I analysed the effects of different levels of dose on the signal-to-noise ratio (SNR) of an image acquired with cycloidal CT. The results show that cycloidal images yield the same SNR as dithered images with less dose, although a more extensive study is required. Finally, I explored the potential of using cycloidal CT for intraoperative specimen imaging and tissue engineering. My results are encouraging for tissue engineering; for intraoperative imaging, the cycloidal images did not show comparable resolution to the dithered images, although that is possibly linked to issues with the dataset.
Overall, my work has provided a benchmark for the implementation and application of cycloidal CT for the first time. Besides a summary of my research, this thesis is meant to be a comprehensive guide for facilitating uptake of cycloidal CT within the scientific community and beyond
Development and application of an optogenetic platform for controlling and imaging a large number of individual neurons
The understanding and treatment of brain disorders as well as the development of intelligent machines is hampered by the lack of knowledge of how the brain fundamentally functions. Over the past century, we have learned much about how individual neurons and neural networks behave, however new tools are critically needed to interrogate how neural networks give rise to complex brain processes and disease conditions. Recent innovations in molecular techniques, such as optogenetics, have enabled neuroscientists unprecedented precision to excite, inhibit and record defined neurons. The impressive sensitivity of currently available optogenetic sensors and actuators has now enabled the possibility of analyzing a large number of individual neurons in the brains of behaving animals. To promote the use of these optogenetic tools, this thesis integrates cutting edge optogenetic molecular sensors which is ultrasensitive for imaging neuronal activity with custom wide field optical microscope to analyze a large number of individual neurons in living brains. Wide-field microscopy provides a large field of view and better spatial resolution approaching the Abbe diffraction limit of fluorescent microscope. To demonstrate the advantages of this optical platform, we imaged a deep brain structure, the Hippocampus, and tracked hundreds of neurons over time while mouse was performing a memory task to investigate how those individual neurons related to behavior. In addition, we tested our optical platform in investigating transient neural network changes upon mechanical perturbation related to blast injuries. In this experiment, all blasted mice show a consistent change in neural network. A small portion of neurons showed a sustained calcium increase for an extended period of time, whereas the majority lost their activities. Finally, using optogenetic silencer to control selective motor cortex neurons, we examined their contributions to the network pathology of basal ganglia related to Parkinson’s disease. We found that inhibition of motor cortex does not alter exaggerated beta oscillations in the striatum that are associated with parkinsonianism. Together, these results demonstrate the potential of developing integrated optogenetic system to advance our understanding of the principles underlying neural network computation, which would have broad applications from advancing artificial intelligence to disease diagnosis and treatment
Accurate 3D-reconstruction and -navigation for high-precision minimal-invasive interventions
The current lateral skull base surgery is largely invasive since it requires wide exposure and direct visualization of anatomical landmarks to avoid damaging critical structures. A multi-port approach aiming to reduce such invasiveness has been recently investigated. Thereby three canals are drilled from the skull surface to the surgical region of interest: the first canal for the instrument, the second for the endoscope, and the third for material removal or an additional instrument. The transition to minimal invasive approaches in the lateral skull base surgery requires sub-millimeter accuracy and high outcome predictability, which results in high requirements for the image acquisition as well as for the navigation.
Computed tomography (CT) is a non-invasive imaging technique allowing the visualization of the internal patient organs. Planning optimal drill channels based on patient-specific models requires high-accurate three-dimensional (3D) CT images. This thesis focuses on the reconstruction of high quality CT volumes. Therefore, two conventional imaging systems are investigated: spiral CT scanners and C-arm cone-beam CT (CBCT) systems. Spiral CT scanners acquire volumes with typically anisotropic resolution, i.e. the voxel spacing in the slice-selection-direction is larger than the in-the-plane spacing. A new super-resolution reconstruction approach is proposed to recover images with high isotropic resolution from two orthogonal low-resolution CT volumes.
C-arm CBCT systems offers CT-like 3D imaging capabilities while being appropriate for interventional suites. A main drawback of these systems is the commonly encountered CT artifacts due to several limitations in the imaging system, such as the mechanical inaccuracies. This thesis contributes new methods to enhance the CBCT reconstruction quality by addressing two main reconstruction artifacts: the misalignment artifacts caused by mechanical inaccuracies, and the metal-artifacts caused by the presence of metal objects in the scanned region.
CBCT scanners are appropriate for intra-operative image-guided navigation. For instance, they can be used to control the drill process based on intra-operatively acquired 2D fluoroscopic images. For a successful navigation, accurate estimate of C-arm pose relative to the patient anatomy and the associated surgical plan is required. A new algorithm has been developed to fulfill this task with high-precision. The performance of the introduced methods is demonstrated on simulated and real data
Image Registration Workshop Proceedings
Automatic image registration has often been considered as a preliminary step for higher-level processing, such as object recognition or data fusion. But with the unprecedented amounts of data which are being and will continue to be generated by newly developed sensors, the very topic of automatic image registration has become and important research topic. This workshop presents a collection of very high quality work which has been grouped in four main areas: (1) theoretical aspects of image registration; (2) applications to satellite imagery; (3) applications to medical imagery; and (4) image registration for computer vision research
Robust density modelling using the student's t-distribution for human action recognition
The extraction of human features from videos is often inaccurate and prone to outliers. Such outliers can severely affect density modelling when the Gaussian distribution is used as the model since it is highly sensitive to outliers. The Gaussian distribution is also often used as base component of graphical models for recognising human actions in the videos (hidden Markov model and others) and the presence of outliers can significantly affect the recognition accuracy. In contrast, the Student's t-distribution is more robust to outliers and can be exploited to improve the recognition rate in the presence of abnormal data. In this paper, we present an HMM which uses mixtures of t-distributions as observation probabilities and show how experiments over two well-known datasets (Weizmann, MuHAVi) reported a remarkable improvement in classification accuracy. © 2011 IEEE