1,042 research outputs found
Endoscopic image analysis of aberrant crypt foci
Tese de Mestrado Integrado. Bioengenharia. Faculdade de Engenharia. Universidade do Porto. 201
ANALYSIS OF VOCAL FOLD KINEMATICS USING HIGH SPEED VIDEO
Vocal folds are the twin in-folding of the mucous membrane stretched horizontally across the larynx. They vibrate modulating the constant air flow initiated from the lungs. The pulsating pressure wave blowing through the glottis is thus the source for voiced speech production. Study of vocal fold dynamics during voicing are critical for the treatment of voice pathologies. Since the vocal folds move at 100 - 350 cycles per second, their visual inspection is currently done by strobosocopy which merges information from multiple cycles to present an apparent motion. High Speed Digital Laryngeal Imaging(HSDLI) with a temporal resolution of up to 10,000 frames per second has been established as better suited for assessing the vocal fold vibratory function through direct recording. But the widespread use of HSDLI is limited due to lack of consensus on the modalities like features to be examined. Development of the image processing techniques which circumvents the need for the tedious and time consuming effort of examining large volumes of recording has room for improvement. Fundamental questions like the required frame rate or resolution for the recordings is still not adequately answered. HSDLI cannot get the absolute physical measurement of the anatomical features and vocal fold displacement. This work addresses these challenges through improved signal processing. A vocal fold edge extraction technique with subpixel accuracy, suited even for hard to record pediatric population is developed first. The algorithm which is equally applicable for pediatric and adult subjects, is implemented to facilitate user inspection and intervention. Objective features describing the fold dynamics, which are extracted from the edge displacement waveform are proposed and analyzed on a diverse dataset of healthy males, females and children. The sampling and quantization noise present in the recordings are analyzed and methods to mitigate them are investigated. A customized Kalman smoothing and spline interpolation on the displacement waveform is found to improve the feature estimation stability. The relationship between frame rate, spatial resolution and vibration for efficient capturing of information is derived. Finally, to address the inability to measure physical measurement, a structured light projection calibrated with respect to the endoscope is prototyped
Détection du Contour Actif de Différentes Images
La segmentation d'image est le problème de partitionnement d'une image en différentes sous-régions sur la base d'une caractéristique préférée. La segmentation est un domaine de recherche important dans le traitement d'images et joue un rôle important dans la vision par ordinateur et la détection d'objets. Un grand nombre d'approches différentes ont été développées pour traiter le problème de segmentation, y compris les modèles de contour actif (MCA). L'idée fondamentale est d'élaborer une courbe initiale sous certaines contraintes d'une image donnée pour détecter les limites d'objet en minimisant une énergie. Dans ce cadre, cette thèse vise à développer des modèles traitant de problèmes de détection d'objets (segmentation d'images) caractérisés par inhomogénéité d'intensité et des limites bruyantes et mal définies. Pour faire face à ces défis, nous avons proposé un certain nombre de MCAs s'appuyant sur la méthode d’ensemble de niveaux. Le premier combine les informations de contour et de région sur la base d'une fonction de pondération adaptative. La deuxième approche utilise la force de pression signée locale basée-région dans une formulation d’ensemble de niveaux simple et efficace. Enfin, le troisième implémente une segmentation basée contour avec une nouvelle fonction d'arrêt basée sur le motif binaire local (LBP). Les résultats expérimentaux démontrent la grande précision de la segmentation obtenue sur diverses images en niveaux de gris synthétiques et réelles par rapport à l’état de l’art des MCA
Foetal echocardiographic segmentation
Congenital heart disease affects just under one percentage of all live births [1].
Those defects that manifest themselves as changes to the cardiac chamber volumes
are the motivation for the research presented in this thesis.
Blood volume measurements in vivo require delineation of the cardiac chambers and
manual tracing of foetal cardiac chambers is very time consuming and operator
dependent. This thesis presents a multi region based level set snake deformable
model applied in both 2D and 3D which can automatically adapt to some extent
towards ultrasound noise such as attenuation, speckle and partial occlusion artefacts.
The algorithm presented is named Mumford Shah Sarti Collision Detection (MSSCD).
The level set methods presented in this thesis have an optional shape prior term for
constraining the segmentation by a template registered to the image in the presence
of shadowing and heavy noise.
When applied to real data in the absence of the template the MSSCD algorithm is
initialised from seed primitives placed at the centre of each cardiac chamber. The
voxel statistics inside the chamber is determined before evolution. The MSSCD stops
at open boundaries between two chambers as the two approaching level set fronts
meet. This has significance when determining volumes for all cardiac compartments
since cardiac indices assume that each chamber is treated in isolation. Comparison
of the segmentation results from the implemented snakes including a previous level
set method in the foetal cardiac literature show that in both 2D and 3D on both real
and synthetic data, the MSSCD formulation is better suited to these types of data.
All the algorithms tested in this thesis are within 2mm error to manually traced
segmentation of the foetal cardiac datasets. This corresponds to less than 10% of
the length of a foetal heart. In addition to comparison with manual tracings all the
amorphous deformable model segmentations in this thesis are validated using a
physical phantom. The volume estimation of the phantom by the MSSCD
segmentation is to within 13% of the physically determined volume
Automatic Video Self Modeling for Voice Disorder
Video self modeling (VSM) is a behavioral intervention technique in which a learner models a target behavior by watching a video of him- or herself. In the field of speech language pathology, the approach of VSM has been successfully used for treatment of language in children with Autism and in individuals with fluency disorder of stuttering. Technical challenges remain in creating VSM contents that depict previously unseen behaviors. In this paper, we propose a novel system that synthesizes new video sequences for VSM treatment of patients with voice disorders. Starting with a video recording of a voice-disorder patient, the proposed system replaces the coarse speech with a clean, healthier speech that bears resemblance to the patient’s original voice. The replacement speech is synthesized using either a text-to-speech engine or selecting from a database of clean speeches based on a voice similarity metric. To realign the replacement speech with the original video, a novel audiovisual algorithm that combines audio segmentation with lip-state detection is proposed to identify corresponding time markers in the audio and video tracks. Lip synchronization is then accomplished by using an adaptive video re-sampling scheme that minimizes the amount of motion jitter and preserves the spatial sharpness. Results of both objective measurements and subjective evaluations on a dataset with 31 subjects demonstrate the effectiveness of the proposed techniques
Image based real-time ice load prediction tool for ship and offshore platform in managed ice field
The increased activities in arctic water warrant modelling of ice properties and ice-structure
interaction forces to ensure safe operations of ships and offshore platforms. Several established
analytical and numerical ice force estimation models can be found in the literature. Recently,
researchers have been working on Machine Learning (ML) based, data-driven force predictors
trained on experimental data and field measurement. Application of both traditional and ML-based
image processing for extracting information from ice floe images has also been reported in recent
literature; because extraction of ice features from real-time videos and images can significantly
improve ice force prediction.
However, there exists room for improvement in those studies. For example, accurate extraction of
ice floe information is still challenging because of their complex and varied shapes, colour
similarities and reflection of light on them. Besides, real ice floes are often found in groups with
overlapped and/or connected boundaries, making detecting even more challenging due to weaker
edges in such situations. The development of an efficient coupled model, which will extract
information from the ice floe images and train a force predictor based on the extracted dataset, is
still an open problem.
This research presents two Hybrid force prediction models. Instead of using analytical or
numerical approaches, the Hybrid models directly extract floe characteristics from the images and
later train ML-based force predictors using those extracted floe parameters. The first model
extracted ice features from images using traditional image processing techniques and then used
SVM and FFNN to develop two separate force predictors. The improved ice image processing
technique used here can extract useful ice properties from a closely connected, unevenly
illuminated floe field with various floe sizes and shapes. The second model extracted ice features
from images using RCNN and then trained two separate force predictors using SVM and FFNN,
similar to the first model.
The dataset for training SVM and FFNN force predictors involved variables extracted from the
image (floe number, density, sizes, etc.) and variables taken from the experimental analysis results
(ship speed, floe thickness, force etc.). The performance of both Hybrid models in terms of image
segmentation and force prediction, are analyzed and compared to establish their validity and
applicability.
Nevertheless, there exists room for further development of the proposed Hybrid models. For
example, extend the current models to include more data and investigate other machine learning
and deep learning-based network architectures to predict the ice force directly from the image as
an input
Computer Vision Techniques for Transcatheter Intervention
Minimally invasive transcatheter technologies have demonstrated substantial promise for the diagnosis and treatment of cardiovascular diseases. For example, TAVI is an alternative to AVR for the treatment of severe aortic stenosis and TAFA is widely used for the treatment and cure of atrial fibrillation. In addition, catheter-based IVUS and OCT imaging of coronary arteries provides important information about the coronary lumen, wall and plaque characteristics. Qualitative and quantitative analysis of these cross-sectional image data will be beneficial for the evaluation and treatment of coronary artery diseases such as atherosclerosis. In all the phases (preoperative, intraoperative, and postoperative) during the transcatheter intervention procedure, computer vision techniques (e.g., image segmentation, motion tracking) have been largely applied in the field to accomplish tasks like annulus measurement, valve selection, catheter placement control, and vessel centerline extraction. This provides beneficial guidance for the clinicians in surgical planning, disease diagnosis, and treatment assessment. In this paper, we present a systematical review on these state-of-the-art methods.We aim to give a comprehensive overview for researchers in the area of computer vision on the subject of transcatheter intervention. Research in medical computing is multi-disciplinary due to its nature, and hence it is important to understand the application domain, clinical background, and imaging modality so that methods and quantitative measurements derived from analyzing the imaging data are appropriate and meaningful. We thus provide an overview on background information of transcatheter intervention procedures, as well as a review of the computer vision techniques and methodologies applied in this area
Liver segmentation using 3D CT scans.
Master of Science in Computer Science. University of KwaZulu-Natal, Durban, 2018.Abstract available in PDF file
Object Segmentation Using Active Contours: A Level Set Approach
Image segmentation is responsible for partitioning an image into sub-regions based on a preferred feature. Active contour models have widely been used for image segmentation. The use of level set theory has enriched the implementation of active contours with more flexibility and simplicity. The past models of active contours rely on a gradient based stopping function to stop the curve evolution. However, when using gradient information for noisy and textured images, the evolving curve may pass through, or stop far from the salient object boundaries. We propose using a polarity based stopping function. Comparing to the gradient information, the polarity information accurately distinguishes the boundaries or edges of the salient objects more precisely. With combining the polarity information with the active contour model, we obtain an efficient active contour model for salient object detection. Experiments are performed on several images to show the advantage of the polarity based active contour
- …