1,042 research outputs found

    Real-time systems for moving objects detection and tracking using pixel difference method.

    Get PDF

    Endoscopic image analysis of aberrant crypt foci

    Get PDF
    Tese de Mestrado Integrado. Bioengenharia. Faculdade de Engenharia. Universidade do Porto. 201

    ANALYSIS OF VOCAL FOLD KINEMATICS USING HIGH SPEED VIDEO

    Get PDF
    Vocal folds are the twin in-folding of the mucous membrane stretched horizontally across the larynx. They vibrate modulating the constant air flow initiated from the lungs. The pulsating pressure wave blowing through the glottis is thus the source for voiced speech production. Study of vocal fold dynamics during voicing are critical for the treatment of voice pathologies. Since the vocal folds move at 100 - 350 cycles per second, their visual inspection is currently done by strobosocopy which merges information from multiple cycles to present an apparent motion. High Speed Digital Laryngeal Imaging(HSDLI) with a temporal resolution of up to 10,000 frames per second has been established as better suited for assessing the vocal fold vibratory function through direct recording. But the widespread use of HSDLI is limited due to lack of consensus on the modalities like features to be examined. Development of the image processing techniques which circumvents the need for the tedious and time consuming effort of examining large volumes of recording has room for improvement. Fundamental questions like the required frame rate or resolution for the recordings is still not adequately answered. HSDLI cannot get the absolute physical measurement of the anatomical features and vocal fold displacement. This work addresses these challenges through improved signal processing. A vocal fold edge extraction technique with subpixel accuracy, suited even for hard to record pediatric population is developed first. The algorithm which is equally applicable for pediatric and adult subjects, is implemented to facilitate user inspection and intervention. Objective features describing the fold dynamics, which are extracted from the edge displacement waveform are proposed and analyzed on a diverse dataset of healthy males, females and children. The sampling and quantization noise present in the recordings are analyzed and methods to mitigate them are investigated. A customized Kalman smoothing and spline interpolation on the displacement waveform is found to improve the feature estimation stability. The relationship between frame rate, spatial resolution and vibration for efficient capturing of information is derived. Finally, to address the inability to measure physical measurement, a structured light projection calibrated with respect to the endoscope is prototyped

    Détection du Contour Actif de Différentes Images

    Get PDF
    La segmentation d'image est le problème de partitionnement d'une image en différentes sous-régions sur la base d'une caractéristique préférée. La segmentation est un domaine de recherche important dans le traitement d'images et joue un rôle important dans la vision par ordinateur et la détection d'objets. Un grand nombre d'approches différentes ont été développées pour traiter le problème de segmentation, y compris les modèles de contour actif (MCA). L'idée fondamentale est d'élaborer une courbe initiale sous certaines contraintes d'une image donnée pour détecter les limites d'objet en minimisant une énergie. Dans ce cadre, cette thèse vise à développer des modèles traitant de problèmes de détection d'objets (segmentation d'images) caractérisés par inhomogénéité d'intensité et des limites bruyantes et mal définies. Pour faire face à ces défis, nous avons proposé un certain nombre de MCAs s'appuyant sur la méthode d’ensemble de niveaux. Le premier combine les informations de contour et de région sur la base d'une fonction de pondération adaptative. La deuxième approche utilise la force de pression signée locale basée-région dans une formulation d’ensemble de niveaux simple et efficace. Enfin, le troisième implémente une segmentation basée contour avec une nouvelle fonction d'arrêt basée sur le motif binaire local (LBP). Les résultats expérimentaux démontrent la grande précision de la segmentation obtenue sur diverses images en niveaux de gris synthétiques et réelles par rapport à l’état de l’art des MCA

    Foetal echocardiographic segmentation

    Get PDF
    Congenital heart disease affects just under one percentage of all live births [1]. Those defects that manifest themselves as changes to the cardiac chamber volumes are the motivation for the research presented in this thesis. Blood volume measurements in vivo require delineation of the cardiac chambers and manual tracing of foetal cardiac chambers is very time consuming and operator dependent. This thesis presents a multi region based level set snake deformable model applied in both 2D and 3D which can automatically adapt to some extent towards ultrasound noise such as attenuation, speckle and partial occlusion artefacts. The algorithm presented is named Mumford Shah Sarti Collision Detection (MSSCD). The level set methods presented in this thesis have an optional shape prior term for constraining the segmentation by a template registered to the image in the presence of shadowing and heavy noise. When applied to real data in the absence of the template the MSSCD algorithm is initialised from seed primitives placed at the centre of each cardiac chamber. The voxel statistics inside the chamber is determined before evolution. The MSSCD stops at open boundaries between two chambers as the two approaching level set fronts meet. This has significance when determining volumes for all cardiac compartments since cardiac indices assume that each chamber is treated in isolation. Comparison of the segmentation results from the implemented snakes including a previous level set method in the foetal cardiac literature show that in both 2D and 3D on both real and synthetic data, the MSSCD formulation is better suited to these types of data. All the algorithms tested in this thesis are within 2mm error to manually traced segmentation of the foetal cardiac datasets. This corresponds to less than 10% of the length of a foetal heart. In addition to comparison with manual tracings all the amorphous deformable model segmentations in this thesis are validated using a physical phantom. The volume estimation of the phantom by the MSSCD segmentation is to within 13% of the physically determined volume

    Automatic Video Self Modeling for Voice Disorder

    Get PDF
    Video self modeling (VSM) is a behavioral intervention technique in which a learner models a target behavior by watching a video of him- or herself. In the field of speech language pathology, the approach of VSM has been successfully used for treatment of language in children with Autism and in individuals with fluency disorder of stuttering. Technical challenges remain in creating VSM contents that depict previously unseen behaviors. In this paper, we propose a novel system that synthesizes new video sequences for VSM treatment of patients with voice disorders. Starting with a video recording of a voice-disorder patient, the proposed system replaces the coarse speech with a clean, healthier speech that bears resemblance to the patient’s original voice. The replacement speech is synthesized using either a text-to-speech engine or selecting from a database of clean speeches based on a voice similarity metric. To realign the replacement speech with the original video, a novel audiovisual algorithm that combines audio segmentation with lip-state detection is proposed to identify corresponding time markers in the audio and video tracks. Lip synchronization is then accomplished by using an adaptive video re-sampling scheme that minimizes the amount of motion jitter and preserves the spatial sharpness. Results of both objective measurements and subjective evaluations on a dataset with 31 subjects demonstrate the effectiveness of the proposed techniques

    Image based real-time ice load prediction tool for ship and offshore platform in managed ice field

    Get PDF
    The increased activities in arctic water warrant modelling of ice properties and ice-structure interaction forces to ensure safe operations of ships and offshore platforms. Several established analytical and numerical ice force estimation models can be found in the literature. Recently, researchers have been working on Machine Learning (ML) based, data-driven force predictors trained on experimental data and field measurement. Application of both traditional and ML-based image processing for extracting information from ice floe images has also been reported in recent literature; because extraction of ice features from real-time videos and images can significantly improve ice force prediction. However, there exists room for improvement in those studies. For example, accurate extraction of ice floe information is still challenging because of their complex and varied shapes, colour similarities and reflection of light on them. Besides, real ice floes are often found in groups with overlapped and/or connected boundaries, making detecting even more challenging due to weaker edges in such situations. The development of an efficient coupled model, which will extract information from the ice floe images and train a force predictor based on the extracted dataset, is still an open problem. This research presents two Hybrid force prediction models. Instead of using analytical or numerical approaches, the Hybrid models directly extract floe characteristics from the images and later train ML-based force predictors using those extracted floe parameters. The first model extracted ice features from images using traditional image processing techniques and then used SVM and FFNN to develop two separate force predictors. The improved ice image processing technique used here can extract useful ice properties from a closely connected, unevenly illuminated floe field with various floe sizes and shapes. The second model extracted ice features from images using RCNN and then trained two separate force predictors using SVM and FFNN, similar to the first model. The dataset for training SVM and FFNN force predictors involved variables extracted from the image (floe number, density, sizes, etc.) and variables taken from the experimental analysis results (ship speed, floe thickness, force etc.). The performance of both Hybrid models in terms of image segmentation and force prediction, are analyzed and compared to establish their validity and applicability. Nevertheless, there exists room for further development of the proposed Hybrid models. For example, extend the current models to include more data and investigate other machine learning and deep learning-based network architectures to predict the ice force directly from the image as an input

    Computer Vision Techniques for Transcatheter Intervention

    Get PDF
    Minimally invasive transcatheter technologies have demonstrated substantial promise for the diagnosis and treatment of cardiovascular diseases. For example, TAVI is an alternative to AVR for the treatment of severe aortic stenosis and TAFA is widely used for the treatment and cure of atrial fibrillation. In addition, catheter-based IVUS and OCT imaging of coronary arteries provides important information about the coronary lumen, wall and plaque characteristics. Qualitative and quantitative analysis of these cross-sectional image data will be beneficial for the evaluation and treatment of coronary artery diseases such as atherosclerosis. In all the phases (preoperative, intraoperative, and postoperative) during the transcatheter intervention procedure, computer vision techniques (e.g., image segmentation, motion tracking) have been largely applied in the field to accomplish tasks like annulus measurement, valve selection, catheter placement control, and vessel centerline extraction. This provides beneficial guidance for the clinicians in surgical planning, disease diagnosis, and treatment assessment. In this paper, we present a systematical review on these state-of-the-art methods.We aim to give a comprehensive overview for researchers in the area of computer vision on the subject of transcatheter intervention. Research in medical computing is multi-disciplinary due to its nature, and hence it is important to understand the application domain, clinical background, and imaging modality so that methods and quantitative measurements derived from analyzing the imaging data are appropriate and meaningful. We thus provide an overview on background information of transcatheter intervention procedures, as well as a review of the computer vision techniques and methodologies applied in this area

    Liver segmentation using 3D CT scans.

    Get PDF
    Master of Science in Computer Science. University of KwaZulu-Natal, Durban, 2018.Abstract available in PDF file

    Object Segmentation Using Active Contours: A Level Set Approach

    Get PDF
    Image segmentation is responsible for partitioning an image into sub-regions based on a preferred feature. Active contour models have widely been used for image segmentation. The use of level set theory has enriched the implementation of active contours with more flexibility and simplicity. The past models of active contours rely on a gradient based stopping function to stop the curve evolution. However, when using gradient information for noisy and textured images, the evolving curve may pass through, or stop far from the salient object boundaries. We propose using a polarity based stopping function. Comparing to the gradient information, the polarity information accurately distinguishes the boundaries or edges of the salient objects more precisely. With combining the polarity information with the active contour model, we obtain an efficient active contour model for salient object detection. Experiments are performed on several images to show the advantage of the polarity based active contour
    • …
    corecore