1,799 research outputs found

    Brain tumor segmentation with minimal user assistance

    Get PDF
    In this thesis, we propose a brain tumor segmentation system that requires only 4 clicks from users to specify a tight bounding box that completely contains the tumor. We convert the segmentation problem to an energy minimization problem. We utilize the basic energy function that combines intensity appearance and boundary smoothness. Global and local appearance models are experimented and compared in our work. The basic energy function does not assume any shape prior and thus leads to unrealistic shapes. We take the advantage of the fact that most of the tumors are approximately convex in shape and incorporate the star shape prior to prohibit unlikely segmentations. Another problem with the basic energy function is the undersegmentation problem. With the bounding box provided by the user, we are able to have a rough idea of the tumor size. Therefore, to encourage the segmentation to be a certain size, we add volumetric bias to our energy, which helps solve this problem. We also try to model the tumor as multi-region object where regions have distinct appearance. Specifically, we incorporate interior+exterior model for the tumor into our energy function. Our final result is promising in terms of f-measure. Our best performance for 88 volumes is 87% using volumetric ballooning

    Designing a Visual Front End in Audio-Visual Automatic Speech Recognition System

    Get PDF
    Audio-visual automatic speech recognition (AVASR) is a speech recognition technique integrating audio and video signals as input. Traditional audio-only speech recognition system only uses acoustic information from an audio source. However the recognition performance degrades significantly in acoustically noisy environments. It has been shown that visual information also can be used to identify speech. To improve the speech recognition performance, audio-visual automatic speech recognition has been studied. In this paper, we focus on the design of the visual front end of an AVASR system, which mainly consists of face detection and lip localization. The front end is built upon the AVICAR database that was recorded in moving vehicles. Therefore, diverse lighting conditions and poor quality of imagery are the problems we must overcome. We first propose the use of the Viola-Jones face detection algorithm that can process images rapidly with high detection accuracy. When the algorithm is applied to the AVICAR database, we reach an accuracy of 89% face detection rate. By separately detecting and integrating the detection results from all different color channels, we further improve the detection accuracy to 95%. To reliably localize the lips, three algorithms are studied and compared: the Gabor filter algorithm, the lip enhancement algorithm, and the modified Viola-Jones algorithm for lip features. Finally, to increase detection rate, a modified Viola-Jones algorithm and lip enhancement algorithms are cascaded based on the results of three lip localization methods. Overall, the front end achieves an accuracy of 90% for lip localization

    Design of an interactive computer system for microelectronic mask making

    Get PDF
    • …
    corecore