1,141 research outputs found
Hardware acceleration using FPGAs for adaptive radiotherapy
Adaptive radiotherapy (ART) seeks to improve the accuracy of radiotherapy by adapting the treatment based on up-to-date images of the patient's anatomy captured at the time of treatment delivery. The amount of image data, combined with the clinical time requirements for ART, necessitates automatic image analysis to adapt the treatment plan. Currently, the computational effort of the image processing and plan adaptation means they cannot be completed in a clinically acceptable timeframe. This thesis aims to investigate the use of hardware acceleration on Field Programmable Gate Arrays (FPGAs) to accelerate algorithms for segmenting bony anatomy in Computed Tomography (CT) scans, to reduce the plan adaptation time for ART.
An assessment was made of the overhead incurred by transferring image data to an FPGA-based hardware accelerator using the industry-standard DICOM protocol over an Ethernet connection. The rate was found to be likely to limit the performanceof hardware
accelerators for ART, highlighting the need for an alternative method of integrating hardware accelerators with existing radiotherapy equipment.
A clinically-validated segmentation algorithm was adapted for implementation in hardware. This was shown to process three-dimensional CT images up to 13.81 times faster than the original software implementation. The segmentations produced by the
two implementations showed strong agreement. Modifications to the hardware implementation were proposed for segmenting fourdimensional CT scans. This was shown to process image volumes 14.96 times faster than the original software implementation, and the segmentations produced by the two
implementations showed strong agreement in most cases.A second, novel, method for segmenting four-dimensional CT data was also proposed.
The hardware implementation executed 1.95 times faster than the software implementation. However, the algorithm was found to be unsuitable for the global segmentation task examined here, although it may be suitable as a refining segmentation in the context of a larger ART algorithm.Adaptive radiotherapy (ART) seeks to improve the accuracy of radiotherapy by adapting the treatment based on up-to-date images of the patient's anatomy captured at the time of treatment delivery. The amount of image data, combined with the clinical time requirements for ART, necessitates automatic image analysis to adapt the treatment plan. Currently, the computational effort of the image processing and plan adaptation means they cannot be completed in a clinically acceptable timeframe. This thesis aims to investigate the use of hardware acceleration on Field Programmable Gate Arrays (FPGAs) to accelerate algorithms for segmenting bony anatomy in Computed Tomography (CT) scans, to reduce the plan adaptation time for ART.
An assessment was made of the overhead incurred by transferring image data to an FPGA-based hardware accelerator using the industry-standard DICOM protocol over an Ethernet connection. The rate was found to be likely to limit the performanceof hardware
accelerators for ART, highlighting the need for an alternative method of integrating hardware accelerators with existing radiotherapy equipment.
A clinically-validated segmentation algorithm was adapted for implementation in hardware. This was shown to process three-dimensional CT images up to 13.81 times faster than the original software implementation. The segmentations produced by the
two implementations showed strong agreement. Modifications to the hardware implementation were proposed for segmenting fourdimensional CT scans. This was shown to process image volumes 14.96 times faster than the original software implementation, and the segmentations produced by the two
implementations showed strong agreement in most cases.A second, novel, method for segmenting four-dimensional CT data was also proposed.
The hardware implementation executed 1.95 times faster than the software implementation. However, the algorithm was found to be unsuitable for the global segmentation task examined here, although it may be suitable as a refining segmentation in the context of a larger ART algorithm
Neural Network Methods for Radiation Detectors and Imaging
Recent advances in image data processing through machine learning and
especially deep neural networks (DNNs) allow for new optimization and
performance-enhancement schemes for radiation detectors and imaging hardware
through data-endowed artificial intelligence. We give an overview of data
generation at photon sources, deep learning-based methods for image processing
tasks, and hardware solutions for deep learning acceleration. Most existing
deep learning approaches are trained offline, typically using large amounts of
computational resources. However, once trained, DNNs can achieve fast inference
speeds and can be deployed to edge devices. A new trend is edge computing with
less energy consumption (hundreds of watts or less) and real-time analysis
potential. While popularly used for edge computing, electronic-based hardware
accelerators ranging from general purpose processors such as central processing
units (CPUs) to application-specific integrated circuits (ASICs) are constantly
reaching performance limits in latency, energy consumption, and other physical
constraints. These limits give rise to next-generation analog neuromorhpic
hardware platforms, such as optical neural networks (ONNs), for high parallel,
low latency, and low energy computing to boost deep learning acceleration
Computational advancements in the D-bar reconstruction method for 2-D electrical impedance tomography
2016 Spring.Includes bibliographical references.We study the problem of reconstructing 2-D conductivities from boundary voltage and current density measurements, also known as the electrical impedance tomography (EIT) problem, using the D-bar inversion method, based on the 1996 global uniqueness proof by Adrian Nachman. We focus on the computational implementation and efficiency of the D-bar algorithm, its application to finite-precision practical data in human thoracic imaging, and the quality and spatial resolution of the resulting reconstructions. The main contributions of this work are (1) a parallelized computational implementation of the algorithm which has been shown to run in real-time, thus demonstrating the feasibility of the D-bar method for use in real-time bedside imaging, and (2) a modification of the algorithm to include \emph{a priori} data in the form of approximate organ boundaries and (optionally) conductivity estimates, which we show to be effective in improving spatial resolution in the resulting reconstructions. These computational advancements are tested using both numerically simulated data as well as experimental human and tank data collected using the ACE1 EIT machine at CSU. In this work, we provide details regarding the theoretical background and practical implementation for each advancement, we demonstrate the effectiveness of the algorithm modifications through multiple experiments, and we provide discussion and conclusions based on the results
Realtime photoacoustic microscopy in vivo with a 30-MHz ultrasound array transducer
We present a novel high-frequency photoacoustic microscopy system capable of imaging the microvasculature of living subjects in realtime to depths of a few mm. The system consists of a high-repetition-rate Q-switched pump laser, a tunable dye laser, a 30-MHz linear ultrasound array transducer, a multichannel high-frequency data acquisition system, and a shared-RAM multi-core-processor computer. Data acquisition, beamforming, scan conversion, and display are implemented in realtime at 50 frames per second. Clearly resolvable images of 6-µm-diameter carbon fibers are experimentally demonstrated at 80 µm separation distances. Realtime imaging performance is demonstrated on phantoms and in vivo with absorbing structures identified to depths of 2.5–3 mm. This work represents the first high-frequency realtime photoacoustic imaging system to our knowledge
Applications in GNSS water vapor tomography
Algebraic reconstruction algorithms are iterative algorithms that are used in many area including medicine, seismology or meteorology. These algorithms are known to be highly computational intensive. This may be especially troublesome for real-time applications or when processed by conventional low-cost personnel computers. One of these real time applications
is the reconstruction of water vapor images from Global Navigation Satellite System (GNSS) observations. The parallelization of algebraic reconstruction algorithms has the potential to diminish signi cantly the required resources permitting to obtain valid solutions in time to be used for nowcasting and forecasting weather models.
The main objective of this dissertation was to present and analyse diverse shared memory
libraries and techniques in CPU and GPU for algebraic reconstruction algorithms. It was concluded that the parallelization compensates over sequential implementations. Overall the GPU implementations were found to be only slightly faster than the CPU implementations, depending on the size of the problem being studied.
A secondary objective was to develop a software to perform the GNSS water vapor reconstruction using the implemented parallel algorithms. This software has been developed with success and diverse tests were made namely with synthetic and real data, the preliminary results shown to be satisfactory.
This dissertation was written in the Space & Earth Geodetic Analysis Laboratory (SEGAL) and was carried out in the framework of the Structure of Moist convection in high-resolution GNSS observations and models (SMOG) (PTDC/CTE-ATM/119922/2010) project funded by FCT.Algoritmos de reconstrução algébrica são algoritmos iterativos que são usados em muitas áreas
incluindo medicina, sismologia ou meteorologia. Estes algoritmos são conhecidos por serem bastante
exigentes computacionalmente. Isto pode ser especialmente complicado para aplicações
de tempo real ou quando processados por computadores pessoais de baixo custo. Uma destas
aplicações de tempo real é a reconstrução de imagens de vapor de água a partir de observações
de sistemas globais de navegação por satélite. A paralelização dos algoritmos de reconstrução
algébrica permite que se reduza significativamente os requisitos computacionais permitindo
obter soluções válidas para previsão meteorológica num curto espaço de tempo.
O principal objectivo desta dissertação é apresentar e analisar diversas bibliotecas e técnicas
multithreading para a reconstrução algébrica em CPU e GPU. Foi concluído que a paralelização
compensa sobre a implementações sequenciais. De um modo geral as implementações GPU
obtiveram resultados relativamente melhores que implementações em CPU, isto dependendo do
tamanho do problema a ser estudado. Um objectivo secundário era desenvolver uma aplicação
que realizasse a reconstrução de imagem de vapor de água através de sistemas globais de
navegação por satélite de uma forma paralela. Este software tem sido desenvolvido com sucesso
e diversos testes foram realizados com dados sintéticos e dados reais, os resultados preliminares
foram satisfatórios.
Esta dissertação foi escrita no Space & Earth Geodetic Analysis Laboratory (SEGAL) e foi realizada de acordo com o projecto Structure 01' Moist convection in high-resolution GNSS observations and models (SMOG) (PTDC / CTE-ATM/ 11992212010) financiado pelo FCT.Fundação para a Ciência e a Tecnologia (FCT
Accelerated CTIS Using the Cell Processor
The Computed Tomography Imaging Spectrometer (CTIS) is a device capable of simultaneously acquiring imagery from multiple bands of the electromagnetic spectrum. Due to the method of data collection from this system, a processing intensive reconstruction phase is required to resolve the image output. This paper evaluates a parallelized implementation of the Vose-Horton CTIS reconstruction algorithm using the Cell processor. In addition to demonstrating the feasibility of a mixed precision implementation, it is shown that use of the parallel processing capabilities of the Cell may provide a significant reduction in reconstruction time
- …