30 research outputs found

    Multigrid Backprojection Super-Resolution and Deep Filter Visualization

    Full text link
    We introduce a novel deep-learning architecture for image upscaling by large factors (e.g. 4x, 8x) based on examples of pristine high-resolution images. Our target is to reconstruct high-resolution images from their downscale versions. The proposed system performs a multi-level progressive upscaling, starting from small factors (2x) and updating for higher factors (4x and 8x). The system is recursive as it repeats the same procedure at each level. It is also residual since we use the network to update the outputs of a classic upscaler. The network residuals are improved by Iterative Back-Projections (IBP) computed in the features of a convolutional network. To work in multiple levels we extend the standard back-projection algorithm using a recursion analogous to Multi-Grid algorithms commonly used as solvers of large systems of linear equations. We finally show how the network can be interpreted as a standard upsampling-and-filter upscaler with a space-variant filter that adapts to the geometry. This approach allows us to visualize how the network learns to upscale. Finally, our system reaches state of the art quality for models with relatively few number of parameters.Comment: Spotlight paper in the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19

    ์—ญ์—ฐ์‚ฐ์— ๊ธฐ๋ฐ˜ํ•œ ํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง์˜ ์„ค๋ช… ๋ฐ ์‹œ๊ฐํ™”

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2021.8. ๊ถŒํ˜์ง„.Interpretability and explainability of machine learning systems have received ever-increasing attention, especially for convolutional neural networks (CNN). Although there are various interpretation techniques for learning algorithms, post-hoc local explanation methods (e.g., the attribution method that visualizes pixel-level contribution of input to its corresponding result) are under great interest because they can deal with the high dimensional parameters and nonlinear operations of CNNs. Therefore, this dissertation presents three new post-hoc local explanation methods to visualize and understand the working mechanisms of CNNs. At first, this dissertation presents a new method called guided nonlinearity (GNL) that improves the performance of attribution by backpropagating only positive gradients through nonlinear operations. GNL is inspired by the mechanism of action potential (AP) generation in the postsynaptic neuron that depends on the sum of excitatory (EPSP) and inhibitory postsynaptic potentials (IPSP). This dissertation assumes that paths consisting of excitatory synapses faithfully reflect the contributions of inputs to the output. Then this assumption is applied to CNNs by allowing only positive gradients backpropagate through nonlinear operations. Experimental results have shown that GNL outperforms existing methods for computing attributions in terms of the deletion metrics and yields fine-grained and human-interpretable attributions. However, the attributions from existing methods, including GNL, lack a common theoretical background and sometimes give contradicting results. To address this problem, this dissertation develops the operation-wise inverse method that computes the inverse of prediction in an operation-wise manner by considering that CNNs can be decomposed with four fundamental operations (convolution, max-pooling, ReLU, and fully-connected). The operation-wise inverse process assumes that the forward-pass of CNN is a sequential propagation of physical quantities that indicate the magnitude of specific image features. The inverses of fundamental operations are formulated as constrained optimization problems that inverse results should generate output features consistent with the forward-pass. Then, the inverse of prediction is computed by sequentially applying inverses of fundamental operations of CNN. Experimental results show that the proposed operation-wise approach can be a reference tool for computing attributions because it can provide equivalent visualization results to several conventional methods, and the attributions from the operation-wise method achieve state-of-the-art performances in terms of deletion score. Although the operation-wise method can provide a reference framework to compute attributions, applying the attribution concept to CNNs with multiple-valued predictions has not yet been addressed because the computation of attribution requires a single scalar value represents the prediction. To address this problem, this dissertation proposes the layer-wise inverse-based approach by decomposing CNNs into a set of layers that process only positive values that can be interpreted as neural activations. Then, the inverses of layers are formulated as constrained optimization problems that identify activations-of-interest in lower-layers. Then, the inverse of prediction is computed by sequentially applying inverses of layers of CNN as in the operation-wise method. Experimental results show that the proposed layer-wise inverse-based method can analyze CNNs for classification and regression in the same framework. Especially for the case of regression, the layer-wise approach showed that conventional CNNs for single image super-resolution overlook a portion of frequency bands that may result in performance degradation.ํ•ด์„ ๊ฐ€๋Šฅํ•œ ๊ธฐ๊ณ„ํ•™์Šต ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์€ ์ตœ๊ทผ ๋งŽ์€ ๊ด€์‹ฌ์„ ๋ฐ›๊ณ  ์žˆ์œผ๋ฉฐ, ์ด ์ค‘ ํ•ฉ์„ฑ๊ณฑ์‹ ๊ฒฝ๋ง (CNN)์˜ ์„ค๋ช… ๋ฐ ์‹œ๊ฐํ™”๋Š” ์ฃผ์š”ํ•œ ์—ฐ๊ตฌ์ฃผ์ œ๋กœ์„œ ์ทจ๊ธ‰๋˜๊ณ  ์žˆ๋‹ค. ๊ธฐ๊ณ„ํ•™์Šต ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ดํ•ดํ•˜๊ธฐ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ๋ฐฉ๋ฒ• ์ค‘ ํŠนํžˆ ์ฃผ์–ด์ง„ ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์˜ˆ์ธก ๊ฒฐ๊ณผ์— ๋Œ€ํ•œ ์ž…๋ ฅ์˜ ๊ธฐ์—ฌ๋„๋ฅผ ์‹œ๊ฐํ™”ํ•˜๋Š” ๊ท€์ธ (attribution)๊ณผ ๊ฐ™์€ ์‚ฌํ›„๊ฒ€์ • (post-hoc) ๊ตญ์†Œ์„ค๋ช… (local explanation) ๋ฐฉ๋ฒ•์€ ๊ณ ์ฐจ์› ๋งค๊ฐœ ๋ณ€์ˆ˜๋ฅผ ๊ฐ€์ง„ ๋น„์„ ํ˜• ํ•จ์ˆ˜์— ์ ์šฉํ•  ์ˆ˜ ์žˆ์–ด์„œ CNN์˜ ์„ค๋ช… ๋ฐ ์‹œ๊ฐํ™”์˜ ์ฃผ์š”ํ•œ ๋ฐฉ๋ฒ•์œผ๋กœ ์‚ฌ์šฉ๋˜๊ณ  ์žˆ๋‹ค. ์ด์— ๋”ฐ๋ผ ๋ณธ ๋…ผ๋ฌธ์€ CNN์˜ ์ž‘๋™ ์›๋ฆฌ๋ฅผ ์‹œ๊ฐํ™”ํ•˜๊ณ  ์ดํ•ดํ•˜๋Š”๋ฐ ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ๋Š” ์„ธ ๊ฐ€์ง€ ์‚ฌํ›„๊ฒ€์ • ๊ตญ์†Œ์„ค๋ช… ๋ฐฉ๋ฒ•๋“ค์„ ์ œ์‹œํ•œ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋กœ, ๋ณธ ๋…ผ๋ฌธ์€ ๋น„์„ ํ˜• ์—ฐ์‚ฐ์˜ ์–‘์˜ ๊ธฐ์šธ๊ธฐ (positive valued gradient)๋งŒ ์—ญ์ „ํŒŒ (backpropagation)ํ•˜์—ฌ ๊ท€์ธ ์„ฑ๋Šฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ์œ ๋„๋œ๋น„์„ ํ˜•๋ฒ• (guided nonlinearity method)์„ ์ œ์‹œํ•œ๋‹ค. ์œ ๋„๋œ๋น„์„ ํ˜•๋ฒ•์˜ ์„ค๊ณ„๋Š” ํฅ๋ถ„์„ฑ ๋ฐ ์–ต์ œ์„ฑ ์‹œ๋ƒ…์Šค ํ›„ ์ „์œ„์˜ ํ•ฉ์— ์˜์กดํ•˜๋Š” ์‹œ๋ƒ…์Šค ํ›„ ๋‰ด๋Ÿฐ์˜ ํ™œ๋™ ์ „์œ„ ์ƒ์„ฑ ๋ฉ”์ปค๋‹ˆ์ฆ˜์œผ๋กœ๋ถ€ํ„ฐ ๋น„๋กฏ๋˜์—ˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ํฅ๋ถ„์„ฑ ์‹œ๋ƒ…์Šค๋กœ ๊ตฌ์„ฑ๋œ ๊ฒฝ๋กœ๊ฐ€ ์ถœ๋ ฅ์— ๋Œ€ํ•œ ์ž…๋ ฅ์˜ ๊ธฐ์—ฌ๋„๋ฅผ ์ถฉ์‹คํ•˜๊ฒŒ ๋ฐ˜์˜ํ•˜๊ณ  ์žˆ๋‹ค๊ณ  ๊ฐ€์ •ํ•˜์˜€๋‹ค. ๊ทธ ํ›„, ๋ณธ ๋…ผ๋ฌธ์€ ๋น„์„ ํ˜• ์—ฐ์‚ฐ์˜ ์–‘์˜ ๊ธฐ์šธ๊ธฐ๋งŒ ์—ญ์ „ํŒŒ ๋˜๋„๋ก ํ—ˆ์šฉํ•จ์œผ๋กœ์จ ์ด ๊ฐ€์ •์„ CNN์˜ ์„ค๋ช… ๋ฐ ์‹œ๊ฐํ™”์— ์ ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ตฌํ˜„ํ•˜์˜€๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์‹คํ—˜์„ ํ†ตํ•ด, ์ œ์•ˆ๋œ ์œ ๋„๋œ๋น„์„ ํ˜•๋ฒ•์ด ์‚ญ์ œ์ฒ™๋„ (deletion metric) ์ธก๋ฉด์—์„œ ๊ธฐ์กด์˜ ๋ฐฉ๋ฒ•๋“ค๋ณด๋‹ค ํ–ฅ์ƒ๋œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋ฉฐ ํ•ด์„ ๊ฐ€๋Šฅํ•˜๊ณ  ์„ธ๋ฐ€ํ•œ (fine-grained) ๊ท€์ธ์„ ์‚ฐ์ถœํ•จ์„ ๋ณด์˜€๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์œ ๋„๋œ๋น„์„ ํ˜•๋ฒ•์„ ํฌํ•จํ•œ ๊ธฐ์กด์˜ ๊ท€์ธ ๋ฐฉ๋ฒ•๋“ค์€ ์„œ๋กœ ๋‹ค๋ฅธ ์ด๋ก ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์„ค๊ณ„๋˜์—ˆ์œผ๋ฉฐ, ์ด๋กœ ์ธํ•˜์—ฌ ์„œ๋กœ ๋ชจ์ˆœ๋˜๋Š” ๊ท€์ธ๋“ค์„ ๊ณ„์‚ฐํ•˜๋Š” ๋•Œ๋„ ์žˆ๋‹ค. ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” CNN์ด ํ•ฉ์„ฑ๊ณฑ (convolution), ์ตœ๋Œ€ํ’€๋ง (max-pooling), ReLU, ์ „์—ฐ๊ฒฐ (full-connected)์˜ 4๊ฐ€์ง€ ๊ธฐ๋ณธ ์—ฐ์‚ฐ๋“ค์˜ ํ•ฉ์„ฑํ•จ์ˆ˜๋กœ ํ‘œํ˜„๋  ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์— ๊ธฐ๋ฐ˜ํ•˜์—ฌ, CNN์„ ํ†ตํ•œ ์˜ˆ์ธก์˜ ์—ญ์ƒ (inverse image)์„ ๊ธฐ๋ณธ ์—ฐ์‚ฐ๋“ค์˜ ์—ญ์—ฐ์‚ฐ์„ ํ†ตํ•ด ๊ณ„์‚ฐํ•˜๋Š” ์—ฐ์‚ฐ๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ• (operation-wise inverse-based method)์„ ์ œ์•ˆํ•œ๋‹ค. ์—ฐ์‚ฐ๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์€ CNN์˜ ์ •๋ฐฉํ–ฅ์ง„ํ–‰ (forward-pass)์„ ํŠน์ • ์ด๋ฏธ์ง€ํŠน์ง• (image feature)์˜ ํฌ๊ธฐ๋ฅผ ์˜๋ฏธํ•˜๋Š” ๋ฌผ๋ฆฌ๋Ÿ‰์˜ ์ˆœ์ฐจ์  ์ „ํŒŒ๋กœ ๊ฐ€์ •ํ•œ๋‹ค. ์ด ๊ฐ€์ •ํ•˜์— ์—ฐ์‚ฐ๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์€ ๊ณ„์‚ฐ๋œ ์—ญ์ƒ์ด ๊ธฐ์กด์˜ ์ •๋ฐฉํ–ฅ์ง„ํ–‰ ๊ฒฐ๊ณผ์™€ ๋ชจ์ˆœ๋˜์ง€ ์•Š๋„๋ก ์„ค๊ณ„๋œ ์ œํ•œ๋œ ์ตœ์ ํ™” ๋ฌธ์ œ (constrained optimization problem)๋ฅผ ํ†ตํ•ด ๊ธฐ๋ณธ ์—ฐ์‚ฐ์˜ ์—ญ์—ฐ์‚ฐ์„ ๊ณ„์‚ฐํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์‹คํ—˜์„ ํ†ตํ•ด ์—ฐ์‚ฐ๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์ด ๊ธฐ์กด์˜ ์—ฌ๋Ÿฌ ๊ท€์ธ ๋ฐฉ๋ฒ•๋“ค๋ณด๋‹ค ์‚ญ์ œ์ฒ™๋„ ์ธก๋ฉด์—์„œ ํ–ฅ์ƒ๋˜์—ˆ์œผ๋ฉด์„œ๋„ ์งˆ์  ์ธก๋ฉด์—์„œ ์œ ์‚ฌํ•œ ์‹œ๊ฐํ™” ๊ฒฐ๊ณผ๋ฅผ ์ œ๊ณตํ•˜๋Š” ๊ฒƒ์„ ๋ณด์ž„์œผ๋กœ์จ ์—ฐ์‚ฐ๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์ด ๊ท€์ธ๊ณ„์‚ฐ์˜ ๊ณตํ†ต ํ”„๋ ˆ์ž„ ์›Œํฌ (reference framework)๋กœ ์‚ฌ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค. ํ•œํŽธ, ์˜์ƒ ๋ถ„๋ฅ˜ ๋ฌธ์ œ์™€ ๊ฐ™์ด ๋‹จ์ผ ์˜ˆ์ธก์„ ๋Œ€์ƒ์œผ๋กœ ํ•œ CNN๊ณผ๋Š” ๋‹ฌ๋ฆฌ ๋ณต์ˆ˜์˜ ์˜ˆ์ธก๊ฐ’์„ ๊ฐ€์ง€๋Š” CNN์— ๋Œ€ํ•˜์—ฌ ๊ท€์ธ๊ณ„์‚ฐ์„ ์‹œ๋„ํ•œ ์—ฐ๊ตฌ๋Š” ํ˜„์žฌ๊นŒ์ง€ ๋ณด๊ณ ๋˜์ง€ ์•Š์•˜๋‹ค. ์ด๋Š” ๊ธฐ์กด์˜ ๊ท€์ธ ๊ณ„์‚ฐ๋ฐฉ๋ฒ•๋“ค์€ CNN์— ๋Œ€ํ•˜์—ฌ ๋‹จ์ผ ์Šค์นผ๋ผ (scalar) ๊ฐ’์„ ์ถœ๋ ฅํ•˜๋„๋ก ์š”๊ตฌํ•˜๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ์ด ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๊ณ„์ธต๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ• (layer-wise inverse-based method)์„ ์ œ์•ˆํ•œ๋‹ค. ๊ณ„์ธต๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์€ CNN์„ ์ธ๊ณต ๋‰ด๋Ÿฐ์˜ ํ™œ์„ฑ๊ฐ’ (neural activation)์œผ๋กœ ํ•ด์„ํ•  ์ˆ˜ ์žˆ๋Š” ์–‘์˜ ์‹ค์ˆ˜๋“ค์„ ์ž…์ถœ๋ ฅ์œผ๋กœ ํ•˜๋Š” ๊ณ„์ธต (layer)์œผ๋กœ ๋ถ„ํ•ดํ•˜๊ณ , ์ œํ•œ๋œ ์ตœ์ ํ™” ๋ฌธ์ œ๋กœ ์ •์˜๋˜๋Š” ๊ฐ ๊ณ„์ธต์˜ ์—ญ์—ฐ์‚ฐ์„ ์ •๋ฐฉํ–ฅ์ง„ํ–‰ ๊ฒฐ๊ณผ์— ์ˆœ์ฐจ์ ์œผ๋กœ ์ ์šฉํ•จ์œผ๋กœ์จ CNN์„ ํ†ตํ•œ ์˜ˆ์ธก์˜ ์—ญ์ƒ์„ ๊ณ„์‚ฐํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์€ ์‹คํ—˜์„ ํ†ตํ•ด, ์ œ์•ˆ๋œ ๊ณ„์ธต๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์ด ์˜์ƒ ๋ถ„๋ฅ˜ ๋ฐ ํšŒ๊ธฐ๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ•œ CNN๋“ค์˜ ์„ค๋ช… ๋ฐ ์‹œ๊ฐํ™”๋ฅผ ๋™์ผํ•œ ํ”„๋ ˆ์ž„ ์›Œํฌ (common framework)์—์„œ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์Œ์„ ํ™•์ธํ•˜์˜€๋‹ค. ๋˜ํ•œ, ๋ณธ ๋…ผ๋ฌธ์€ ๊ณ„์ธต๋ณ„์—ญ์—ฐ์‚ฐ๋ฒ•์„ ํ†ตํ•ด ๋‹จ์ผ ์˜์ƒ ๊ณ ํ•ด์ƒํ™” (single image super-resolution)๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ•œ CNN์ธ VDSR์ด ์ž…๋ ฅ ์˜์ƒ์˜ ์ฃผํŒŒ์ˆ˜ ๋Œ€์—ญ์˜ ์ผ๋ถ€๋ฅผ ๊ฐ„๊ณผํ•˜๊ณ  ์žˆ๊ณ  ์ด๋Š” VDSR์„ ํ†ตํ•œ ๊ณ ํ•ด์ƒํ™”์‹œ ํŠน์ • ์ฃผํŒŒ์ˆ˜ ๋Œ€์—ญ์—์„œ ์˜์ƒ ํ’ˆ์งˆ์˜ ํ•˜๋ฝ์„ ์œ ๋ฐœํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค.Contents 1 List of Tables 4 List of Figures 5 1 Introduction 7 1.1 Guided Nonlinearity 8 1.2 Inverse-based approach 9 1.2.1 Operation-wise method 10 1.2.2 Layer-wise method 11 1.3 Outline 14 2 RelatedWork 15 2.1 Activation-based approach 15 2.2 Perturbation-based approach 16 2.3 Backpropagation-based approach 17 2.4 Inverse-based approach 18 3 Guided Nonlinearity 19 3.1 Motivation and Overview 19 3.2 Proposed Guided Non-linearity 23 3.2.1 Integrated Gradients 23 3.2.2 Postulations 23 3.2.3 Proposed method 24 3.3 Experimental Results 27 3.3.1 Evaluation Metrics 29 3.3.2 Experiment details 29 3.3.3 Results and Discussions 30 3.4 Summary 30 4 Operation-wise Approach 32 4.1 Motivation and Overview 32 4.2 Proposed Method 35 4.2.1 Problem statement 36 4.2.2 Proposed constraints 36 4.2.3 Mathematical formulation 37 4.3 Implementation details 38 4.3.1 Inverse of ReLU and Max Pooling 38 4.3.2 Inverse of Fully Connected and Convolution Layers 39 4.4 Experimental Settings 40 4.4.1 Qualitative results 40 4.4.2 Quantitative Results 46 4.5 Summary 50 5 Layer-wise Approach 51 5.1 Motivation and Overview 51 5.2 Formulation of the Proposed Inverse Approach 55 5.2.1 Activation range 56 5.2.2 Minimal activation 56 5.2.3 Linear approximation 57 5.2.4 Layer-wise inverse 57 5.3 Details of inverse computation 59 5.3.1 Convolution block (linear part) + ReLU 59 5.3.2 Max-pooling layer 60 5.3.3 Fully connected block (linear part) + ReLU 61 5.3.4 Fully connected block (linear part) + Softmax 61 5.4 Application to the ImageNet classification task 62 5.4.1 Evaluation of output-reconstruction in terms of input-simplicity 62 5.4.2 Deletion and insertion scores 63 5.4.3 Selection of the regularization term weight 64 5.4.4 Comparison to Existing Methods 67 5.4.5 Output-reconstruction versus input-simplicity plot 68 5.4.6 Ablation study of the activation regularization 72 5.5 The inverse of single image super-resolution network 72 5.5.1 Experimental setting 72 5.5.2 Selection of the regularization term weight 74 5.5.3 Evaluation of the proposed inverse process 77 5.5.4 Frequency domain analysis of attribution 77 5.6 Summary 81 6 Conclusions 82 Bibliography 84 Abstract (In Korean) 95๋ฐ•

    Intuitively Assessing ML Model Reliability through Example-Based Explanations and Editing Model Inputs

    Full text link
    Interpretability methods aim to help users build trust in and understand the capabilities of machine learning models. However, existing approaches often rely on abstract, complex visualizations that poorly map to the task at hand or require non-trivial ML expertise to interpret. Here, we present two visual analytics modules that facilitate an intuitive assessment of model reliability. To help users better characterize and reason about a model's uncertainty, we visualize raw and aggregate information about a given input's nearest neighbors. Using an interactive editor, users can manipulate this input in semantically-meaningful ways, determine the effect on the output, and compare against their prior expectations. We evaluate our interface using an electrocardiogram beat classification case study. Compared to a baseline feature importance interface, we find that 14 physicians are better able to align the model's uncertainty with domain-relevant factors and build intuition about its capabilities and limitations

    Optimization of the Parameters of the YAP-(S)PETII Scanner for SPECT Acquisition

    Get PDF
    Abstract Single Photon Emission Computed Tomography (SPECT) could be considered as a milestone in terms of biomedical imaging technique, which visualizes Functional processes in-vivo, based on the emission of gamma rays produced within the body. The most distinctive feature of SPECT from other imaging modalities is that it is based on the tracer principle, discovered by George Charles de Hevesy in the first decade of the twentieth century. As known by everyone, the metabolism of an organism is composed of atoms within a molecule which can be replaced by one of its radioactive isotopes. By using this principle, we are able to follow and detect pathways of the photons which are emitted from the radioactive element inside the metabolism. SPECT produces images by using a gamma camera which consists of two major functional components, the collimator and the radiation detector. The collimator is a thick sheet of a heavy metal like lead, tungsten of gold with densely packed small holes and is put just in front of the photon detector. The radiation detector converts the gamma rays into scintillation light photons. In conventional SPECT, scanners utilize a parallel hole collimator. Defining a small solid angle, each collimator hole is located somewhere along this line and the photons might reach the detector by passing through these holes. Subsequently, we can create projection images of the radioisotope distribution. The quantity of photons which come to the radiation detector through the collimator holes specifies the image quality regarding signal to noise ratio. One of the crucial parts of all SPECT scanners is the collimator design. The main part of this dissertation is to investigate performance characteristics of YAP-(S)PETII scanner collimator and to obtain collimator characteristics curves for optimization purposes. Before starting the collimator performance investigation of YAP-(S)PETII scanner, we first performed simulation of it in SPECT mode with point source Tc-99m to measure collimator and system efficiency by using GATEโ€“the Geant4 Application for Emission Tomography. GATE is an advanced, flexible, precise, opensource Monte Carlo toolkit developed by the international OpenGATE collaboration and dedicated to the numerical simulations in medical imaging. We obtained the results of collimator and system efficiency in terms of collimator length, radius and septa by using GATE_v4. Then, we compared our results with analytical formulation of efficiency and resolution. For those simulation experiments, we found that the difference between the simulated and the analytical results with regard to approximated geometrical collimator efficiency formulation of H. Anger, is within 20%. Then, we wrote a new ASCII sorter algorithm, which reads ASCII output of GATE_v4 and then creates a sinogram and reconstructs it to see the final simulation results. At the beginning, we used the analytical reconstruction method, filtered back projection (FBP), but this method produces severely blurred images. To solve this problem and increase our image quality, we tried different mathematical filters, like ramp, sheep-logan, low-pass cosine filters. After all of those studies mentioned above, we learned that GATE_v4 is not practical to measure collimator efficiency and resolution. On the other hand, the results of GATE_v4 did not show directly septal penetrated photon ratio. Under the light of these findings, we decided to develop a new user-friendly ray tracing program for optimization of low energy general purpose (LEGP) parallelhole collimators. In addition, we tried to evaluate the image quality and quantify the impact of high-energy contamination for I-123 isotope imaging. Due to its promising chemical characteristics, Iodine-123 is increasingly used in SPECT studies. 159 keV photons are used for imaging, however, high-energy photons result in an error in the projection data primarily by penetration of the collimator and scattering inside the crystal with energy close to the photons used for imaging. One of the way to minimize this effect is using a double energy window (DEW) method, because, it decreases noise in main (sensitive) energy window. By using this method, we tried to determine the difference between simulated and experimental projection results and scattered photon ratio (Sk) value of YAP-(S)PETII scanner for I-123 measurements. The main drawback of GATE simulations is that they are CPU-intensive. In this dissertation to handle this problem, we did the feasibility study of the Fully Monte Carlo based implementation of the system matrix derivation of YAP-(S)PETII scanner by using XtreemOS platform. To manage lifecycle of the simulation on the top XtreemOS, we developed a set of scripts. The main purpose of our study is to integrate a distributed platform like XtreemOS to reduce the overall simulation completion time and increase the feasibility of SPECT simulations in a research environment and establish an accurate and fast method for deriving the system matrix of the YAP-(S)PETII scanner by using Monte Carlo simulation approach. We developed also the ML-EM Algorithm to the reconstruct our GATE simulation results and to derive the system matrix directly from GATE output. In addition to the accuracy consideration, we intend to develop a flexible matrix derivation method and GATE output reconstruction tool
    corecore