3 research outputs found

    Two-level compact implicit schemes for three-dimensional parabolic problems.

    Get PDF
    We derive a class of two-level high-order implicit finite difference schemes for solving three-dimensional parabolic problems with mixed derivatives. The schemes are fourth-order accurate in space and second- or lower-order accurate in time depending on the choice of a weighted average parameter ฮผ. Numerical results with ฮผ=0.5 are presented to confirm the high accuracy of the derived scheme and to compare it with the standard second-order central difference scheme. It is shown that the improvement in accuracy does not come at a higher cost of computation and storage since it is possible to choose the grid parameters so that the present scheme requires less work and memory and gives more accuracy than the standard central difference scheme

    Equity Swaps์— ๋Œ€ํ•œ ๊ณ ์ฐจ์ˆ˜๋ ด ์œ ํ•œ์ฐจ๋ถ„๋ฒ•๊ณผ OpenCL์„ ์ด์šฉํ•œ Heterogeneous ์ปดํ“จํŒ…

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ํ˜‘๋™๊ณผ์ • ๊ณ„์‚ฐ๊ณผํ•™ ์ „๊ณต, 2013. 8. ์‹ ๋™์šฐ.๋ณธ ํ•™์œ„ ๋…ผ๋ฌธ์—์„œ๋Š” Equity ์Šค์™‘ ๋ชจ๋ธ์— ๋Œ€ํ•œ 4์ฐจ ์ˆ˜๋ ด ์œ ํ•œ์ฐจ๋ถ„๋ฒ•์„ ์ œ์•ˆํ•˜์˜€๋‹ค. ํŠนํžˆ Equity ์Šค์™‘ ๋ชจ๋ธ์€ ์‹œ๊ฐ„๊ณผ ๊ณต๊ฐ„์— ์ข…์†ํ•˜๋Š” ๊ณ„์ˆ˜๋“ค์„ ๊ฐ€์ง€๊ณ  ์žˆ๊ธฐ ๋•Œ๋ฌธ์—, 4์ฐจ ์ˆ˜๋ ด ์œ ํ•œ์ฐจ๋ถ„๋ฒ•์„ ์œ ๋„ํ•˜๊ธฐ ์œ„ํ•˜์—ฌ ํŠน๋ณ„ํ•œ ์ขŒํ‘œ ๋ณ€ํ™˜์„ ๊ณ ๋ คํ•˜์˜€๋‹ค. ์ด ์ขŒํ‘œ ๋ณ€ํ™˜์€ ํŽธ๋ฏธ๋ถ„ ๋ฐฉ์ •์‹์—์„œ ๊ต์ฐจ๋ฏธ๋ถ„์„ ์ œ๊ฑฐํ•˜๋Š” ๊ฒƒ์œผ๋กœ, ์—ฌ๋Ÿฌ ์˜ˆ์ œ๋“ค์„ ํ†ตํ•ด ๊ทธ ์ˆ˜๋ ด์„ฑ์„ ๊ฒ€์ฆํ•˜์˜€๋‹ค. ๋Œ€๋ถ€๋ถ„์˜ ์„ ํ˜•ํ•ด๋ฒ•๋“ค์€ BLAS ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ธฐ๋ฐ˜ํ•˜์—ฌ ๊ตฌ์„ฑ๋˜์–ด์žˆ๊ธฐ ๋•Œ๋ฌธ์—, CPU์™€ GPU๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ BLAS๋ฅผ ๋ณ‘๋ ฌํ™” ํ•˜๋Š” ์—ฐ๊ตฌ๋ฅผ ์ˆ˜ํ–‰ํ•˜์˜€๋‹ค. ์ด๊ฒƒ์€ CPU์™€ GPU์— ์–ด๋–ป๊ฒŒ ์ž‘์—…์„ ๋ถ„๋ฐฐํ•  ๊ฒƒ์ธ๊ฐ€์˜ ๋ฌธ์ œ๋กœ ๊ท€๊ฒฐ๋˜๊ณ , ๋ถ„๋ฐฐํ•˜๋Š” ์ง€์ ์€ ๊ฐ ๊ณ„์‚ฐ์ž์›์—์„œ ์†Œ์š”๋˜๋Š” ๊ณ„์‚ฐ์‹œ๊ฐ„์˜ ์ตœ์†Œโ€“์ตœ๋Œ€ ๋ฌธ์ œ๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜ ์žˆ๋‹ค. CPU์™€ GPU์—์„œ ํŠน์ • BLAS๋ฅผ ๊ณ„์‚ฐํ•˜๋Š”๋ฐ ๊ฑธ๋ฆฌ๋Š” ์‹œ๊ฐ„์„ ๋‹คํ•ญํ•จ์ˆ˜์˜ ํ˜•ํƒœ๋กœ ์˜ˆ์ธกํ•จ์œผ๋กœ์จ, ์ตœ์†Œโ€“์ตœ๋Œ€ ๋ฌธ์ œ์™€ ์‹ค์ œ ๊ณ„์‚ฐ๊ฒฐ๊ณผ๋ฅผ ๋น„๊ต ๋ถ„์„ํ•˜์˜€๋‹ค.A nine-point compact finite difference scheme with fourth-order convergence is proposed for an equity swap model. In order to derive a compact scheme for the equity swap model, a special treatment is necessary to remove the mixed derivative term so that the resulting scheme is of fourth-order convergence as well as compactness. A suitable coordinate transformation is proposed to eliminate the mixed derivative term successfully. The resulting algorithm is shown to be a fourth order convergent scheme. Various examples confirm the validity of the proposed scheme. Since most of linear solvers consist of basic linear algebra subroutines (BLAS), we optimize computational performance by distributing a subroutine into CPU and GPU with some splitting ratio. We present this splitting ratio by means of a min-max problem concerning with computational times. Computational times for both CPU and GPU are estimated as polynomial functions based on their capabilities. BLAS saxpy, sgemv, and sgemm are implemented in OpenCL and we verified our min-max model with actual heterogeneous computing results.I A Higher-Order Finite-Difference Scheme for Equity Swaps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.1 Previous Studies . . . . . . . . . . . . . . . . . . . . . . . 5 1.2 Equity Swaps . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 Higher-order compact Finite difference scheme 13 2.1 Seeking a higher-order scheme . . . . . . . . . . . . 14 2.2 Coordinate transformation . . . . . . . . . . . . . . . . 17 2.3 A nine-point compact scheme . . . . . . . . . . . . . 19 3 Stability analysis . . . . . . . . . . . . . . . . . . . . . . . . 25 4 Numerical results . . . . . . . . . . . . . . . . . . . . . . . 31 5 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 II Heterogeneous Computing with OpenCL . . . . . . . .41 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 2 OpenCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 3 Implementation Issues . . . . . . . . . . . . . . . . . . . . 47 3.1 Concurrency in Heterogeneous Computing . . . . . 48 3.2 CPU Parking Protocol . . . . . . . . . . . . . . . . . . . . 52 4 Modeling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 4.1 Performance Estimations . . . . . . . . . . . . . . . . . 57 4.2 PCI express Bandwidth . . . . . . . . . . . . . . . . . . 59 5 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 6 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 A Hardware Parameters . . . . . . . . . . . . . . . . . . . . 69 Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71Docto
    corecore