13 research outputs found

    A fast sparse block circulant matrix vector product

    Full text link
    In the context of computed tomography (CT), iterative image reconstruction techniques are gaining attention because high-quality images are becoming computationally feasible. They involve the solution of large systems of equations, whose cost is dominated by the sparse matrix vector product (SpMV). Our work considers the case of the sparse matrices being block circulant, which arises when taking advantage of the rotational symmetry in the tomographic system. Besides the straightforward storage saving, we exploit the circulant structure to rewrite the poor-performance SpMVs into a high-performance product between sparse and dense matrices. This paper describes the implementations developed for multi-core CPUs and GPUs, and presents experimental results with typical CT matrices. The presented approach is up to ten times faster than without exploiting the circulant structure.Romero Alcalde, E.; Tomás Domínguez, AE.; Soriano Asensi, A.; Blanquer Espert, I. (2014). A fast sparse block circulant matrix vector product. En Euro-Par 2014 Parallel Processing. Springer. 548-559. doi:10.1007/978-3-319-09873-9_46S548559Bian, J., Siewerdsen, J.H., Han, X., Sidky, E.Y., Prince, J.L., Pelizzari, C.A., Pal, X.: Evaluation of sparse-view reconstruction from flat-panel-detector cone-beam ct. Physics in Medicine and Biology 55, 6575–6599 (2010)Dalton, S., Bell, N.: CUSP: A C++ templated sparse matrix library version 0.4.0 (2014), http://cusplibrary.github.com/Feldkamp, L., Davis, L., Kress, J.: Practical cone-beam algorithm. Journal of the Optical Society of America 1, 612–619 (1984)Ganine, V., Legrand, M., Michalska, H., Pierre, C.: A sparse preconditioned iterative method for vibration analysis of geometrically mistuned bladed disks. Computers & Structures 87(5-6), 342–354 (2009)Hara, A.K., Paden, R.G., Silva, A.C., Kujak, J.L., Lawder, H.J., Pavlicek, W.: Iterative reconstruction technique for reducing body radiation dose at CT: Feasibility study. American Journal of Roentgenology 193, 764–771 (2009)Heroux, M.A., Bartlett, R.A., Howle, V.E., Hoekstra, R.J., Hu, J.J., Kolda, T.G., Lehoucq, R.B., Long, K.R., Pawlowski, R.P., Phipps, E.T., Salinger, A.G., Thornquist, H.K., Tuminaro, R.S., Willenbring, J.M., Williams, A., Stanley, K.S.: An overview of the Trilinos project. ACM Trans. Math. Softw. 31(3), 397–423 (2005)Im, E.J., Yelick, K., Vuduc, R.: Sparsity: Optimization framework for sparse matrix kernels. International Journal of High Performance Computing Applications 18(1), 135–158 (2004)Jones, E., Oliphant, T., Peterson, P., et al.: SciPy: Open source scientific tools for Python (2001), http://www.scipy.org/Kaveh, A., Rahami, H.: Block circulant matrices and applications in free vibration analysis of cyclically repetitive structures. Acta Mechanica 217(1-2), 51–62 (2011)Kourtis, K., Goumas, G., Koziris, N.: Optimizing sparse matrix-vector multiplication using index and value compression. In: Proceedings of the 5th Conference on Computing Frontiers, CF 2008, pp. 87–96. ACM, New York (2008)Krotkiewski, M., Dabrowski, M.: Parallel symmetric sparse matrix–vector product on scalar multi-core CPUs. Parallel Computing 36(4), 181–198 (2010)Lee, B., Vuduc, R., Demmel, J., Yelick, K.: Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply. In: International Conference on Parallel Processing, ICPP 2004, vol. 1, pp. 169–176 (2004)Leroux, J.D., Selivanov, V., Fontaine, R., Lecomte, R.: Accelerated iterative image reconstruction methods based on block-circulant system matrix derived from a cylindrical image representation. In: Nuclear Science Symposium Conference Record, NSS 2007, vol. 4, pp. 2764–2771. IEEE (2007)NVIDIA: CUSPARSE library (2014), https://developer.nvidia.com/cusparsePan, X., Sidky, E.Y., Vannier, M.: Why do commercial CT scanners still employ traditional, filtered back-projection for image reconstruction? Inverse Problems 25, 123009 (2008)Rodríguez-Alvarez, M.J., Soriano, A., Iborra, A., Sánchez, F., González, A.J., Conde, P., Hernández, L., Moliner, L., Orero, A., Vidal, L.F., Benlloch, J.M.: Expectation maximization (EM) algorithms using polar symmetries for computed tomography CT image reconstruction. Computers in Biology and Medicine 43(8), 1053–1061 (2013)Sheep, L., Vardi, Y.: Maximum likelihood reconstruction for emmision tomography. IEEE Transactions on Medical Imaging 1, 113–122 (1982)Sidky, E.Y., Pan, X.: Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization. Physics in Medicine and Biology 53, 4777–4807 (2008)Soriano, A., Rodríguez-Alvarez, M.J., Iborra, A., Sánchez, F., Carles, M., Conde, P., González, A.J., Hernández, L., Moliner, L., Orero, A., Vidal, L.F., Benlloch, J.M.: EM tomographic image reconstruction using polar voxels. Journal of Instrumentation 8, C01004 (2013)Thibaudeau, C., Leroux, J.D., Pratte, J.F., Fontaine, R., Lecomte, R.: Cylindrical and spherical ray-tracing for ct iterative reconstruction. In: 2011 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), pp. 4378–4381 (2011)Vuduc, R., Demmel, J.W., Yelick, K.A.: OSKI: A library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series 16(1), 521 (2005)Vuduc, R.W., Moon, H.-J.: Fast sparse matrix-vector multiplication by exploiting variable block structure. In: Yang, L.T., Rana, O.F., Di Martino, B., Dongarra, J. (eds.) HPCC 2005. LNCS, vol. 3726, pp. 807–816. Springer, Heidelberg (2005)Williams, S., Oliker, L., Vuduc, R., Shalf, J., Yelick, K., Demmel, J.: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 35(3), 178–194 (2009

    Aero-Thermo-Mechanical Modelling and Validation of Transient Effects in a High Pressure Turbine Internal Air System

    No full text
    Accurate prediction of metal temperatures, blade tip and seal clearances in high pressure compressor and turbine air systems leads to dramatic improvements in overall aero-engine efficiency and component life. Fast transients during an engine flight profile may introduce large changes in geometry between adjacent rotor and stator components. The changing dynamics in a few critical seals and interfaces can change the dynamics of the entire engine, compromising efficiency, integrity and long service life. In this paper we present results of a coupled aero-thermo-mechanical transient simulation of a high pressure turbine assembly throughout an engine flight cycle. The fluid and solid model geometry is approximated as 2D axisymmetric. The problem, formulated as a four field coupled multiphysics system of equations, is simplified using a quasi-steady state assumption: the transient thermal solid problem is coupled to a sequence of steady fluid problems and static structural problems. The predicted results compare well with the experimental measurements over the entire fluid-solid interface. We show improvement in transient predictions isolating the effects of the solid domain deformation. The remaining transient error is linked to temporal uncertainties in the fluid model inlet boundary conditions which were taken as a spatial average of the main annulus flow conditions

    Nonlinear acceleration of coupled fluid-structure transient thermalproblems by Anderson mixing

    No full text
    Conjugate heat-transfer problems are typically solved using partitioned methods where fluid and solid subdomains are evaluated separately by dedicated solvers coupled through a common boundary. Strongly coupled schemes for transient analysis require fluid and solid problems to be solved many times each time step until convergence to a steady state. In many practical situations, a fairly simple and frequently employed fixed-point iteration process is rather ineffective; it leads to a large number of iterations per time step and consequently to long simulation times. In this article, Anderson mixing is proposed as a fixed-point convergence acceleration technique to reduce computational cost of thermal coupled fluid–solid problems. A number of other recently published methods with applications to similar fluid–structure interaction problems are also reviewed and analyzed. Numerical experiments are presented to illustrate relative performance of these methods on a test problem of rotating pre-swirl cavity air flow interacting with a turbine disk. It is observed that performance of Anderson mixing method is superior to that of other algorithms in terms of total iteration counts. Additional computational savings are demonstrated by reusing information from previously solved time steps

    Coupled Fluid-Structure Transient Thermal Analysis of a Gas Turbine Internal Air System With Multiple Cavities

    No full text
    This paper presents the transient aerothermal analysis of a gas turbine internal air system through an engine flight cycle featuring multiple fluid cavities that surround a HP turbine disk and the adjacent structures. Strongly coupled fluid-structure thermal interaction problems require significant computational effort to resolve nonlinearities on the interface for each time step. Simulation times may grow impractical if multiple fluid domains are included in the analysis. A new strategy is employed to decrease the cost of coupled aerothermal analysis. Significantly lower fluid domain solver invocation counts are demonstrated as opposed to the traditional coupling approach formulated on the estimates of heat transfer coefficient. Numerical results are presented using 2D finite element conduction model combined with 2D flow calculation in five separate cavities interconnected through the inlet and outlet boundaries. The coupled solutions are discussed and validated against a nominal stand-alone model. Relative performance of both coupling techniques is evaluated
    corecore